Clozure CL doesn't have the CP866 encoding that the DBF files are using, and
the PostgreSQL version on Travis doesn't have "create schema if not exists",
which makes the tests fail there.
The previous patch exposed regression failures that had been hidden by
strange bugs with CCL.
One such regression was introduced in commit
ab7e77c2d00decce64ab739d0eb3d2ca5bdb6a7e, where we reworked the complex
code generation for field projection and the following two cases were no
longer cleanly processed:
    column text using "constant"
    column text using "field-name"
In the first case we want to load a user-defined constant into the column,
and in the second case we want to load the value of the field "field-name"
into the column: the two cases only differ in their source and target names.
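A minimal sketch of the distinction, as hypothetical generated projections
rather than pgloader's actual code:

    ;; constant case: the projected value ignores the input row entirely
    (lambda (row) (declare (ignore row)) "constant")

    ;; field case: the projected value is looked up in the input row
    ;; (FIELD-VALUE is a hypothetical accessor)
    (lambda (row) (field-value row "field-name"))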
Another regression was introduced in the recent commit
01e5c2376390749c2b7041b17b9a974ee8efb6b2, where the create-table function
was called too early, before we had fetched *pgsql-reserved-keywords*. As a
consequence, table names weren't always properly quoted, as shown by the
test/csv-header.load file, which targets a table named "group".
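A minimal sketch of why the keyword list matters, using a hypothetical
helper (pgloader's real quoting code is more involved):

    ;; "group" is a reserved keyword and must be double-quoted in SQL
    (defun maybe-quote (name)
      (if (member name *pgsql-reserved-keywords* :test #'string-equal)
          (format nil "\"~a\"" name)
          name))

Before the keyword list is fetched it is empty, so (maybe-quote "group")
returns the name unquoted and the generated CREATE TABLE statement fails.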
Finally, skip the test/dbf.load regression test when using CCL as this
environment doesn't have the necessary CP850 code page / encoding.
In this commit we fail the guess faster, which allows testing against a much
larger sample. The sample size is still hard-coded, but this time to 1000 lines.
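A minimal sketch of the fail-fast idea, with hypothetical names
(PARSE-LINE is assumed to return non-NIL on success):

    ;; reject a candidate at the first sample line it fails to parse,
    ;; rather than scoring the whole sample for every candidate: EVERY
    ;; short-circuits, which is what makes a 1000-line sample affordable
    (defun guess-format (sample candidates)
      (find-if (lambda (candidate)
                 (every (lambda (line)
                          (ignore-errors (parse-line line candidate)))
                        sample))
               candidates))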
Also add a test case, see #618.
The "magic" options --batch and --heap-reserve will be processed by CCL
itself before pgloader gets to see them, so try using them in the testing
environment.
The PostgreSQL COPY protocol requires an explicit initialization phase
that may fail, and when it does the Postmodern driver transaction is
already dead, so there's no way we can even send an ABORT to it.
Review the error handling of our copy-batch function to cope with that
fact, and add some logging of the non-retryable errors we may get.
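A minimal sketch of the shape that error handling can take, assuming the
cl-postgres COPY writer API and a hypothetical log-message function:

    (defun copy-batch (table columns batch)
      ;; send BATCH (a vector of rows) to PostgreSQL over the COPY protocol
      (handler-case
          (let ((copier (cl-postgres:open-db-writer pomo:*database*
                                                    table columns)))
            (unwind-protect
                 (loop for row across batch
                       do (cl-postgres:db-write-row copier row))
              (cl-postgres:close-db-writer copier)))
        (cl-postgres:database-error (e)
          ;; the server-side transaction is already aborted, so don't try
          ;; to send ABORT over it: log and re-signal so the caller can
          ;; decide whether the error is retryable
          (log-message :error "COPY error: ~a" e)
          (error e))))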
Also improve the thread error reporting when using a binary image, from
which it might be difficult to open an interactive debugger, while still
keeping the full-blown Common Lisp debugging experience for the project
developers.
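A sketch of that dispatch, with a hypothetical *interactive-debug* flag and
log-message function:

    (defun report-thread-error (condition)
      (if *interactive-debug*
          ;; development image: keep the full Common Lisp debugging experience
          (invoke-debugger condition)
          ;; standalone binary: print a readable report instead of expecting
          ;; the user to drive a debugger they can't easily reach
          (log-message :fatal "A thread died with: ~a" condition)))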
Add a test case for a missing column as in issue #339.
Fix #339, see #337.
The next parallelism improvements will allow pgloader to use more than one
COPY thread to load data, which has the side effect of changing the order
of rows in the database.
Rather than doing a copy out and `diff` of the data just loaded, load
the reference data and do the diff in SQL:
    select * from loaded.data
    except
    select * from expected.data
If such a query returns any row, we know we didn't load what was expected,
and the regression test fails.
This regression testing facility should also allow us to finally add
support for multiple-table regression tests (SQLite, MySQL, etc.).
To be able to use "t" (or "nil") as a column name, pgloader needs to be
able to generate Lisp code where those symbols are available. That's
simple enough: a Common Lisp package that doesn't :use :cl fulfills the
condition, so intern user symbols in a specially crafted package that
doesn't :use :cl.
Now, we still need to be able to run transformation code that uses the
:cl package symbols and the pgloader.transforms functions too. In this
commit we introduce a heuristic that picks each symbol either as a function
from pgloader.transforms or, for anything else, as a symbol in
pgloader.user-symbols.
And so that user code may use NIL too, we provide an override mechanism
to the intern-symbol heuristic and use it only when parsing user code,
not when producing Common Lisp code from the parsed load command.
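A minimal sketch of the heuristic, assuming the package names above and
that pgloader.transforms already exists (this is not pgloader's exact code):

    ;; a package with an empty use list: CL:T and CL:NIL are not visible
    ;; here, so "t" and "nil" intern as plain, unrelated symbols
    (defpackage #:pgloader.user-symbols (:use))

    (defun intern-symbol (name &optional overrides)
      ;; OVERRIDES is the escape hatch used when parsing user code,
      ;; e.g. to map "nil" to CL:NIL anyway
      (let* ((name   (string-upcase name))
             (forced (cdr (assoc name overrides :test #'string=))))
        (or forced
            (let ((symbol (find-symbol name '#:pgloader.transforms)))
              (if (and symbol (fboundp symbol))
                  symbol
                  (intern name '#:pgloader.user-symbols))))))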
The COPY TEXT format accepts non-printable characters as an escape
sequence wherein pgloader can pass in the octal number for the character
in its encoding. When doing that with a small number such as \6 and the
non-printable character is followed by digits in the data, we end up
sending e.g. \646, which might not even be part of the target encoding...
To fix, always left-pad the character's octal number with zeroes, so that
we now send \00646, which COPY knows how to read: the char at \006, then
4, then 6.
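A minimal sketch of the padding, using the standard FORMAT octal directive
(pgloader's actual escaping code may differ):

    ;; ~3,'0o prints the char code in octal, left-padded with zeroes
    ;; to three digits, so the escape sequence has a fixed width
    (defun escape-control-char (char)
      (format nil "\\~3,'0o" (char-code char)))

    (escape-control-char (code-char 6)) ; => the four characters \006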
Also copy the test case over to pgloader and run it in the test suite.
To allow importing JSON one-liners as-is into the database, it can be
interesting to leverage the CSV parser in a compatible setup. That setup
requires being able to use any separator character as the escape
character.
Some CSV files are given with a header line containing the list of their
column names; use that when given the option "csv header".
Note that when both the "skip header" and "csv header" options are used,
pgloader first skips the required number of lines and then uses the next
one as the CSV header.
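A minimal sketch of that ordering, with hypothetical names:

    ;; "skip header" lines are consumed first; only then is the next
    ;; line parsed as the list of column names
    (defun read-header (stream skip-lines)
      (loop repeat skip-lines do (read-line stream))
      (parse-csv-line (read-line stream))) ; PARSE-CSV-LINE is hypothetical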
Because of a temporary failure to install the `ronn` documentation tool,
this patch only commits the changes to the source docs and omits updating
the man page (pgloader.1). A following patch is intended to fix that.
See #236, which uses shell tricks to retrieve the field list from the CSV
file itself and motivated this patch to finally get written.
When given a file in the COPY format, we should expect that its content
is already properly escaped as expected by PostgreSQL. Rather than
unescaping the data and then escaping it again, add a new mode of operation
to format-vector-row in which it won't even try to reformat the data.
In passing, fix an off-by-one bug in dealing with non-ASCII characters.
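A minimal sketch of such a mode, with hypothetical names (the real
format-vector-row also deals with encodings and buffering):

    (defun format-row (stream row &key pre-formatted)
      ;; when PRE-FORMATTED, the fields are already escaped as PostgreSQL
      ;; expects, so pass them through verbatim instead of unescaping and
      ;; re-escaping them
      (loop for field across row
            for first = t then nil
            do (unless first (write-char #\Tab stream))
               (if pre-formatted
                   (write-string field stream)
                   (write-string (escape-copy-text field) stream))) ; hypothetical
      (write-char #\Newline stream))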
Those tests currently only work when a single table is the target of the
load, and when that target is explicit in the INTO clause. More work needs
to be done to cover interesting cases such as MySQL and SQLite, where we
want to diff a full database rather than a single table.
This includes some Makefile hacks where the test target doesn't depend on
the main pgloader binary anymore, because I couldn't stop the binary from
being built again even when it had already been done...