When parsing table names in the target URI, we are careful to split
the schema and table names apart and store them in a cons in that
case. Not all source methods got the memo; clean that up.
See #182 and #186, a pull request that I am now going to be able to
accept. Also see #287, which should be helped by being able to apply
#186.
Apparently I just forgot to apply any smartness whatsoever to SQLite
identifiers and copied them as they are to PostgreSQL. Change that by
calling apply-identifier-case.
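As a rough sketch (not the actual patch), apply-identifier-case maps a
source identifier to its PostgreSQL spelling according to the
configured identifier case, along these lines:

  (apply-identifier-case "TableName") ; e.g. "tablename" when downcasing,
                                      ; "\"TableName\"" when quoting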
The COPY TEXT format accepts non-printable characters via an escape
sequence wherein pgloader can pass in the octal number of the
character in its encoding. When doing that with a small number like \6,
if the non-printable character is then followed by digits, the result
becomes e.g. \646, which might not be part of the target encoding...
To fix, always left-pad the character's octal number with zeroes, so
that we now send in \00646, which COPY knows how to read: the
character at \006, then 4, then 6.
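A minimal sketch of the padding in Common Lisp, using the FORMAT octal
directive with a zero pad character (the idea, not necessarily the
exact pgloader code):

  (format nil "\\~o" 6)     ; => "\6"    ambiguous when digits follow
  (format nil "\\~3,'0o" 6) ; => "\006"  always three octal digits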
Also copy the test case over to pgloader and run it in the test suite.
The previous coding added 2000 to the year as an integer whenever it
was below 2000, which parses 1999 as 3999. Oops. Trigger the
correction only when the year is given with exactly 2 digits, parsing
04 as 2004; years given with 3 digits are kept as-is.
Playing with the *century* special parameter makes it possible to
cancel this behavior, which maybe should be made entirely optional.
It's just too common to find current years written with only 2 digits,
sadly.
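A sketch of the corrected rule (not the actual pgloader code),
assuming a *century* special variable bound to 2000:

  (defvar *century* 2000)  ; assumed default for this sketch

  (defun expand-year (digits)
    "Add *century* only to years written with exactly 2 digits."
    (let ((year (parse-integer digits)))
      (if (= 2 (length digits))
          (+ *century* year)  ; "04"   => 2004
          year)))             ; "1999" => 1999, "999" => 999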
It's now possible to use several files in a BEFORE LOAD EXECUTE
section, and to mix DO and EXECUTE parts, bringing lots of flexibility
to the commands. It also actually simplifies the parser.
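For instance, something along these lines now parses (file names made
up for the example):

  BEFORE LOAD
       DO $$ create schema if not exists staging; $$
       EXECUTE 'create-tables.sql', 'create-functions.sql'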
The new option 'drop indexes' reuses the existing code to build all
the indexes in parallel but failed to properly account for that fact
in the summary report with timings.
While fixing this, also fix the SQL used to re-establish the indexes
and associated constraints to allow for parallel execution: the ALTER
TABLE statements would otherwise block, each waiting on an ACCESS
EXCLUSIVE lock, and make our efforts vain.
The option doesn't seem relevant to the db3 source type, which
contains a table definition: pgloader will create the table from
scratch, so no indexes are going to be found.
When loading against a table that already has index definitions, the
load can be quite slow. The previous commit introduced a warning in
such a case. This commit introduces the option "drop indexes", which
is not used by default.
When this option is used, pgloader drops the indexes before loading
the data, then creates the indexes again with the same definitions as
before. All the indexes are again created in parallel to optimize
performance. Only primary key indexes can't be created in parallel, so
those are created in two steps (create a unique index, then alter
table), as sketched below.
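The two-step dance for a primary key looks like the following, with
hypothetical table and index names:

  CREATE UNIQUE INDEX foo_pkey ON foo (id);
  ALTER TABLE foo ADD PRIMARY KEY USING INDEX foo_pkey;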
To allow importing JSON one-liners as-is into the database, it can be
interesting to leverage the CSV parser in a compatible setup. That
setup requires being able to use any separator character as the escape
character.
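A hypothetical setup along these lines picks quoting and escape
characters that never appear in the JSON, so that each line comes
through untouched:

  WITH fields terminated by '\t',
       fields enclosed by '|',
       fields escaped by '|'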
Some CSV files are given with a header line containing the list of
their column names; use that when given the option "csv header".
Note that when both the "skip header" and "csv header" options are
used, pgloader first skips as many lines as required and then uses the
next one as the CSV header.
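For example, to skip one garbage line and then take the column names
from the following one:

  WITH skip header = 1,
       csv header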
Because of a temporary failure to install the `ronn` documentation
tool, this patch only commits the changes to the source docs and omits
updating the man page (pgloader.1). A following patch that fixes that
is intended to be pushed.
See #236, which uses shell tricks to retrieve the field list from the
CSV file itself and motivated this patch to finally get written.
When given a file in the COPY format, we should expect its content to
be already properly escaped as expected by PostgreSQL. Rather than
unescaping the data and then escaping it again, add a new mode of
operation to format-vector-row in which it won't even try to reformat
the data.
In passing, fix an off-by-one bug in dealing with non-ASCII characters.
See test/parse/hans.goeuro.load for an example usage of the new option.
In passing, any error when creating indexes is now properly reported and
logged, which was missing previously. Oops.
This option is dangerous and makes it possible to skip ALL triggers
when loading data against PostgreSQL. This includes foreign key
constraint definitions and will allow loading data out of order.
When using both the "create no table" and "disable triggers" options,
it will be possible to load data into a schema prepared by your
favorite external tool, at the cost of not validating FK constraints.
Use with care.
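A sketch of the combination, spelling the options as in this patch:

  WITH create no table,
       disable triggers

Under the hood, disabling the triggers relies on issuing ALTER TABLE
... DISABLE TRIGGER ALL against the target tables.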
Fix #167.
The PostgreSQL COPY format is not really CSV but something way easier
to parse. Funnily enough, parsing it as CSV is not that easy, so we
add here a special simple parser for the COPY format.
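For reference, COPY text data separates columns with tabs, spells NULL
as \N, and uses backslash escapes, as in this sample line:

  1	hello world	\N	2015-01-01 10:00:00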
It should be quite useful to try loading again the reject data files
produced by pgloader after manual fixing, too. It's still missing some
documentation without any good excuse for that; will add it soon.
Now it's possible to parse a command to load data from MS SQL. Until
now the parser was parsing all database URIs within the same common
rule, and that isn't possible anymore if we want to distinguish
between source databases right from the parser, which we actually want
to do.
This patch also implements in-passing fixes all over the place,
including to the transformation function float-to-string, which only
happened to work on double-float data.
This allows users to benefit from the same flexible machinery when
using SQLite as when using MySQL, and also allows adding some more
default cast rules.
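Hypothetical examples of the kind of default cast rules this enables
(made-up rules, just to show the shape):

  CAST type datetime to timestamptz drop default,
       type integer to bigint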
It's not possible to use a comma as the separator when using more than
one source field option at the same time, so for better readability
the options are to be found enclosed in square brackets.
Also, it's now possible to spell out the "from" and "for" keywords in
the source definitions, making it easier to read and maintain the load
file, as in this full example:
(
a from 0 for 10,
b from 10 for 8,
c from 18 for 8,
d from 26 for 17 [null if blanks, trim right whitespace]
)
As seen in #116, it might be better for users to be able to ask for
field trimming right in the source definition, like we do for
processing nulls.
The default values for the MySQL WITH clause options weren't really
tested and broke on simple cases; the new set of defaults is known to
work in many (most?) cases.
Other combinations of options will need some review work, and we might
need to consider preventing some of them; that's for another patch
though.
When default values are used in SQLite, they are of course using their
SQLite representation, which might not be compatible with the
PostgreSQL target data type we're casting to. Make it so that the
default values are transformed too, as we already do in the MySQL
case.
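For instance (a made-up illustration), a SQLite column declared as in
the first line below and cast to a PostgreSQL boolean needs its
default rewritten through the same transformation as the data,
yielding the second line rather than an invalid "default 0":

  flag integer default 0       -- SQLite declaration
  flag boolean default false   -- transformed PostgreSQL declaration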
See #100.