pgloader

mirror of https://github.com/dimitri/pgloader.git synced 2026-02-10 00:41:34 +01:00

Author	SHA1	Message	Date
Dimitri Fontaine	9da012ca51	Fix identifiers quoting when reading PostgreSQL catalogs. We sure can trust PostgreSQL to use names it knows how to handle. Still, it will be happy to store in its catalogs names containing upper case, and in that case we must quote them.	2017-07-06 03:16:06 +02:00
Dimitri Fontaine	e37cb3a9e7	Split SQL queries into their own files. This change was long overdue. Ideally we would use something like the YeSQL library for Clojure, but it seems like the cl-yesql equivalent is not ready yet, and it depends on an experimental build system... So this patch introduces a URL abstraction built on top of a hash table. You can then reference src/pgsql/sql/list-all-columns.sql as (sql "pgsql/list-all-columns.sql") in the source code directly. So for now the templating system is CL's format language. It is still an improvement over embedded strings. Again, one step at a time.	2017-07-06 03:16:05 +02:00
Dimitri Fontaine	bae40d40c3	Fix identifier quoting corner cases. In cases when pgloader needs to build a new identifier from existing ones (mainly for renaming indexes, because they are unique per-table in the source database and unique per-schema in PostgreSQL), and we compose the new name from already quoted strings, pgloader was doing the wrong thing. Fix that by having a build-identifier function that may unquote parts then re-quote properly (if needed) the new identifier.	2017-07-05 15:37:21 +02:00
Dimitri Fontaine	538464f078	Avoid operator is not unique errors. When the intarray extension is installed our PostgreSQL catalog query fails because we now have more than one operator solving smallint[] <@ smallint[]. It is easy to avoid that problem by casting to integer[], smallint being an implementation detail here anyway. Fix #532.	2017-04-06 23:55:06 +02:00
Dimitri Fontaine	5fd1e9f3aa	Fix catalog merge hazards. When reading table names from PostgreSQL, we might find some that need systematic quoting (such as names that begin with a digit). In that case, when later comparing the catalogs to match source database table names against PostgreSQL catalog table names, we need to unquote the PostgreSQL table name we are using. In passing, force the identifier-case to :none when reading object names from the PostgreSQL catalogs.	2017-02-25 17:53:08 +01:00
Dimitri Fontaine	dbf7d6e48f	Don't double-quote identifiers in catalog queries. Avoid double quoting the schema names when used in PostgreSQL catalog queries, where the identifiers are used as literal values and need to be single-quoted. Fix #476, again.	2017-01-10 21:12:34 +01:00
Dimitri Fontaine	8da09d7bed	Log PostgreSQL Catalog queries at SQL log level. See #476 where it would have been helpful to see the PostgreSQL catalog queries with `--log-min-messages sql` in the bug report. Also more generally useful.	2017-01-10 21:12:34 +01:00
Dimitri Fontaine	a86a606d55	Improve existing PostgreSQL database handling. When loading data into an existing PostgreSQL catalog, we DROP the indexes for better performance of the data loading. Some of the indexes are UNIQUE or even PRIMARY KEYS, and some FOREIGN KEYS might depend on them in the PostgreSQL dependency tracking of the catalog. We used to use the CASCADE option when dropping the indexes, which hid a bug: if we exclude from the load tables with foreign keys pointing to tables we target, then we would DROP those foreign keys because of the CASCADE option, but fail to install them again at the end of the load. To prevent that from happening, pgloader now queries the PostgreSQL pg_depend system catalog to list the “missing” foreign keys and adds them to our internal catalog representation, from which we know to DROP then CREATE the SQL object at the proper times. See #400 as this was an oversight in fixing this issue.	2016-08-10 22:02:06 +02:00
Dimitri Fontaine	70572a2ea7	Implement support for existing target databases. Also known as the ORM case, it happens that other tools are used to create the target schema. In that case pgloader's job is to fill in the existing target tables with the data from the source tables. We still focus on load speed and pgloader will now DROP the constraints (Primary Key, Unique, Foreign Keys) and indexes before running the COPY statements, and re-install the schema it found in the target database once the data load is done. This behavior is activated when using the “create no tables” option as in the following test-case setup: `with create no tables, include drop, truncate`. Fixes #400, for which I got a test-case to play with!	2016-08-06 20:19:15 +02:00
Dimitri Fontaine	2d47c4f0f5	Use internal catalog when loading from files. Replace the ad-hoc code that was used before in the load from file code path to use our full internal catalog representation, and adjust APIs to that end. The goal is to use catalogs everywhere in the PostgreSQL target API and to allow reasoning explicitly about source and target catalogs, see #400 for the main use case.	2016-08-05 11:42:06 +02:00
Dimitri Fontaine	2aedac7037	Improve our internal catalog representation. First, add indexes and foreign keys to the list of objects supported by the shared catalog facility, where they were only found in the pgsql schema specific package for historical reasons. Then also add to our catalog internal structures the notion of a trigger and a stored procedure, allowing for cleaner advanced default values support in the MySQL cast functions. Now that we have a proper and complete catalog, review the pgsql module DDL output function in terms of the catalog and rewrite the schema creation support so that it takes direct benefit of our internal catalogs representation. In passing, clean up the code organisation of the pgsql target support module to be easier to work with. Next step consists of getting rid of src/pgsql/queries.lisp: this facility should be replaced by the usage of a target catalog that we fetch the usual way, thanks to the new src/pgsql/pgsql-schema.lisp file and list-all-* functions. That will in turn allow for an explicit step of merging the pre-existing PostgreSQL catalog when it's been created by other tools than pgloader, that is when migrating with the help of an ORM. See #400 for details.	2016-08-01 23:14:58 +02:00

11 Commits
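
Commit bae40d40c3 describes building a new identifier from parts that may already be quoted: unquote each part, join them, then re-quote the result only if needed. The following is a minimal Python sketch of that idea, not pgloader's actual Common Lisp code. The function names, the separator argument, and the "safe identifier" rule (lower case letters, digits and underscores, not starting with a digit) are assumptions made for illustration; reserved keywords are deliberately ignored.

```python
import re

# Assumed rule: an identifier needs no quoting when it is all lower case,
# starts with a letter or underscore, and contains only [a-z0-9_].
# Reserved SQL keywords are not handled in this sketch.
_SAFE_IDENTIFIER = re.compile(r"[a-z_][a-z0-9_]*")


def unquote(name: str) -> str:
    """Strip one level of double quotes and undo doubled inner quotes."""
    if len(name) >= 2 and name.startswith('"') and name.endswith('"'):
        return name[1:-1].replace('""', '"')
    return name


def quote_if_needed(name: str) -> str:
    """Double-quote the name unless it matches the assumed safe pattern."""
    if _SAFE_IDENTIFIER.fullmatch(name):
        return name
    return '"' + name.replace('"', '""') + '"'


def build_identifier(separator: str, *parts: str) -> str:
    """Join possibly quoted parts into one properly quoted identifier."""
    raw = separator.join(unquote(part) for part in parts)
    return quote_if_needed(raw)
```

For example, `build_identifier("_", '"MyTable"', "idx")` yields a single quoted name rather than a name with quotes embedded in the middle, which is the kind of corner case the commit message refers to.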