Currently the default value getdate() is replaced by 'now', which creates
statements like:
CREATE TABLE ... created timestamp 'now' ...
which leads to table definitions like:
default '2017-06-12 17:54:04.890129'::timestamp without time zone
This is because 'now' is evaluated when creating the table.
This commit fixes the issue by using CURRENT_TIMESTAMP as default
instead of 'now'.
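A minimal sketch of the difference, with made-up table and column names:
create table t_now (created timestamp default 'now');
-- 'now' is resolved when the table is created, freezing that timestamp
create table t_current (created timestamp default current_timestamp);
-- current_timestamp is evaluated at each INSERT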
* Uninstall PostgreSQL during Travis setup
Recent build images start with PostgreSQL running. Uninstall it so we
can install and run a specific version.
* Skip authentication in Travis builds
The wrong combination of connection parameters will cause psql and other
utilities to prompt for a password, stalling the Travis build
indefinitely.
* Move PostgreSQL setup for Travis builds into script
* Use a newer version of SBCL in Travis builds
Recent versions of bordeaux-threads require ASDF >= 3.1.
The previous commit makes it possible to convert typemods for various
text types. In MSSQL it's possible to create a column like varchar(max).
Internally this is reported as varchar(-1), which results in a CREATE
TABLE statement that contains e.g. varchar(-1).
This patch drops the typemod if it's -1 (max).
It's based on Dimitri's patch, slightly modified by myself.
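As an illustration, with a made-up table, the cast now behaves roughly like this:
-- MS SQL source column
create table doc (body varchar(max));
-- PostgreSQL target: the -1 typemod is dropped rather than emitted as varchar(-1)
create table doc (body varchar);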
If a custom CAST rule is defined (e.g. CAST type varchar to varchar) the
original typemod gets lost. This commit is based on Dimitri's patch from
571 and adds typmod support for "char", "nchar", "varchar", "nvarchar"
and "binary".
The previous coding would discard any work done at the apply-casting-rules
step when adding source-specific smarts about handling defaults, because of
what looks like negligence and bad tests. A test case scenario exists but
was not exercised :(
Fix that by defaulting the default value to the one given back at the
apply-casting-rules stage, where we apply the "drop default" clause.
It turns out that in some cases SQLite will not list its primary key indexes
in the pragma index_list(), and that's related to an implementation detail
about using its internal btree structure when rowid can be exploited as the
integer pkey.
Reverse engineer that behavior in pgloader so that the PostgreSQL schema
contains the primary keys even when they are implicit. In doing that we must
be careful that the pkeys might also appear explicitly, in which case we
refrain from declaring them twice.
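A minimal sketch of the behavior, with a made-up table:
create table t (id integer primary key, name text);
pragma index_list(t);
-- returns no row for the primary key: id is just an alias for rowid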
SQLite catalogs are not my favorite, not to say hackish as hell.
Fixes #563.
Given the new SQLite test case from issue #563 we see that pgloader doesn't
handle errors gracefully in the post-copy stage. That's because the API was
not properly defined: we should use pgsql-execute-with-timing rather than
another construct here, because it allows the "on error resume next"
behavior we want with after load DDL statements.
See #563.
It turns out that this catalog table in SQLite may or may not exist
depending on whether the current schema actually uses the feature. So we
need to learn to query the catalogs about the catalogs before querying the
catalogs to learn about the current schema. Thanks SQLite.
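A hedged sketch of the kind of pre-check that implies:
-- ask the catalogs about the catalogs first
select count(*) from sqlite_master
 where type = 'table' and name = 'sqlite_sequence';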
Let's make it easy for interested parties to actually buy a license, and
offer several levels of “partnership” with the project. Be a Sponsor, a
Partner or even consider having pgloader custom development made for you.
Oracle™ support anyone?
As reported we still have some complex situations to deal with when building
from sources in some environments: Windows and macOS MacPorts are not in
very good shape at the moment.
Add some good content in the README to better help people in those
environments get their build done.
Fix #444.
In order to support custom SQL files with several queries and the psql-like
advanced \i feature, we have our own internal SQL parser in pgloader. The
PostgreSQL variant of SQL is pretty complex and allows dollar-quoting and
other subtleties that we need to take care of.
Here we fix the case where a dollar sign ($) appears as part of a single
quoted text (such as a regexp), so that it is read as part of the quoted
text rather than as the start of a new dollar-quoting tag.
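For instance the parser must now read the following without opening a
dollar-quoted string at the $ sign (the query itself is only illustrative):
select regexp_replace(name, '[0-9]+$', '') from foo;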
In passing we also fix reading double-quoted identifiers, even when they
contain a dollar sign. After all, the following is totally supported by
PostgreSQL:
create table dollar("$id" serial, foo text);
select "$id", foo from dollar;
Fix #561.
It turns out that the SQLite catalogs have a way of registering if a column
is an auto_increment. You have to look into the sqlite_sequence catalog for
the name of the table the column is found in, and when there's a non-zero
entry there and the table has a single pk-id column of integer type then it's
safe to assume that we have an "auto_increment" column.
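A minimal sketch of the lookup, with a made-up table name:
-- a row here means the table declares an AUTOINCREMENT column
select seq from sqlite_sequence where name = 'foo';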
The current way to handle this property is suboptimal in pgloader too, where
we need to attach an "extra" slot to our column representation with a value
of the "auto_increment" string. But at least it's easy to hack the code
here... anyway.
Fix #556.
It turns out we forgot to add support for internal catalog munging clauses
to SQLite support. The catalogs being normalized means there's no extra work
here other than allowing the parser to accept those clauses and then pass
them over to our generic `copy-database' method implementation.
It is to be noted that SQLite has no support for schemas as per the standard
and PostgreSQL, so that when we inspect the database schema we create a nil
entry here. It's then not possible to ALTER SCHEMA nil RENAME TO 'target',
unfortunately, but it's easy enough to SET search_path to 'target' anyway,
as shown in the modified test case.
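A hedged sketch of that workaround in a load command, with illustrative
connection strings and schema name:
load database
     from sqlite:///tmp/source.db
     into postgresql:///target
  set search_path to 'target';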
Fix #552.
When the source is not recognized, pgloader would not always output a useful
error message with the hint about using --type, as per #248. Here we fix the
conditions applying to this error message.
In #539 it appears that we fail to honor some actions, and it seems to be
because of sloppy reasoning in how to make sense of all of them. It could
well be that we have too many options for them to make sense in all possible
combinations, but for now it looks safe to bet on an easy fix...
The concurrent nature of pgloader made it non-obvious where to implement
the timers properly, and as a result the tracking of how long it took to
actually transfer the data was... just wrong.
Rather than trying to measure the time spent in any particular piece of the
code, we now emit "start" and "stop" stats messages to the monitor thread at
the right places (which are way easier to find, in the worker threads) and
have the monitor figure out how long it really took.
Fix #506.
Now that we have fixed the output of the per-table total timing, we show
only that timing by default. With more verbosity pgloader will add the extra
columns, and in computer-oriented formats (json, csv, copy) all the details
are always provided of course.
See #506.
The handler-case form installed would catch any non-fatal warning and would
also fail to display any error to the user. Both are wrong behaviors that
this patch fixes, using *error-output* (that's stderr) explicitly for
anything that may happen while loading the user-provided code.
Fix #526.
When the intarray extension is installed our PostgreSQL catalog query fails
because we now have more than one operator solving smallint[] <@ smallint[].
It is easy to avoid that problem by casting to integer[], smallint being an
implementation detail here anyway.
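A hedged illustration of the ambiguity, assuming the intarray extension is
installed and using made-up values:
select '{1}'::smallint[] <@ '{1,2}'::smallint[];  -- operator is not unique
select '{1}'::integer[] <@ '{1,2}'::integer[];    -- resolves fine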
Fix #532.
Force double-quoting of objects name in DROP INDEX commands by using the
format directive ~s. The names of the objects we are dropping usually come
from a PostgreSQL catalog, but still might contain force-quote conditions
like starting with a number, as shown in #530.
This fix certainly means we will have to review all the DDL formatting we do
in pgloader and apply a single method of quoting all along. The simplest one
is of course to force-quote every object name in "", but it might not be the
smartest one (what if some sources are sending already quoted object names?
that needs a check), and it's certainly not the prettiest way to go about it:
people usually like to avoid unnecessary quotes, calling them clutter.
Fix #530.
Given a test case and some reading of the FreeTDS source code, it appears
that the XML data type is sent on the wire as (unicode) text. This patch
makes pgloader aware of that and also revisits the choice of casting XML to
PostgreSQL XML data type (thanks to the test case where we see it just works
without surprise).
Fix #503.
Add a cast rule to support tinyint being an “identity” type in MS SQL, which
means using a sequence to derive its values from. We didn't address the
whole MS SQL integer type tower here, and suspect we will have to add more
in the future.
Fix #528, where I could get access to a test case and easily reproduce the
bug, thanks!
In advanced projections it could be that we call the transformation function
for some input fields twice. This is a bug that manifests in particular when
the output of the transformation can't be used/parsed again by the same
function, as shown in the reported bug.
Fix #523.
We have to pay attention that column names in MS SQL don't follow the same
rules as in PostgreSQL and may e.g. begin with numbers. Apply identifier
case and rules to index column names too.
Many options are now available to pgloader users, including shortcuts that
were not defined clearly enough. That could result in stupid things being
done at times.
In particular, when picking the "data only" option, indexes are not to be
dropped before loading the data, but pgloader would still try and create
them again at the end of the load, because the option that controls that
behavior defaults to true and is not impacted by the "data only" choice.
In this patch we review the logic and ensure it's applied in the same
fashion in the different phases of the database migration: preparation,
copying, rebuilding of indexes and completion of the database model.
See also 96b2af6b2a where we began fixing
oddities but didn't go far enough.
The new version of the sakila database uses geometry-typed columns that
contain POINT data. Add support for that kind of data by copying what we did
already for the POINT data type.
In PostgreSQL it is possible at CREATE TABLE time to set some extra storage
parameters, the most useful of them in the context of pgloader being the
FILLFACTOR. For the setting to be useful, it needs to be positioned at
CREATE TABLE time, before we load the data.
The BEFORE LOAD clause of the pgloader command allows running SQL scripts
that will be executed before the load, and even before the creation of the
target schema when pgloader does that, which is nice for other use cases.
Here we implement a new `ALTER TABLE` rule that one can set in the pgloader
command in order to change storage parameters at CREATE TABLE time:
ALTER TABLE NAMES MATCHING ~/\./ SET (fillfactor='40')
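On the PostgreSQL side this should boil down to something like the following,
with a made-up table definition:
create table foo (id integer, name text) with (fillfactor = 40);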
Fix #516.
We cast MS SQL "int" type to "integer" in PostgreSQL, so add an entry in our
type name mapping where they are known to be equivalent, to avoid WARNINGs about
the situation in DATA ONLY loads.
When reading table names from PostgreSQL, we might find some that need
systematic quoting (such as names that begin with a digit). In that case,
when later comparing the catalogs to match source database table names
against PostgreSQL catalog table names, we need to unquote the PostgreSQL
table name we are using.
In passing, force the *identifier-case* to :none when reading object names
from the PostgreSQL catalogs.
The parallelism in pgloader is now smart enough to begin fetching data from
the next table while the previous one is still not done being written down
to PostgreSQL, but when doing so I introduced a bug in the way indexes are
taken care of.
Specifically, in schema-only mode of operations, we would wait for indexes
we skipped creating. The skipping is the bug here, so make sure we create
indexes even when we don't want to copy any data over.
Defining rules on standard symbols like CL:NAMESTRING is a bad idea
since other systems may do the same, inadvertently overwriting each
other's rules.
Furthermore, future esrap versions will probably prevent defining
rules whose names are symbols in locked packages, making this change
mandatory.
Availability of a test case for MS SQL allows us to make progress on this
limitation and add support for the smalldatetime data type. It is
converted server-side with the same CONVERT expression as the longer
datetime datatype.
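A hedged sketch of such a server-side conversion, with made-up column and
table names; whether pgloader uses style 126 (ISO 8601) exactly is an
assumption here:
select convert(varchar, [created_at], 126) from [dbo].[events];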
Fixes #431.
In the case of targeting an already existing PostgreSQL database,
columns might have been reordered. Add the column name list to the COPY
command we send so that we figure the mapping out automatically.
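A hedged sketch of the resulting command, with made-up table and columns:
copy target.users (id, email, created_at) from stdin;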
Fixes #509.
It turns out that it's possible and not too complex, when using the
FreeTDS driver, to enforce the client encoding for MS SQL to be utf-8.
Document how to tweak ~/.freetds.conf to that end.
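A minimal ~/.freetds.conf sketch to that effect:
[global]
    client charset = UTF-8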
When the column is known to be non-nullable, refrain from adding a null
default value to it. This also fixes the case of casting from an
[int] IDENTITY(1,1) NOT NULL
which otherwise got transformed into a
bigserial not null default NULL
causing the error:
Database error 42601: multiple default values specified for column ... of table ...