The Postgres schema comparison and related problem with views

Compare schemas of two or more different Postgres databases is a common task, but it can become more tricky, if these databases run on different versions of Postgres'. Quick and canonical way to compare schemes is the use of the same program pg_dump to communicate with each base with --schema-only parameter. This method works great, but there are some pitfalls, especially when copying submissions.

(Photos made Philippe Vieux-Jeanton)

the

the Premise

Let's start with some assumptions, as was discovered by this problem. We have an instance that is in the process of upgrading versions of Postgres 9.2 to 9.6 (the latest version at the time of writing). Use pg_upgrade was impossible as it was planned not only enable data checksums, but changing the encoding to UTF-8. A number of factors, especially the change of the encoding meant that the typical update process old_database pg_dump | psql new_database impossible. Thus, we have a very specific program that accurately migrates portions of the data, producing the action along the way.

the

the Problem

As a final assessment of sanity, we wanted to ensure that the final scheme has been updated to version 9.6 of the database as far as possible identical to the current scheme grocery database version 9.2. When comparing the output of pg_dump, we quickly found a problem with the way of displaying views. Version 9.2 uses very lean, single-line output, while version 9.6 uses multiline "beautifully derived" variation. Needless to say, this meant that none of the representations do not coincide when comparing the output of pg_dump.

The problem lies in the system function pg_get_viewdef(), which is used pg_dump'om to return human-readable and Postgres-recognizable version of the view. To demonstrate the problems and solutions that will create a simple view on both databases, then compare them via pg_dump:

the

$ psql -p vtest 5920-c \
'gregtest create view as select count(*) from pg_class where reltuples = 0'
CREATE VIEW
$ psql -p vtest 5960-c \
'gregtest create view as select count(*) from pg_class where reltuples = 0'
CREATE VIEW
$ diff-u <(vtest pg_dump-x-p 5920 --schema-only) <(vtest pg_dump-x-p 5960 --schema-only)

--- /dev/fd/70 2016-09-29 12:34:56.019700912 -0400
+++ /dev/fd/72 2016-09-29 12:34:56.019720902 -0400
@@ -2,7 +2,7 @@
-- PostgreSQL database dump
--

--- Dumped from database version 9.2.18
+-- Dumped from database version 9.6.0
-- Dumped by pg_dump version 9.6.0

SET statement_timeout = 0;
@@ -35,22 +35,14 @@
--

CREATE VIEW gregtest AS
-SELECT count(*) AS count FROM pg_class WHERE (pg_class.reltuples = (0)::double precision);
+ SELECT count(*) AS count
+ FROM pg_class
+ WHERE (pg_class.reltuples = (0)::double precision);

The only difference besides the version of the server is a representation that does not match at all, and is concerned about the diff. (For the purposes of this article, from the output, to remove all secondary lines).

As mentioned earlier, the culprit is the pg_get_viewdef(). His job is to represent the filling of a presentation in a relevant, readable way. There are two main changes which it makes with this conclusion: adding the parentheses and adding padding with spaces. In recent versions, despite the fact that the documents hint, indentation (nice conclusion) can't be disabled, so there's no easy way to get the server version 9.6 to give the difference in views on one line, how does the server version 9.2 by default. Moreover, there are five versions of the function pg_get_viewdef, each of which accepts different arguments:

view name
the view name, and a logical argument
OID
OIDs and logical argument
an OID and an integer argument

pg_get_viewdef(text,boolean)

see

$ psql vtest -p 5920 -Atc "select pg_get_viewdef('gregtest')"
SELECT count(*) AS count FROM pg_class WHERE (pg_class.reltuples = (0)::double precision);

$ psql vtest -p 5920 -Atc "select pg_get_viewdef('gregtest',false)"
SELECT count(*) AS count FROM pg_class WHERE (pg_class.reltuples = (0)::double precision);

$ psql vtest -p 5920 -Atc "select pg_get_viewdef('gregtest',true)"
SELECT count(*) AS count +
FROM pg_class +
WHERE pg_class.reltuples = 0::double precision;

$ psql vtest -p 5960 -Atc "select pg_get_viewdef('gregtest')"
SELECT count(*) AS count
FROM pg_class
WHERE (pg_class.reltuples = (0)::double precision);

$ psql vtest -p 5960 -Atc "select pg_get_viewdef('gregtest',false)"
SELECT count(*) AS count
FROM pg_class
WHERE (pg_class.reltuples = (0)::double precision);

$ psql vtest -p 5960 -Atc "select pg_get_viewdef('gregtest',true)"
SELECT count(*) AS count
FROM pg_class
WHERE pg_class.reltuples = 0::double precision;

Solutions

to Write a script that will transform and normalize the output circuit
to Modify the source code Postgres'and behavior changes pg_get_viewdef
to call pg_dump'om functions pg_get_viewdef thus, to get identical output

~~ugly~~

src/backend/utils/adt/ruleutils.c

- #define PRETTYFLAG_INDENT 2
+ #define PRETTYFLAG_INDENT 0

pg_get_viewdef(oid)

pg_get_viewdef(oid,integer)

$ psql vtest -p 5920 -tc "select pg_get_viewdef('gregtest'::regclass, 0)"
SELECT count(*) AS count +
FROM pg_class +
WHERE pg_class.reltuples > 0::double precision;

$ psql vtest -p 5960 -tc "select pg_get_viewdef('gregtest'::regclass, 0)"
SELECT count(*) AS count +
FROM pg_class +
WHERE pg_class.reltuples > 0::double precision;

$ diff-u <(vtest pg_dump-x-p 5920 --schema-only) <(vtest pg_dump-x-p 5960 --schema-only)

--- /dev/fd/80 2016-09-29 12:34:56.019801980 -0400
+++ /dev/fd/88 2016-09-29 12:34:56.019881988 -0400
@@ -2,7 +2,7 @@
-- PostgreSQL database dump
--

--- Dumped from database version 9.2.18
+-- Dumped from database version 9.6.0
-- Dumped by pg_dump version 9.6.0

SET statement_timeout = 0;

pg_get_viewdef()

$ createdb -p 5960 vtest92

$ pg_dump vtest -p 5920 | psql -q-p 5960 vtest92

$ diff-s-u <(vtest92 pg_dump-x-p 5960 --schema-only) <(vtest pg_dump-x-p 5960 --schema-only)

Conclusion

Article based on information from habrahabr.ru

Поиск по этому блогу

computer express