[aur-dev] some notes from converting the aur from mysql to postgresql

Mon Apr 25 22:35:27 EDT 2011

While converting the aur codebase from mysql to postgresql (just for
fun/to see if I could), I ran into a few mysql specific gotchas, and a
few oddities. Here are some of my notes from the experience.

* I ran into a few sql integer comparisons that are using string
comparisons. postgres didn't like this, but mysql was fine with it
  ex: `$q .= "WHERE s.UsersId = '" . $userID . "' ";`

* using the limit offset format of `"LIMIT x,y"`
  postgresql doesn't support this format, so I had to track each
occurrence down and change it to `"Limit x OFFSET y"`

* A few instances of `"LIMIT 0,10"`. A `"LIMIT 10"` would have been
sufficient. Not sure why offset was specified in these few instances.

* A pair of column names were reserved words. "user' and 'end' in the
TU_VoteInfo table. I changed the column names to
  username and ending, to avoid issues and/or having to quote the name
all the time.

* postgresql table and column names are lowercase (when not quoted).
This required me to either quote all instances of table and
  column names (ugh) or change the fetch_assoc php references to
lowercase. I chose the latter, even though it was tedious to
  track down all instances and change them.

* postgres has no mysql_insert_id counterpart. Instead I appended to
the query "RETURNING ID" and then used pg_fetch_result
  to get the ID that was created.

* postgres LIKE and ILIKE queries did full table scans, and was quite
slow. I think mysql was caching these queries when I was doing my
  performance tests. The end result was postgres performing poorly in
my load test for LIKE/ILIKE operations. So I created a postgres
fulltext
  search column for the packages table, indexed it, defined triggers
to keep it udpated, and converted to using postgres fulltext searching
  `"@@ to_tsquery('term')"`. The result was quite speedy and gave nice
results. The syntax was a bit different for 'prefix searching' though,
  and the results were not the same as the LIKE/ILIKE queries, because
fulltext uses stemming instead of partial string comparison.

* There is quite a bit of sql code sprinkled all over the place. It
might make sense to try and consolidate some of the sql to fewer
files.

* It might make sense to generalize the 'mysql_real_escape_string'
instances to a wrapper 'db_escape' or something similar. This would
could
  make things a bit cleaner, ease possible future porting to other
databases (fewer things to track down and change individually).

* postgres doesn't support `"UNIX_TIMESTAMP()"`. Instead of changing
all instances of it, I defined a few convenience sql functions.

        --
        -- helper functions to ease conversion from mysql
        -- from:
http://janusz.slota.name/blog/2009/06/mysql-from_unixtime-and-unix_timestamp-functions-in-postgresql/
        --
        -- no params
        CREATE OR REPLACE FUNCTION unix_timestamp() RETURNS BIGINT AS '
            SELECT EXTRACT(EPOCH FROM CURRENT_TIMESTAMP(0))::bigint AS result;
        ' LANGUAGE 'SQL';

        -- timestamp without time zone (i.e. 1973-11-29 21:33:09)
        CREATE OR REPLACE FUNCTION unix_timestamp(TIMESTAMP) RETURNS BIGINT AS '
            SELECT EXTRACT(EPOCH FROM $1)::bigint AS result;
        ' LANGUAGE 'SQL';

        -- timestamp with time zone (i.e. 1973-11-29 21:33:09+01)
        CREATE OR REPLACE FUNCTION unix_timestamp(TIMESTAMP WITH TIME
zone) RETURNS BIGINT AS '
            SELECT EXTRACT(EPOCH FROM $1)::bigint AS result;
        ' LANGUAGE 'SQL';

I think that is it.