On 28/02/13 01:54 PM, Phillip Smith wrote:
I'd be willing to try and assist with this too. What is the format of that backup file?
It is a 46MB text file of SQL commands; the kind you would get by running mysqldump. It only has 462 lines, but some of them are very long. The important lines are 97-117 that specify the "PackageComments" table: CREATE TABLE `PackageComments` ( `ID` bigint(20) unsigned NOT NULL AUTO_INCREMENT, `PackageID` int(10) unsigned NOT NULL DEFAULT '0', `UsersID` int(10) unsigned NOT NULL DEFAULT '0', `Comments` text NOT NULL, `CommentTS` bigint(20) unsigned NOT NULL DEFAULT '0', `DelUsersID` int(10) unsigned NOT NULL DEFAULT '0', PRIMARY KEY (`ID`), KEY `UsersID` (`UsersID`), KEY `PackageID` (`PackageID`), KEY `DelUsersID` (`DelUsersID`) ) ENGINE=MyISAM AUTO_INCREMENT=154508 DEFAULT CHARSET=latin1; /*!40101 SET character_set_client = @saved_cs_client */; -- -- Dumping data for table `PackageComments` -- LOCK TABLES `PackageComments` WRITE; /*!40000 ALTER TABLE `PackageComments` DISABLE KEYS */; ID is a number identifying the comment, PackageID is the package to which it belongs, UsersID is the one who posted it, Comments is the actual text of it, CommentTS is the timestamp of when it was posted, DelUsersID is equal to the ID of the user who deleted the comment and 0 if it has not been deleted. The next important lines are 118-146 which state the actual comment data. An example of it is: INSERT INTO `PackageComments` VALUES (17,46,68,'ruby bindings for fastcgi',1113164127,68),(28,69,65,'A countdown timer applet for the GNOME panel.',1113178883,0); Except that line there is 161 characters and contains two comments (one comment deleted by its poster about Ruby and one non-deleted comment about GNOME). The line in the real file is a million characters and contains ~20k comments. And there are 28 such lines. Reading this would be like reading War And Peace 10 times but it would teach you a lot about the history of the AUR.