Spring in Practice

Willie Wheeler's Spring blog

Migrating non-Wordpress Blog Comments Into Wordpress

| Comments

This isn’t a Spring post, but I’m doing something that I think others might find useful, so I’m going to share it.

I’m in the process of migrating content over from my old Wheeler Software blog to this one, which is a Wordpress blog. Besides the posts themselves, I want to move the comments over.

The slight wrinkle in the plan is that the software and database for the old blog are custom. So getting the comments over involves some SQL scripting. Both databases are MySQL databases, so that helps a bit.

Here’s what I’m doing.

The old, custom comment table

CREATE TABLE `services_comment` (
  `id` int(10) unsigned NOT NULL AUTO_INCREMENT,
  `url` varchar(255) NOT NULL DEFAULT '',
  `name` varchar(50) NOT NULL,
  `email` varchar(50) NOT NULL,
  `web` varchar(100) DEFAULT NULL,
  `text` text NOT NULL,
  `html_text` text,
  `ip_addr` varchar(15) NOT NULL,
  `date_created` timestamp NOT NULL DEFAULT '0000-00-00 00:00:00',
  PRIMARY KEY (`id`),
  KEY `services_comment_idx0` (`url`,`date_created`)

The Wordpress post table

This isn’t the entire table, but just the columns that we care about for this post:

CREATE TABLE `wp_posts` (
  `ID` bigint(20) unsigned NOT NULL AUTO_INCREMENT,
  `post_title` text NOT NULL,
  `comment_count` bigint(20) NOT NULL DEFAULT '0',

  ... other columns that we don't care about for our current purpose ...


The Wordpress comment table

CREATE TABLE `wp_comments` (
  `comment_ID` bigint(20) unsigned NOT NULL AUTO_INCREMENT,
  `comment_post_ID` bigint(20) unsigned NOT NULL DEFAULT '0',
  `comment_author` tinytext NOT NULL,
  `comment_author_email` varchar(100) NOT NULL DEFAULT '',
  `comment_author_url` varchar(200) NOT NULL DEFAULT '',
  `comment_author_IP` varchar(100) NOT NULL DEFAULT '',
  `comment_date` datetime NOT NULL DEFAULT '0000-00-00 00:00:00',
  `comment_date_gmt` datetime NOT NULL DEFAULT '0000-00-00 00:00:00',
  `comment_content` text NOT NULL,
  `comment_karma` int(11) NOT NULL DEFAULT '0',
  `comment_approved` varchar(20) NOT NULL DEFAULT '1',
  `comment_agent` varchar(255) NOT NULL DEFAULT '',
  `comment_type` varchar(20) NOT NULL DEFAULT '',
  `comment_parent` bigint(20) unsigned NOT NULL DEFAULT '0',
  `user_id` bigint(20) unsigned NOT NULL DEFAULT '0',
  PRIMARY KEY (`comment_ID`),
  KEY `comment_approved` (`comment_approved`),
  KEY `comment_post_ID` (`comment_post_ID`),
  KEY `comment_approved_date_gmt` (`comment_approved`,`comment_date_gmt`),
  KEY `comment_date_gmt` (`comment_date_gmt`),
  KEY `comment_parent` (`comment_parent`)

The copy script

WARNING and DISCLAIMER: Before you try anything like this, back up your database! And it’s a good idea to experiment with your script on a separate copy of the database before trying it out on the real thing. You’ve been warned. I’m not responsible if you hose your database, and I’m not going to field support requests!

Step 1. Copy the old table into the new database.

I created a copy of my old comment table inside the new Wordpress database to facilitate the copying.

Step 2. Look up the target Wordpress post’s ID

You’ll need to have something to attach your comments to. You can use whatever means you like here; I just looked it up by the post title:

  post_title = 'Getting started with Hibernate Validator' and
  post_parent = 0

Note that you want the post with post_parent = 0. Other posts are the various revisions you create over time, and Wordpress has to have a way to attach all the comments to the same single post.

Step 3. Copy the comments

This is obviously dependent on the specifics of your source table. In my case the source table is pretty close to the destination table, so that makes things a lot easier.

I didn’t really care about getting the date vs. GMT date right, so I just used the same date for both. If you care, I’m sure there’s a function that can handle that.

insert into
  wp_comments (comment_post_ID, comment_author, comment_author_email, comment_author_url, comment_author_IP, comment_date, comment_date_gmt, comment_content, comment_approved)
  @post_id, name, email, web, ip_addr, date_created, date_created, html_text, 1
  url = '/hibernate-validator.html'
order by

Step 4. Update the post’s comment count

Presumably for performance reasons, Wordpress stores the comment count with the post itself. So we have to update that column or else it will say “0 comments” even though there’s a bunch of comments below:

select count(*) from wp_comments where comment_post_ID = @post_id and comment_approved = 1 into @comment_count;
update wp_posts set comment_count = @comment_count where ID = @post_id;

It probably wouldn’t be a bad idea to wrap these up in a stored procedure if you have a lot of posts. But no biggie either way.

Step 5. Verify

Check to see whether your comments showed up. Mine did:

For some reason the Gravatars don’t seem to be showing up as much when there are a lot of comments all on a single page. I don’t know if it’s rate limited or what.