Just signed into a Google account that I haven’t used for a while and noticed the Introducing Search plus Your World message below – it’ll stay for every search until you dismiss it. Placement money can’t buy!

The open source version of OrangeHRM is a great system and a really good way of managing a team’s annual leave dates. However, one feature achingly missing from the free version is a leave calendar, and you have to pay $250(!) to add that functionality.

I’ve knocked up an easy (and moreover free) way of getting a calendar into the system (version 2.6). Just follow the instructions below to implement a basic (not pretty) leave calendar:

1. Edit the main menu to add a link to your calendar (around line 500) by adding another element to the subs menu: Read more »

Downloading the latest version of Nutch is pretty easy, but I had a few issues getting it set up on my Red Hat server: there are a couple of gotchas along the way and it doesn’t work exactly as specified in the tutorial.

  1. Download & install Java – super simple: yum install java
  2. wget & unzip the latest version of Nutch
  3. I had issues with the JAVA_HOME environment variable when trying to follow the example on crawling a website. The error I got was “/usr/loca/jdk/bin/java: No such file or directory” – the problem here was twofold: 1) I didn’t have Java set in my environment variables, and 2) around line 118 in the bin/nutch file there’s a reference to $JAVA_HOME/bin/java – the bold part of which seems unnecessary and should be deleted
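The first half of that JAVA_HOME problem can be sketched in shell. This is only a rough sketch, assuming GNU readlink (as shipped on Red Hat) and a java binary that sits in a bin directory directly under the Java home; the helper name java_home_for is made up for illustration and is not part of Nutch.

```shell
#!/bin/sh
# Hypothetical helper (not part of Nutch): given the full path to a java
# binary, print the directory two levels up, so that $JAVA_HOME/bin/java
# resolves back to that binary.
java_home_for() {
    dirname "$(dirname "$1")"
}

# If java is on the PATH, follow any symlinks to the real binary and
# export JAVA_HOME for the current shell session.
if command -v java >/dev/null 2>&1; then
    JAVA_BIN=$(readlink -f "$(command -v java)")
    JAVA_HOME=$(java_home_for "$JAVA_BIN")
    export JAVA_HOME
    echo "JAVA_HOME=$JAVA_HOME"
fi
```

Adding the export to your shell profile should make it stick between sessions; whether bin/nutch also needs editing depends on the version you downloaded.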


The below is an internal email sent out today at OMD which I thought would be nice to share with the wider SEO community.


Today marks a year since we lost a highly valued colleague and friend on the OMD SEO team, Jaamit Durrani. The dedication, passion, humour & intelligence he showed while he was with us have had a lasting effect on us personally, and have also been crucial to the SEO team’s growth from a very small team last year to a highly successful team of 15+ this year; a year in which we’ve won our first ever major SEO-only client, scored 100% in client feedback and continued to build a better offering month on month.

As a small tribute, and hopefully an insight for those who weren’t lucky enough to meet or work with him, this week’s links highlight some of his best blog posts, as well as some of the tributes posted online: Read more »

Get HTML returned from HTML::TreeBuilder::XPath

Perl’s HTML::TreeBuilder::XPath is a great module for parsing HTML documents without regular expressions; however, it returns text content by default, which is not always what you want when you’re doing advanced HTML processing. The documentation on CPAN doesn’t mention this, but if you want to get the HTML content out, just use “findnodes” and “->shift->as_HTML” in the way illustrated below:

my $value = $tree->findnodes(q{//div[@class='crumbs']})->shift->as_HTML;