20101211

ERROR: Failed to build gem native extension. / Install Stanford Parser on Ubuntu 10.04

First, download the Stanford Parser. Then put the contents of the downloaded folder into: /usr/local/stanford-parser/current

Next, make sure your shell has $JAVA_HOME set by adding the following lines to your shell's config file (/etc/bash.bashrc). The default-jdk package is required at this stage.

export JAVA_HOME=/usr/lib/jvm/java-6-openjdk/
export PATH=$PATH:$JAVA_HOME/bin
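
Before running the gem installs, it can help to confirm that the variable really points at a JDK. Here is a minimal Ruby sketch of such a check; the helper name java_home_problems is my own invention, and it only looks for bin/java under the given directory, nothing more.

```ruby
# Hypothetical helper: list the problems found with a JAVA_HOME value.
# An empty list means the directory exists and contains an executable bin/java.
def java_home_problems(java_home)
  return ['JAVA_HOME is not set'] if java_home.nil? || java_home.empty?
  return ["#{java_home} is not a directory"] unless File.directory?(java_home)

  java = File.join(java_home, 'bin', 'java')
  return ["#{java} is missing or not executable"] unless File.executable?(java)

  []
end

problems = java_home_problems(ENV['JAVA_HOME'])
if problems.empty?
  puts "JAVA_HOME looks fine: #{ENV['JAVA_HOME']}"
else
  puts problems
end
```

Run it from a fresh shell so that the exports in /etc/bash.bashrc have been picked up.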

Connect as root (sudo -s) and run:
export JAVA_HOME=$JAVA_HOME
gem install rjb
gem install treebank

That's it, you are ready to use the Stanford Parser.

irb(main):001:0> require 'rubygems'
=> true
irb(main):002:0> require 'stanfordparser'
=> true

20101112

install ruby Linguistics with wordnet and linkparser / Failed to build gem native extension.

In one of my projects I needed to install Ruby Linguistics.
After a lot of googling, here is how I got it working.
I am assuming that you already have ruby installed, and most probably ruby-dev, rubygems, make, gcc, etc.

First, install Linguistics:
sudo gem install linguistics
Next you will need linkparser, which requires link-grammar:
wget http://www.abisource.com/downloads/link-grammar/4.7.0/link-grammar-4.7.0.tar.gz
tar zxvf link-grammar-4.7.0.tar.gz
cd link-grammar-4.7.0
./configure
make
sudo make install
sudo gem install linkparser

The ruby-wordnet part is a nightmare to install. Basically, you need to download the WordNet database files, install Berkeley DB and the Ruby extension that reads it, and then convert the WordNet files into Berkeley DB format. Phew!

Download the WordNet database:

wget http://wordnetcode.princeton.edu/3.0/WNdb-3.0.tar.gz
tar -xvzf WNdb-3.0.tar.gz


Install Berkeley DB:
sudo aptitude install libdb4.6 libdb4.6-dev libdb-ruby1.8

Check that the file /usr/include/db.h exists.
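
If you prefer to script that check, here is a small Ruby sketch. The function name header_installed? is made up for this post, and the default path is simply the one mentioned above.

```ruby
# Hypothetical helper: true when the Berkeley DB header is a regular file.
def header_installed?(path = '/usr/include/db.h')
  File.file?(path)
end

if header_installed?
  puts 'db.h found'
else
  puts 'db.h is missing, check that the Berkeley DB -dev package installed correctly'
end
```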

Convert the WordNet files into Berkeley DB:
svn co svn://deveiate.org/Ruby-WordNet/trunk
cd trunk
ruby convertdb.rb
and answer the interactive prompts.
It will ask you to confirm and to give the path of the "dict" folder extracted from WNdb-3.0.tar.gz.

Voilà!

irb(main):001:0> require 'rubygems'
=> true
irb(main):002:0> require 'bdb'
=> true
irb(main):003:0> require 'linguistics'
=> true
irb(main):004:0> require 'linkparser'
=> true
irb(main):005:0> require 'wordnet'
=> true
irb(main):006:0>
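
The same check can be run as a script instead of typing each require into irb. This is only a sketch: load_failures is a name I made up, and it catches LoadError only, so any other kind of failure will still surface as an exception.

```ruby
# Hypothetical helper: try to require each library and collect the ones
# that fail, mapped to their LoadError message.
def load_failures(libraries)
  failures = {}
  libraries.each do |name|
    begin
      require name
    rescue LoadError => e
      failures[name] = e.message
    end
  end
  failures
end

failures = load_failures(%w[rubygems bdb linguistics linkparser wordnet])
if failures.empty?
  puts 'All libraries loaded.'
else
  failures.each { |name, message| puts "#{name}: #{message}" }
end
```

An empty result corresponds to the irb session above, where every require returns true.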

20101109

Download Business intelligence Pentaho - Free Edition

Pentaho is arguably the open-source BI leader.
A lot of organizations depend on the Pentaho BI Free edition to make good business decisions.
Download the Pentaho BI Free edition and you could cut your costs by as much as 90%! To download Pentaho:

Download pentaho - Free Edition

20101024

Bus - Tube London Timetable 2010

I've been looking for the bus and tube timetables for London.
I finally found them here: journey planner timetables

Each XML file contains the following data elements:

- StopPoints
- RouteSections
- Routes
- JourneyPatternSections
- Operators
- Services
- VehicleJourneys
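
To get a feel for one of these files, you can count how many entries each of the sections listed above holds. Here is a rough Ruby sketch using REXML. The sample document is invented and heavily trimmed: the root element name and the child element names are assumptions on my part, and real files are far larger. The function name section_counts is made up too.

```ruby
require 'rexml/document'

# Invented sample; root and child element names are placeholders.
SAMPLE_TIMETABLE = <<-XML
<TransXChange>
  <StopPoints><StopPoint/><StopPoint/></StopPoints>
  <RouteSections><RouteSection/></RouteSections>
  <Routes><Route/></Routes>
  <JourneyPatternSections/>
  <Operators><Operator/></Operators>
  <Services><Service/></Services>
  <VehicleJourneys><VehicleJourney/><VehicleJourney/><VehicleJourney/></VehicleJourneys>
</TransXChange>
XML

# Map each top-level section name to the number of child elements it holds.
def section_counts(xml)
  doc = REXML::Document.new(xml)
  counts = {}
  doc.root.elements.each do |section|
    counts[section.name] = section.elements.size
  end
  counts
end

section_counts(SAMPLE_TIMETABLE).each do |name, count|
  puts "#{name}: #{count}"
end
```

For a real file, pass File.read on its path instead of the sample string. REXML loads the whole document into memory, so very large files may call for a streaming parser.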

Enjoy