January 31st
Check out Apache Tika 0.6 . It's hot off the presses and contains even more goodies for mime type detection and content analysis than before. And it's got 30% more Apache goodness than the...
January 22nd
It's official. NASA has its first sorta official Apache project!
At this stage, Apache's official policy is to not do official PR announcing the project (we are kind of at step 2 in...
December 31st
So, lately, I've been pretty involved in Spatial SOLR . I'm just scratching the surface of trying to understand this stuff. There was an interesting plugin though posted recently by Mat Brown...
I announced a while back that Tika 0.5 is available for downloading. Get it while it's hot . Notable changes include moving to a source only release this time, improved RDF and OWL parsing and...
24-13, over Boston College . They weren't super impressive, but USC got it done. We'll take the win and hopefully ride it into some BCS-level success next year. Fight on guys!
Look, even...
December 7th
I'm not sure why I'm even bothering posting.
Have fun in the Emerald Bowl , USC .
Sigh.
November 15th
Not much to say, except:
Stanford, you are classless.
Harbaugh, you are classless.
USC defense, you suck.
USC offense (yes, you Barkley), you suck.
Joe McKnight, you rule.
...
August 29th
The more and more meetings I've attended recently including some at major US funding institutions have included debates between two factions of people:
Those that would group...
August 16th
I had the privilege of being the release manager for yet another Apache Tika release - 0.4 is out the door and has a number of major improvements over prior releases, including a major...
July 2nd
12 days until HBP: come on al-friggin' ready! Been waiting for seemingly years!
April 24th
Hanging out in Spain for a week was a blast. Visited the Royal Palace, several museums (including the Prado), the Basilica church, and took a day trip to Toledo.
At this point, just looking...
March 20th
Apache Tika , a sub-project of Apache Lucene , and a toolkit for content analysis and detection, has just made its 0.3 release.
You can grab the release from a nearby mirror here .
January 1st
I must admit. I originally was a bit disappointed that USC wouldn't be playing in the national championship game. I mean, if not for a shitty first half, the worst possible first half they could...
December 10th
Apache Tika 0.2 has recently been released!. Thanks to Dave Meike for leading the charge.
You can grab Tika 0.2 here . Of note is that Tika recently graduated out of the Incubator and is...
December 4th
...comes from Dave Woollard, who wins the award for interesting blog name, Macgyver Was Here .
Welcome to the blogsphere, Mr. Woollard!
November 27th
Hope everyone out there has a fantastic turkey day!
November 26th
After what seemed like years (yet what was probably months), the blog, and Pagemewhen environment, is back online.
Interestingly enough, after total disaster with the old system (baron...
January 14th
I recently was the release manager for Apache Tika 0.1-incubating. What is Tika?
Tika is a toolkit for parsing binary content, and extracting its Metadata, and making sense out of...
January 12th
Way to go USC!
USC 49, Illinois 17
Another Rose Bowl Victory. Now, if the AP would just vote USC number 1, and create another BCS controversy!
December 1st
Great job, USC 24, UCLA 7
Headed to the Rose Bowl as usual: or are we?