%HTMLlat1; %HTMLsymbol; %HTMLspecial; ]> Tagcloud
Roland van Ipen­burg
To be stolen or blogged
BETA

Tag­cloud

Wed­nes­day 29 April 2009 11:51

I've im­ple­ment­ed some sort of tag­cloud on this blog. It's not re­al­ly a tag­cloud since I don't tag my posts, but it looks like one. The tag­cloud is gen­er­at­ed by count­ing the oc­cur­ances of all the words in all the posts, and then stop­words and oth­er words I think add lit­tle val­ue to the cloud are fil­tered out. There are also some stem­ming is­sues re­solved like putting "play­ing" and "play­er" to­geth­er with "play". From the re­main­ing words the thir­ty or so with the most oc­cur­ances are then fed to HTML::TagCloud, and that gives a re­sult like this:

Not us­ing real tags means every oc­cur­ance of a word adds to the sig­nif­i­cance, so it's not based on how many posts are about some­thing, but how much con­tent is about some­thing. It also means the words are lim­it­ed to words, so "open" is show­ing up, but not "open source". That's prob­a­bly why "OS X" isn't in that cloud. But it's nice to see "fri­day" and "week­end" show up, and "Rot­ter­dam" man­aged to bub­ble up while "Am­s­ter­dam" didn't make it.

:

Book­mark this on De­li­cious

Add to Stum­bleUpon

Add to Mixx!

Share/Save/Book­mark


:

Com­ment/Con­tact
application away browser buy cool data days different flash game gta html ibook internet linux movie open play playstation possible run screen server side site stuff system train web windows work

Blog Posts (418)

Image Gal­leries

ipen­bug Last.fm pro­file

ipen­bug last.fm pro­file

Fol­low me on Twit­ter

Roland van Ipen­burg on face­book
Lin­ux Regis­tered User #488795
rolipe BOINC com­bined stats

Sub­scribe

Add to Google

Valid XHTML + RFDa Valid CSS! Hy­phen­at­ed XSL Pow­ered Valid RSS This site was cre­at­ed with Vim Pow­ered by Bri­co­lage! Pow­ered by Post­greSQL! Pow­ered by Apache! Pow­ered by mod­_perl! Pow­ered by Ma­son! Pow­ered by Perl Made on a Mac Pow­ered By Mac OS X XS4ALL This site has been proofed for ac­cu­ra­cy on the VISTAWEB-3000 Creative Com­mons Li­cense
This work by Roland van Ipen­burg is li­censed un­der a Creative Com­mons At­tri­bu­tion-Non­com­mer­cial-Share Alike 3.0 Un­port­ed Li­cense.
Per­mis­sions be­yond the scope of this li­cense may be avail­able at mail­to:ipen­burg@xs4all.nl.