Archive for January, 2009

Not Every Site on Google May Harm your Computer

Saturday, January 31st, 2009

If you did any searching this morning you may have run into oddly ubiquitous “This site may harm your computer” warnings. I don’t work directly on the anti-malware efforts, but I’m sure everyone on the larger search quality team feels the same, sorry for the inconvenience.

You can read more about what happened on the official Google Blog.

Now, if your site is still showing up with “may harm your computer” warnings, it probably because you have actually been compromised. It’s not always obvious – look for hidden iframes and other inserted code.

Word Clouds – what are they good for?

Friday, January 30th, 2009

ReadWriteWeb had an interesting post showing word clouds generated from Barack Obama’s inauguration speech. 

Obama Inauguration word cloud

But what are word clouds, and how are they useful? Word clouds visually represent the frequency or importance of a word in a given text. In President Obama’s speech, we can see from the cloud that he used words like “nation”, “new”, and “people” fairly often. You can use them to compare to texts in in a sort of qualitative way – does one text have a much sharper distribution than the other?

I would say that most of the time their primary purpose is aesthetic. I’m not convinced people really use them for anything other than as nice design elements – thought I think they have untapped potential. That’s why I created the Tag Altocumulus WordPress Plugin, to try to integrate tag clouds into a site’s navigation system in a way that’s actually useful.

To generate the clouds they used Wordle, a very cool site that lets you create your own word clouds from any text.  Wordle gives you options on color, font, and orientation and you can end up with some pretty nice looking clouds. I went ahead and generated one from my paper on Tagging and Searching:

Wordle: Tagging and Searching

It does look pretty cool. Wordle also will generate a cloud from any site with an RSS feed. Here’s the cloud for my site:

Wordle: Blog cloud

Drop me a note in the comments below if you make one for your site or find an interesting text to use.

A Twitter Experiment: 15 Movies, 30 Hours

Thursday, January 15th, 2009

I’ve been known to do geeky things.  For one, I’ve been experimenting with putting parts of my life on the web live via Twitter.  For another, I’ve been going to a 30-hour science fiction movie marathon with friends for the past 14 years.

It’s time to merge the two together in a Twitter Experiment this weekend.  Starting on Friday, 7 p.m. EST I’ll be posting updates to Twitter about the movies, ridiculous sci-fi plot devices, funny cracks from the crowd, and the general movie marathon experience.

Now for some questions and answers:

Q:  How can I follow along?

A:  Follow me on Twitter and watch the snippets roll in.  Alternatively, if you’re connected to me on Facebook you can watch my status updates, it’s the same thing.

Q:  I’m going to be there, how can I participate?

Let me know in the comments below, we’ll make it a thing.

EDIT:  Use hashtag #marathon34 in any Tweets.

Q:  Why would anyone have even the slightest interest in this?

A:  The CWRU Science Fiction Marathon is really an excuse for a bunch of sarcastic people to shout insults and rejoinders at a movie screen.  It’s like a huge, live-action, sleep-deprived version of Mystery Science Theater 3000.

Q:  No, I mean why would anyone have the slightest interest in you going to a movie marathon?

A:  Point taken, its not like I’m famous or anything (outside of being temporarily internet famous in Australia, of course).  Luckily many of my readers are friends, colleagues, and a bit geeky themselves. If you’re going to get a tiny-text-snippet tour through a science fiction marathon, though, I might as well be your guide – I have a fair knowledge of the genre, I used to be a movie reviewer, and I like to make sarcastic comments.

Q:  How is this possible?

A:  An iPhone, and WiFi or the regular data connection, that’s how.  I might also play around with my G1 phone with Android a bit.  If my connectivity fails for some reason, I reserve the right to basically give up and pretend I never even mentioned it.

One other thing I just can’t leave out of this post – when I mentioned this to my coworkers, they poked fun.  My coworkers at Google.  That’s right, I’m officially too geeky for Google.

My Photography Featured on Flickr?

Wednesday, January 14th, 2009

Small steel mill building set aside for a future museum This is cool – I’m not sure how the honor is bestowed, but I happened to be looking through Flickr’s Places feature and noticed that I’m a featured photographer for Cleveland.  You may have to reload a couple times, they only display two users at a time and there are at least 6 or so featured.

Perhaps not real fame but these are the sorts of inconsequential online vanity things that make me happy. I’d actually be even happier to know that I was chosen complete algorithmically, rather than by human editors, because it means some code somewhere though my pictures were good in some flawed, but slightly more objective way.

This is also an example of why I’m still using Flickr more than Google’s Picasa, even though I use the Picasa application to organize all the photos on my hard drive and the face tagging feature is really impressive.

Speaking of featured photographers (and yet another photo sharing website), check out Vinay’s photos on SmugMug. I just found out about his site and have been enjoying the pics.

How Geeky is your Taste in Music?

Tuesday, January 13th, 2009

You can tell you're in Silicon Valley when Borders has a A while back I created a group on Last.fm for Geek Music. Geek music isn’t a real genre, but it seemed to me that geeks were fairly passionate about their music and might have some similar favorites. I’m not just talking about genre stuff like math rock, nerdcore rap, or heaven forbid, filk. What do programmers listen to when they program? What do bookworms listen to when they read?

Well, we have the answer. And now that we have some numbers, we might as well see some analysis.

So, how geeky are your musical tastes? Click here to find out.

I clocked in at 83.82% geeky. That’s pretty good – can you beat me? Either way, post your score in the comments below.

Usability and Design of WiFi Interstitial Pages

Wednesday, January 7th, 2009

You’re out somewhere, maybe a coffee shop or an airport.  Suddenly, a man falls to the floor – you need to know – what’s the antidote for tricyclic antidepressants*?  You grab your laptop or mobile and use the local WiFi to look up the answer -

Only to be annoyed and inconvenienced by the interstitial “terms and conditions” page that the coffee shop and/or airport redirects your browser to.  Time is of the essence!  Why do they need a 2-minute flash animation to load an “I agree” button?

Thus ends my attempt to make a relatively boring subject seem interesting.  In any event, I can’t be the only person who’s noticed that WiFi login or “terms and conditions” pages are often way more complicated and annoying than they need to be.

In the hope that someone, somewhere will do a quick search before implementing one and come across this blog, here are some guidelines.  These all follow logically from supporting the user’s goal, which is to get info quickly, and the business’ goal, which is to attract and keep customers.

Do not require Flash, ActiveX, or god forbid Silverlight.  Nothing you could want to do here could possibly require it, and there are lots of WiFi-enabled mobile devices that don’t have it.  Do you really want to poke a finger in the eye of every customer with an iPhone?  If we’ve already bought iPhones,  we obviously like spending money!

Make the checkboxes / buttons large enough to click on a small device.  So that people with touchscreen PDAs, Blackberries, iPhones and G1′s will be able to touch it on their screens.

Make the page and any server-side code fast.  Stay out of the user’s way as much as possible.  If your code can’t accept a form and do some logging without dragging a user’s browsing session to a crawl, you need to go yell at your devs.

Once users agree to terms, save it in a cookie for a reasonable amount of time. Many mobile devices don’t allow programs to run in the background, which means session cookies can expire every time the device dozes off.

Once users agree, redirect them back to where they were trying to go.  And use a real, server-side http redirect too, not a fragile javascript redirect.  Many people set up their browsers to ignore those.

Make the legal terms and conditions as readable as possible.  Readability is an interesting topic, with lots of research into measurement formulas and the like.  I think we can all agree that an iframe with 1700 words of legalease is not readable or useful.  And while we’re at it, why not add some actually useful information, like bandwidth limits, disabled protocols, etc.  If you are blocking POP and IMAP, let me know so I don’t waste time trying to check my email.

While the crazy scenario at the beginning of the post isn’t very likely, this isn’t just a list of gripes.  Why are you providing WiFi if not as a service to customers or a way to differentiate?  Well, coffee shops with WiFi aren’t very unusual anymore, so you should look at it as part of the who customer experience.

* Why did I pick tricyclic antidepressants?  I did a Google search for poison antidotes and picked the first one with an antidote I thought you might be able to find, in a mad rush to save a beloved minor character’s life, like in a TV show.

EDIT:  thanks to Wysz for pointing out the massive numbers of typos.

Web-based genealogy software – any recommendations?

Friday, January 2nd, 2009

Desperation This past year we moved away from most of my family and added a new leaf to the family tree.  This has inevitably turned my thoughts to family history.  As expected from a guy who let the internet vote for his baby’s name and Twittered the delivery, I’m not going to be happy typing up a plain old document and mailing it out to family members.

I want to start collating a family history and collecting stories, photos, and other artifacts and I want to do it with a web app so that I can share with family spread all around the country.  Though I don’t have time for any hard-core genealogical research right now I’d like to set up a good framework in case anyone else in family catches the bug and finds themselves hunched over microfiche at the local LDS church.

So two of my main concerns are usability and openness.  Openness means having complete access and ownership of the data (so Facebook family tree apps are out) and compatibility with standard genealogical file types.

It would also me nice if it were written in a language I know like PHP, Java, or even Python in case I get the urge to write plugins or change the interface.

I know of two systems that might fit the bill, PhpGedView and GeneoTree, but I’m hoping to get some suggestions and recommendations before I start installing lots of stuff on my web server.  Has anyone done a project along these lines, or played around with this kind of software?

Please leave any input in the comments below.