Netcraft 2008 survey shows AOLserver is far from dead

The Netcraft February 2008 Web Server Survey says:

Unusually, America Online’s open source AOLserver sees tremendous growth, jumping from 35 thousand to 105 thousand sites in just one month. AOLserver is a multithreaded, Tcl-enabled web server which can be used for large scale, dynamic web sites, but has not seen the release of a new version since 2006. The majority of the new sites served by AOLserver are hosted in Poland.

This isn’t going to make any headlines, but for all those doubters out there who keep wondering who actually uses AOLserver, check out that growth.

The sad thing is, we could easily game the Netcraft survey by doing a few simple things:

  1. Register to become an ICANN accredited domain registrar.
  2. Offer domain registration at-cost, to offer the lowest possible prices to get customers.
  3. Offer free web domain parking or static file only hosting, all on AOLserver.

The costs involved would be around $10K for the first year, plus the cost of the actual AOLserver hosting, plus the $70K working capital in reserve to meet ICANN’s requirements. This could all be set up on Amazon EC2/S3 to avoid having to provision real hardware as the customer demand grows.

Of course, what would be the point? Would having more significant numbers in the Netcraft surveys give AOLserver more credibility? I sure hope not–that would be foolish.

(via Mark Mcgaha)

Adding an “Unsubscribe” button to Google Reader using Greasemonkey

Yesterday, Maki asked on Twitter, “I would love to have a ‘Quick unsubscribe’ button for Google Reader…a greasemonkey script would be terrific. Anyone wanna do one?” Of course, it seems I have a thing for hacking out little web toys for people I know on Twitter, so I gave it a whack.  Here’s the result:

Here’s a screenshot showing what it does:

Google Reader Unsubscribe button screenshot

It’s a pretty simple script. The hardest part was reverse-engineering Google Reader’s packed/minified JavaScript, but that wasn’t too hard.

Do you have a useful idea for a web toy? Perhaps if you run it by me and it interests me, I’ll hack on it for you, too.

Tags: , , ,

LOL meme makes John 4 better

The gospel lesson this past week was on John 4, where Jesus meets the Samaritan woman. Our church uses this lectionary insert which uses the NRSV translation of the Bible. As our pastor read the gospel lesson, I couldn’t help but LOL at verse 11:

11The woman said to him, ‘Sir, you have no bucket, and the well is deep. Where do you get that living water?

The first thought my mind produced, was, “Jesus, you has no bukkit.” I couldn’t help it, my mind went into full LOL mode. This was the internal dialogue that went on inside my head:

JESUS: GIMME DRINK.
WOMAN: LOL! WTF?
JESUS: GIMME DRINK, NAO!
WOMAN: O RLY? U HAS NO BUKKIT. HOW U GET WATER?
JESUS: YA RLY! I DRINK UR MILKSHAKE! I DRINK IT RITE UP! WE CAN BE BFF!
WOMAN: NO WAI! GIMME WATER!

JESUS: I BEEN WATCHIN U FAP.
WOMAN: OMG! I SEE WHAT U DID THAR.

Yeah, pray for my immortal soul, or something.

(Before all you bright people point it out, yes, I know about the LOLcat Bible and there’s even a translation for John 4 already.)

Tags: , , , ,

del.icio.us/dossy links since February 18, 2008 at 09:00 AM

del.icio.us/dossy (RSS) links since February 18, 2008 at 09:00 AM:

Hand-decoding tcpdump’ed SNMPv1 packets

I can’t go into too many details about what I’m working on right now, but it involves processing SNMP requests and generating SNMP responses. The hard part is that I’m writing all my own code to parse and process them, as I can’t use the Net-SNMP library to do it for various reasons. (Trust me, I wish I could.)

What makes this so much fun (not!) is having to debug and troubleshoot my code by hand-decoding the SNMP messages, captured off the network using tcpdump. If you’ve never done this, I’d compare it to performing long division on really large numbers. It’s not particularly hard, but it sure is tedious. Here’s an example of a pair of request/response packets that I’ve hand-annotated:

Screenshot of hand-annotated tcpdump of an SNMP request/response pair

Yeah. This certainly isn’t one of the glorious parts of software development. But, it needs to be done, right?

Tags: ,

Just in time to make me go “d’oh!”

Yesterday, I griped about Mahalo not fitting my definition of social search. Today, the latest release of Mahalo Follow is announced on the Mahalo Blog. In my previous blog post, I asked:

Is there a product out there that combines all this data and uses it to enrich search results in realtime?

Wouldn’t you know it, but that’s exactly what the latest version of Mahalo Follow tries to deliver, in a way. It will now rewrite a Google search results page (SeRP), injecting links from Mahalo data. This is exactly the kind of search engine enhancement that the algorithmic search engines really need. Well done, guys!

Of course, there’s still a huge gap of opportunity for improvement, here. Mahalo still isn’t leveraging the Web 2.0 network effect that is possible through social software. You can recommend a particular page to your friends and possible inclusion in the Mahalo data set. But, the real victory is when I can vote/indicate when I believe any particular search result is relevant or not for the search query I just performed. Then, to complete the circle, when I perform searches, highlight and/or bury the results based on what my friends have voted on.

In a way, the Spock folks have already implemented this, but have limited it to just people (for now?) … on a SeRP for a person on Spock, there’s a section called “Other Results from the Web” which is populated with results from Google. Then, as a registered Spock user, I can vote on each result as being relevant to me or not. Other users can similarly vote and the premise is that “wisdom of crowds” will enable the most relevant results to rise to the top. Mahalo could do the same through Mahalo Follow, by enabling voting/recommendations directly off the Google SeRP, and using that collected information to improve future search results.

What about the problem of people trying to “game the system”? That’s where the “social” part of social software combined with the network effect come into play. As long as the software either only uses my friends’ recommendations–and perhaps second-degree friends, but definitely not third-degree friends or further away–I can avoid being affected by people gaming the system by being selective about who I make friends with. Some folks will friend everyone they can possibly find, some will friend no one at all, but those who fall in between will receive the most benefit and that class of users should be the majority.

This is also where reputation and trust come into play in social software: a person who has a reputation of recommending good links and is otherwise trustworthy will attract followers as they will want to benefit from that person’s activity in the system. A person who tries to game the system will eventually be self-selected out by losing friends in the system, until they receive no benefit from trying to game it at all.

Overall, the latest change to Mahalo Follow is a great first step in the right direction, and I’m sure the Mahalo team is busy working on the next set of changes already. I just hope they keep pushing in the direction of making Mahalo a real Web 2.0 social search service. It could be game-changing for the Internet and the way we search.

Tags: ,

del.icio.us/dossy links since February 4, 2008 at 09:00 AM

del.icio.us/dossy (RSS) links since February 4, 2008 at 09:00 AM:

Why is Jason referring to Mahalo as social search?

I probably don’t say it often enough, but Jason Calacanis is one of the role models I look up to when it comes to entrepreneurship. He’s also a great person and a great friend when I really needed it most. Let this serve as a disclaimer that any perceived hate in this blog entry is entirely unintentional.

I was reading his latest update about Mahalo and something really made me ask WTF? Here’s the key sentence:

[Jason Hines] pointed out all the salient issues, answered the burning questions, and come to the same conclusions we have over the past eight months since launch: social search has great potential.

“… social search has great potential.”

I thought I understood what Mahalo’s feature set currently is. I thought I understood what social software is. I thought I understood what social search is. But, that quote leads me to understand that Jason is suggesting that he thinks Mahalo is social search. Am I right, Jason?

Mahalo isn’t social search, in my mind. It isn’t even “Web 2.0” in my book because for me, Web 2.0 applications increase in value the more users perform the primary function of the application (i.e., network effect). Mahalo is a blend of yesteryear’s Yahoo! directory-oriented web categorization, presented in a manner that’s friendly to a generation of users who have been trained on how to better craft search queries to find what they’re looking for.

Perhaps I can quickly describe, at a low level of detail, what I consider to be a “social search” product, see if you agree, and then decide if Mahalo fits. Or, any other currently existing web product out there, for that matter.

Social search, in a nutshell, means:

  • Users create profiles, identify themselves to the system.
  • Users describe their relationships to other users in the system.
  • Users perform web searches using the system. The system tracks what results they click through to, and of those, users describe whether the result was useful in satisfying their search or not.
  • When users perform web searches, results their friends indicated were useful rank higher in results.
  • The more people search and provide feedback, the better the results will be for their friends when they search.
  • When users feel they have found the best result for their search, they can communicate their search query and the best result to their friends through the system.

This is social software because it creates an environment where people can define relationships that are meaningful to the application. The reason this fits my definition of Web 2.0 is that it continues to improve in quality (thus, increase in value) the more it gets used.

There’s plenty of social networking sites across many verticals already. There’s plenty of search engines already, and I’m sure they track all sorts of things including click-throughs. There’s plenty of sites where people can vote on links to pages and comment on them. There’s plenty of sites where people can bookmark links and share them with their friends.

Is there a product out there that combines all this data and uses it to enrich search results in realtime? I could see this as being done as a browser plugin … you authenticate to the social search service through it, then allow it to track your web surfing activity–which is probably the reason why Jason pushes Mahalo Follow really hard–and you then vote on pages that you like/dislike the way StumbleUpon lets you, with a quick yes/no. The plugin would handle actions on certain sites with special code, like when a user saves a page to their del.icio.us bookmarks or when they Digg it, letting the system know that they think the page is particularly valuable. This could even work for improving image search, when users search for images, in addition to presenting the images as results, there were some way to indicate “yes, when I searched for [xyz], these images from the result set were what I was looking for.” Expert systems in AI haven’t really gotten us very far, but this seems like a really good way to fake it.

Ultimately, the challenge would be to build a search index where pages users have expressed explicit preference for would be ranked higher in search results for that user and their friends. Right now, Mahalo either sends you to their very limited set of manually-created result pages or to one of various other search engines–but, it’s not clear if and how any of the Mahalo-collected data on its users is used to influence the order of the search results as returned by these other search engines. I don’t know if that would violate whatever agreement Mahalo has with them, or the ToS of these search services in general.

Look, I realize people are probably asking themselves, “why do we care what you think, Dossy? Jason’s the one with the millions of dollars from his self-made success, and you’re just picking nits here,” but doesn’t it seem odd that after 10 years, the search space ought to be mature enough that such a product would emerge and give all the old “Web 1.0” search engines a real run for their money, don’t you think?

Have I got this all wrong? Is there already a startup or product out there that I just haven’t heard about? I know several incomplete efforts have come and gone in the past, but I think the reason none of those have really taken a huge lead is because they need all these attributes I described in order to truly win. Just implementing one or two parts may be useful in a particular area, but it’s the whole package that will really change the way we use the web.

Tags: , ,

I got a fat stack of cash on my desk

Scrooge McDuck swimming in his vault

A few weeks ago, I was talking to a friend of mine who works for the Federal Reserve Bank of New York about how cool it must be to work around so much money. I mean, I could just imagine swimming around in the vault through a few hundred million dollars, just like old Scrooge McDuck.

Well, apparently there’s no special vault for swimming in at the Fed, but they do shred the cash that they remove from circulation. What’s even better, is that after they shred it, they package some of it up and it gets reused for various purposes. I guess they realized that shredded money would make a great gift for someone like me who finds this sort of thing really cool. So, my friend got one of these bricks of shredded cash for me–how awesome is that?!? Check it out:

Fat stack of cash

Apparently this fat stack of cash weighs about 2.2 pounds and contains the remains of approximately 1,000 notes. It contains a variety of shreds: $1, $5, $10, $20, $50 and $100. I don’t know what kind of distribution of notes they regularly shred, but the little blurb on the back label says, “If these shreds were only $100 notes the value would be $100,000.” C’mon, tell me it’s not cool to think you might have $100K in cash just sitting on your desk. Sure, you can’t spend it, but it’s still cool to me.

Tags: , , ,

Figment, “all bands, no music”

Figment: All bands, no music!

My friend Dan just told me that his brother Larry is spreading the word about a new site that just launched called Figment.

In a nutshell, Figment is a site where you can explore and discover new bands without having to listen to any of their crappy music–because, there isn’t any. The goal is to create bands and albums that other people like enough to become fans of, like Pastor Of Muppets or Josie and the LOLcats.

I don’t know if it’ll catch on, but it might be worth a chuckle or two to check out the creativity behind some of these faux bands.

Tags: ,