Archive for the 'trends' Category

Tuesday, August 3rd, 2010

Heard, overheard, and misheard at Balisage

The opening day of the conference was not Balisage proper, but a separate symosium on “XML for the long haul”.

Some interesting tidbits overheard, in no particular order…

“it is not necessarily clear that this approach would capture the difference between the ridiculous and the merely implausible.”

Complexity — what is the relationship betwen complexity and long-term data storage?

“Narratives with fancy words in them”

How do you store, say, a video in a format that will be readable in 100 years?

Order of magnitude scale changes produce discontinuities

“The Da Vinci Schema”

Dandelion DNA (Free license)

“Indispensible” — “I don’t think that means what you think it does”

“Keeping electrons alive is really difficult”

“I wondered…with my Topic Map brain damage…”

-m

Monday, March 1st, 2010

Newsweek should never have been free

Andrew Zolli argues in Newsweek that online content should never have been free. I’m probably not the first one to make this profound observation–but if it were not for the free online edition of Newsweek (and link aggregator sites like Digg) I wouldn’t have read a single word of Newsweek in years, nor would I be linking to it as my previous sentence does… Maybe Newsweek is OK with that. -m

Thursday, December 10th, 2009

500th Post

Celebrating 500 posts since I went to WordPress in May 2006. Prior to that, an additional 730 posts as I floated through a typical evolution of blogging platforms:

  • Easy start: blogger (299 posts in 24 months)
  • Succumbing to the desire to roll your own (259 posts in 12 months)
  • Realizing that rolling your own is too difficult: Pyblosxom (172 posts in 12 months)
  • Moving to a mature platform you don’t need to worry about much: WordPress (500 posts in 42+ months)

-m

Thursday, July 2nd, 2009

And then there were one…

On May 8 I wrote:

it’s time for the W3C to show some tough love and force the two (X)HTML Working Groups together.

On July 2, the W3C wrote:

Today the Director announces that when the XHTML 2 Working Group charter expires as scheduled at the end of 2009, the charter will not be renewed. By doing so, and by increasing resources in the Working Group, W3C hopes to accelerate the progress of HTML 5 and clarify W3C’s position regarding the future of HTML.

The real test is whether the single HTML Working Group can be held to the standard of other Working Groups, and be able to recruit some much-needed editorial help from some of the displaced XHTML 2 gang.  -m

Tuesday, April 7th, 2009

GPL’s Cloudy Future

I enjoyed this post, from Jeremy Allison as it turns out. It talks about how GPL software is “the new BSD” when it comes to cloud computing, since redistribuion of the software doesn’t happen, and thus doesn’t trigger the relevant clauses of the GPL. Any old company can use, re-use, and modify the software without sharing the code in the original spirit of the license. The community’s response–something I need to keep a closer eye on–is the AGPL, or Affero license. It works similarly to the GPL, but is triggered by remote use of the software, not just distribution, preserving the work’s copylefedness even in cloud computing situations. -m

Saturday, April 4th, 2009

On YouTube’s bandwidth and Technologizer’s problem with basic estimation

This article states:

The analysts determined YouTube’s bandwidth costs by assuming that 375 million unique visitors would visit the site in 2009, with 20 percent of those users consuming 400 kilobits per second of video at any given time. That works out to 30 million megabits being served up per second. That’s a heck of a lot of bandwidth to devote to videos of sneezing pandas.

Do you honestly believe that YouTube is sending out 30 petabits per second (to put it another way, fully saturating over 200,000 OC3 connections)? That on average, every single user who counts as a unique visitor in 2009 spends 20% of 24hrs = 4.8 hours actually downloading video, every day of every week?

Gesundheit. -m

Update: the quoted article indeed gets it wrong, though it appears the original Credit Suisse analyst report was estimating peak usage, not a running average. Still doesn’t smell right. Updating the article and title to point the finger at the right people.

Thursday, February 12th, 2009

Geek Thoughts: the downside of free energy

Steorn is making noise again about the free energy device they claim to have invented. The proper scientific attitude to have toward such claims is skepticism, though most responses (always from individual who have never seen it) goes well beyond that.

But think of the downside if every phone, iPod, refrigerator, car, air conditioning unit, factory, etc. comes to contain a perpetual energy source. Total energy use would skyrocket, and all that energy still has to go somewhere, so it ends up as waste heat. Global warming on an unprecedented scale ensues.

There’s more. If overabundance of energy is the problem, it’s a mere engineering challenge to build planetary-scale air coolers, beaming waste heat out into space. Imagine an advanced civilization that’s already doing it. From a distance their planet might look nothing like what current exoplanet researchers are looking for.

There’s enough here for several novels. If you had unlimited energy resources, what would you like to see built?

More collected Geek Thoughts at http://geekthoughts.info.

Monday, December 1st, 2008

Where have all the acorns gone?

First the bee colonies start to disappear. Next, acorns. Does anyone have a map of the acorn-devoid areas? -m

Friday, November 7th, 2008

Sign of the times

I got a call today from a pushy recruiter. That’s nothing new. What’s different is that she was not looking for the usual resume, but rather desperately trying to place candidates. (Or maybe it was just social engineering…)

Is anyone else seeing a reversal in recruiter cold-call strategies? How flooded is the tech job market at this point? -m

Thursday, October 23rd, 2008

RDFa is a Recommendation

Haven’t mentioned here that RDFa is a W3C Recommendation. I’m thrilled that something that I’ve been thinking about for a while is ready for prime time.

Also, as of this writing the first page of results at Google still prominently links to a terribly outdated draft of the spec. The first page of results at Yahoo! nails it. Just sayin’.

-m

Sunday, September 21st, 2008

My economic plan

Levy a $24,000, one-time tax, payable in installments over 10 years, against anyone who took out an interest-only mortgage (or various other high-risk instruments) during the previous 10 years, using the full nasty power of the IRS to collect (garnishing wages, etc.)

Take the proceeds and give it to homeowners who did NOT engage in high-risk activities as a tax refund.

Since taxpayers will be bailing out wall street anyway, why not move the blame closer to where it belongs? -m

Wednesday, August 6th, 2008

Happy 0×40 Anniversary, Mark I

It’s been 0×40 years since the dedication of the Mark I. Wired has some great photos and background information. Less than a year later, Vannevar Bush would advance the state of the art with his article As We May Think. A year-and-a-half later, ENIAC unveiled, and with it Turing-completeness. And things have been speeding up ever since. -m

Tuesday, August 5th, 2008

Two miles barefoot

Without the bike commute, I’m back to barefoot running for exercise. I can now do a stretch of 2 miles on asphalt with no problems (other than sore calves). Why barefoot? Because it feels better, and it’s ultimately easier on the joints. The human biomechanical system does excellent work if you let it, and is easily capable of soft landings via shock-absorption in the knees, ankles, and musculature. In contrast, when one wears shoes, it’s too easy to slam the feet down and let the padding (attempt to) take care of impact management.

If you want to get started, go slowly. For a month I only walked, starting out with very short distances. I’m at the point now when I see a new texture of carpet, hardwood, or other floors, I’m tempted to kick my shoes off and sample.

Alas, I don’t think I’ll be ready for the Nike+ Human Race 10k coming up on August 31. (And I wonder how many runners in a shoe-company-sponosred race will be barefoot) :-) -m

Thursday, July 31st, 2008

The rare benefit of easily-mockable names

From the observing-the-human-condition department.

Seems I have a hard to pronounce name. For the record my first name has a long I; it’s MY-ka, not MEE-ka. When someone gets it wrong, I don’t hold anything against them. Afterall, how to pronounce any given name is pretty arbitrary.

But there are a few names that are easily mockable. Either a letter of two off from a word with a less favorable meaning, or a shared name with an infamous person–a few notable examples being a popular figure in the XML world and one of the current U.S. presidential candidates. I find that in these infrequent cases, an easily-mockable name can be a useful thing, since it allows me to immediately flip the bozo bit on any commentator/blogger/reporter who chooses to engage with the mockery. Without such low-hanging invective, sometimes it’s harder to tell when somebody is an idiot.

If you want to constructively criticize someone, go for it. But make sure to use founded, fact-based arguments that can stand on their own without resorting to childish attacks. Go forth and na-na-na-boo-boo no more. -m

Monday, July 28th, 2008

eRDF 1.1 Proposal Discussion

The W3C RDFa specification is now in Candidate Recommendation phase, with an explicit call for implementations (of which there are several). Momentum for RDFa is steadily building. What about eRDF, which favors the existing HTML syntax over new attributes?

There’s still a place for a simpler syntactic approach to embedding RDF in HTML, as evidenced by projects like Yahoo! SearchMonkey. And eRDF is still the only game in town when it comes to annotating RDF within HTML-without-the-X.

One thing the RDFa folks did was define src as a subject-bearing node, rather than an object. At first I didn’t like this inversion, but the more I worked with it, the more it made sense. When you have an image, which can’t have children in (X)HTML, it’s very often useful to use the src URL as the subject, with a predicate of perhaps cc:license.

So I propose one single change to eRDF 1.1. Well, actually several changes, since one thing leads to another. The first is to specify that you are using a different version of eRDF. A new profile string of:

"http://purl.org/NET/erdf11/profile"

The next is changing the meaning of a src value to be a subject, not an object. Perhaps swapping the subject and object. Many existing uses of eRDF involving src already involve properties with readily available inverses. For example:

<!-- eRDF 1.0 -->
<img class="foaf.depiction" src="http://example.org/picture" />

<!-- eRDF 1.1 -->
<img src="http://example.org/picture" class="foaf.depicts" />

With the inherent limitations of existing syntax, the use case of having a full image URL and a license URL won’t happen. But XHTML2 as well as a HTML5 proposal suggest that adding href to many attributes might come to pass. In which case this possibility opens:

<img src="http://example.org/picture" class="cc.license"
href="http://creativecommons.org/licenses/by/2.0/" />

Comments? -m

Tuesday, July 8th, 2008

Save up to 65% on Capacitors and Resistors

Nope, not spam. You can now order electronic components from Amazon, advertised right on the front page for me. What can’t you get on Amazon? -m

Saturday, July 5th, 2008

My Horrible Realization…

The old rule: only even-numbered Star Trek movies are any good.

The new rule: only odd-numbered Indiana Jones movies are any good.

-m

Thursday, July 3rd, 2008

Yahoo! now indexes RDFa

I haven’t seen an announcement about this, but try the following query on Yahoo Search: [searchmonkeyid:com.yahoo.rdf.rdfa] (link). It shows documents containing RDFa, with Digg at the top. Since this is a Searchmonkey ID, it’s also usable in Searchmonkey to actually extract the metadata and use it to customize search results.

Does your site use RDFa yet? -m

Tuesday, June 17th, 2008

Yahoo! Mobile: outgunned and outflanked

According to Ars Technica, Google captured 61% of mobile search market share in the first four months of 2008. Yahoo! came in at a distant 18%, so pretty much reflecting desktop search market share. This is due, of course, to Google being the default provider on the iPhone, and the iPhone being the biggest bulk of mobile internet usage.

So Jerry (or whoever is on deck as CEO), you should probably look into this mobile thing and see what’s up with leadership there and whether anything is salvageable… -m

Monday, June 16th, 2008

Netflix giveth, Netflix taketh away

Here’s something I’ll bet you didn’t know. Netflix has gone on record as saying that although their Instant View library, viewable online or via the hardware Roku player, is much smaller than their DVD library, they’re working hard on closing the gap. For instance, one quote says “adding titles at light speed”. But some titles are disappearing over time.

Just today, the Leslie Neilsen flick Wrongfully Accused (the tale of the one-eyed, one-legged, one-armed man) went offline as of today. Yesterday I watched it for free online. I’d really hate to see the kinds of negotiations that must be going on in back rooms between the studios and distributors these days…

If you have an Instant View queue, check it out. Under the “Availability” column, check for dates when your selections go offline. Blank means it’s safe for the time being. -m

Monday, June 9th, 2008

The comedy stylings of the Windows Vista Blog

For instance, The Business Value of Windows Vista. Seriously, Vista for “speed and security”? Or mobile? The comments on this post alone are worth the click. -m

Sunday, May 4th, 2008

Weekend Project: save $75/month with Ooma

New gear, an Ooma VOIP box. I plan to post more technical details soon, but the short story is that you get a sleek little box that goes between your dsl or cable modem and your router, and you get unlimited local and long distance calling. For free. For life (or 3 years, according to the TOS). Check out the Flickr set of the unboxing experience.

WIth this, I plan to turn off my landline, to the tune of about $35 a month, and by not using our mobile phones for so much long distance, reduce the calling plan for another $40 a month. The one-time cost for the box set me back about $231, so I will be even in just over 3 months. (Only recently, these things were retailing for $599.)

How do these guys stay in business? I’ll write more about this too, but the short story is that bandwidth is really, really cheap, monopolistic efforts of telecom companies notwithstanding.

So far I’m really happy with it. The online Ooma Lounge isn’t as good as Vonage’s system–for one thing, you can only see voicemails, not any kind of call logs. But the features that are there Just Work. The documentation is short and simple but thorough. Setup was a breeze.

Have you tried Ooma? Comment below. -m

Sunday, April 27th, 2008

Is there an inverse to the Innovator’s Dilemma?

Roughly speaking, the innovator’s dilemma happens when a product progressively gets more and more advanced features, to the point that it misses out (by listening to customers) on an entire new opportunity. At that point, a simpler, competing product can come into play and make large gains.

But what happens when a company is generally aware of the Innovator’s Dilemma and tries to compensate? It seems like second order effects might come into their own. A product widely known for being (and remaining) minimalist is exposed to attacks from deliberate enhancements and related complexificaiton of competitive products. As the market gets more mature, the steadfastly-simple market leader gets left behind. In a sense, it’s a role reversal from what Clayton Christensen describes. But can it work out the same in the end? Please comment. -m

Saturday, April 26th, 2008

Deadlines and connections

I’m not involved in the the corporate wrangling about Microsoft and Yahoo! talks. Which leaves me relatively free to comment on it. [Disclosure: I am, not too surprisingly, a Yahoo! shareholder.]

Lots of things have been happening lately. A deadline of, well, today. Talks of Google adsense trials. And all kinds of merger speculation involving Rupert Murdoch in some fashion, or else AOL.

But I haven’t seen anyone point out this connection: Google owns 5% of AOL, having invested a billion bucks and taken over search there a couple of years ago. So if Yahoo! and AOL merged, there would already be a Google advertising connection in place. Running pre-trials now is just due dilligence on something that might happen anyway.

Having both an in-house advertising network and an outsourced one has some advantages too, namely in the form of “knobs” that can be adjusted to tune margins as conditions warrant. And maintaining the in-house system keeps Google honest and makes sure that relatively good deals can be negotiated in the future.

Lots of pundits talk about regulatory scrutiny, but honestly, it’s been years since any antitrust machinery in this country has been effective. And the recent spectrum auctions showcased Google’s skill at turning regulatory tables in their favor. If it came down to it, the smart people on both sides of the table shouldn’t have a problem crafting an agreement in a way that meets muster, even in the stricter EU.

Summary: based solely on public reports, it seems like the AOL connection might be a credible threat to Microsoft’s appetite. The ball is firmly in Steve’s court now. We’ll see what he does.

Friday, April 11th, 2008

Google App Engine dwarfed

Thanks to chromatic for the link. Largely hidden,  largest app clusters of this particular platform can:

Control over a million computers and can deliver over a hundred billion advertisements per day.

However, “don’t be evil” is not a part of this particular platform’s strategy… -m

Thursday, April 3rd, 2008

US Census == paper technology

Never let anyone say that forms are easy. What seems like a boring, tedious topic on the surface is surprisingly deep and challenging. As evidence, the multi-billion-dollar plan to modernize the US census in 2010 has fallen back to paper technology. Sadly their plans didn’t involve XForms.

Highly-critical applications, like say voting, are even more difficult to get right. Possibly the government will get it in shape be 2020 or 2030. -m

Thursday, March 13th, 2008

The (lowercase) semantic web goes mainstream

So today Yahoo! announced a major facet of what I’ve been working on lately: making the web more meaningful. Lots of fantastic coverage, including TechCrunch and ReadWriteWeb (and others, please link in the comments), and supportive responses and blog posts across the board. It’s been a while since I’ve felt this good about being a Yahoo.

So what exactly is it?

A few months ago I went through the pages on this very blog and added hAtom markup. As a result of this change…well, nothing happened. I had a good experience learning about exactly what is involved in retrofitting an existing site with microformats, but I didn’t get any tangible benefit. With the “SearchMonkey” platform, any site using microformats, or RDFa or eRDF, is exposed to developers who can enhance search results. An enhanced result won’t directly make my my site rank higher in search, it it most certainly make it prone to more clicks, and ultimately more readership, more inlinks, and better organic ranking.

How about some questions and answers:

Q: Is this Tim Berners-Lee’s vision of the Semantic Web finally getting fulfilled?

A: No.

Q: Does this presuppose everybody rushing to change their sites to include microformats, RDF, etc?

A: No. After all, there is a developer platform. Naturally, developers will have an easier time with sites that use official and community standards for structuring data, but there is no obligation for any site to make changes in order to participate and benefit.

Q: Why would a site want to expose all its precious data in an easily-extractable way?

A: Because within a healthy ecosystem it results in a measurable increase in traffic and customer satisfaction. Data on the public web is already extractable, given enough eyeballs. An openness strategy pays off (of which SearchMonkey is an existence proof).

Q: What about metacrap? We can never trust sites to provide honest metadata.

A: The system does have significant spam deterrents built in, of which I won’t say more. But perhaps more importantly, the plugin nature of the platform uses the power of the community to shape itself. A spammy plugin won’t get installed by users. A site that mixes in fraudulent RDFa metadata with real content will get exposed as fraudulent, and users will abandon ship.

Q: Didn’t ask.com prove that having a better user interface doesn’t help gain search market share?

A: Perhaps. But this isn’t about user interface–it’s about data (which enables a much better interface.)

Q: Won’t (Google|Microsoft|some startup) just immediately clone this idea and take advantage of all the new metadata out there?

A: I’m sure these guys will have some kind of response, and it’s true that a rising tide lifts all boats. But I don’t see anyone else cloning this exactly. The way it’s implemented has a distinctly Yahoo! appeal to it. Nobody has cloned Yahoo! Answers yet, either. In some ways, this is a return to roots, since Yahoo! started off as a human-guided directory. SearchMonkey is similar, except a much broader group of people can now participate. And there are some specific human, technical and financial reasons why as well, but I suggest inviting me out for beers if you want specifics. :-)

Disclaimer: as always, I’m not speaking for my employer. See the standard disclaimer. -m

Update: more Q and A

Q: How is SearchMonkey related to the recently announced Yahoo! Microsearch?

A: In brief, Microsearch is a research project (and a very cool one) with far-reaching goals, while SearchMonkey is targeted as imminently shipping software. I frequently talk to and compare notes with Peter Mika, the lead researcher for Microsearch.

Monday, March 10th, 2008

Dear readers…

You are awesome. Just sayin’. -m

Monday, March 10th, 2008

Getting what you asked for

Some time ago, Doug Crockford’s excellent blog pointed me to this page on “excessive DTD traffic” at the W3C. Go ahead and follow that link, I’ll wait…

All the standard templates that show how to construct a basic XHTML page include a public identifier of http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd and often a namespace name of http://www.w3.org/1999/xhtml. As the blog points out, these are not actually hyperlinks, they only play them on TV. Huge quantities of software are requesting these URLs 24×7, putting a load on their servers. Often times this results from unfortunate defaults in off-the-shelf XML components such as parsers.

But what did you expect?

This is the web equivalent of having a front-desk receptionist hand out a stacks of self-addressed, stamped postcards, then complaining about how much mail the company gets from all around the world.

HTTP URLs are great for identifiers on a technical basis: they are based on DNS names and have the important qualities of uniqueness and persistence. But as far as human factors go, they are a terrible choice (though with a great deal of inertia at this point). -m

Tuesday, February 26th, 2008

Yahoo! Announces Open Search Platform

As spotted on TechCrunch, full article. This is a game-changer folks. Check out the comments attached to the article. -m