Category Archives: Semanticweb

Just lately I’ve been really interested in finding out Google’s position on semantic technologies and their view on the Semantic Web.

I’d been asked before whether Google were making any efforts in developing semantic technology, but I couldn’t really say. Then I attended the Googleworld debate, at London’s ICA, but couldn’t really find the chance to pose any technical questions.

In an attempt to satisfy my curiosity – and anyway, to investigate something I believe to be of interest that, as far as I can find, hasn’t received any real attention to date – I wrote an open letter, of sorts, forwarded to Google and Semantic Web researchers I’ve found connected to Google, simply asking:

What’s the deal?

Dear Google,

My name is Marc Hibbins, I write a blog I’m sure you wouldn’t have read, I tend to cover new web technologies, online trends, my own development issues, but I’m also extremely interested in the Semantic Web.

Over the past couple of weeks I’ve become increasing interested in finding out Google’s position, or just their ideas even, on the Semantic Web and semantic technologies. I recently wrote about the increase in tech blogs covering the subject over the past couple of weeks, and I’ve been asked a few times – what’s Google up to?

I also recently attended a debate called ‘Googleworld’ – it covered, generally, the past ten years of Google and what’s to come. I wrote about it, and the chair of the meeting replied that he too, is unaware of Google’s position.

Could you shed any light on the topic? Having had a thorough look around online, I’ve found next to nothing. I’m extremely intrigued to find out if Google have any plans with semantic technology – or even if there’s any in place already that just might not be so visible?

If you’ve no plans, do you have any comments? Do you think it’ll even ever happen?

Kindest regards,

Marc Hibbins

I wasn’t sure what kind of response I’d get, if any at all. Or if anybody I did get in touch would be wary of offering any insight that might be misinterpreted as any ‘official’ position.

To my surprise, my first response came from executive ‘Google Fellow’ Jeff Dean. He works in the Systems Infrastructure Group (crawling, indexing and query systems – full bio here), but he couldn’t initially offer any real strong thoughts on the issue. He did say however, that he wasn’t sure if Google even had any real position on the subject at all. If nothing, at least this confirmed that my lack of findings wasn’t down to only an absence of research published externally from Google – or poor investigative work on my part.

My second reply was from Stefan Decker, professor at the National University of Ireland, Galway and director of the Digital Enterprise Research Institute, an internationally recognised institute in Semantic Web and web science research. He co-presented a very interesting Google Tech Talk last year, and worked in Stanford at the same group as Sergey Brin and Larry Page.

He said, very explicitly, that:

In short: The Google management does not believe in meta-data.

Craig Silverstein is on record several times negatively of talking about the topic, as well as Sergey Brin. It is very clear that they are not proactive – a serious mistake from my point of view.

Interesting. I got in touch with his co-speakers, Eyal Oren and Sebastian Kruk. Both said they have contacts at Google still, but neither are aware of any public developments.

Eyal pointed me toward Sindice, a semantic search engine and index as perhaps (though only speculatively – as likely any search engine), might one day receive interest from Google. Perhaps to incorporate their infrastructure for RDF and semantic data consumption. But as he said, there’s absolutely no evidence of it right now.

Sebastian on the other hand described the lack of address specifically as:

[Their] ‘anti-semantic’ approach.

An increasing trend he’s recognised. Suggesting an almost concious movement against any such development. He also expressed his disappointment at the very low turn out at the Tech Talk, that literally only one attendee showed any real interest.

My final response was initially the most exciting – from Ramanathan V. Guha, who leads development of Google Custom Search. He said he’d be happy to comment on what’s going on, although could only offer his own personal opinion and nothing official – but I’ve not received any correspondence from him since.

All in all, at least I know I’ve not overlooked anything major. Fingers crossed I get a response back from Guha, but otherwise I guess I’m left keeping a close eye out for any other developments.

Picked up a spare ticket to the ‘Googleworld’ debate at the the Institute of Contemporary Arts yesterday evening. Bill Thompson chaired a pleasantly ‘warm’ chat between New York Times columnist Randall Stross and Independent columnist Andrew Keen.

Initially I wasn’t sure what to expect, billed as a look back over the past ten years of Google and forward to whatever might come next, it wasn’t as technically oriented as I’d hoped it’d be. It more focused on social and philanthropical interest – as a well as being a bit of a sell for both their new books.

I would have liked the opportunity to open up discussion to Semantic technologies, perhaps to pose the question, What are Google’s intentions? – if they even have any – of introducing any Semantic Web technologies to their platform. It’s something I was recently asked about after writing my last post, but it wasn’t really the right crowd.

In other news, Semantic start-up Twine goes public today. Founder Nova Spivack, posted some interesting stats yesterday about user engagement on the site over the last eight months during it’s semi-public, semi-beta phase.

It seems their users queue up some lengthy sessions on the site, longer even, he now predicts, than Delicious and MySpace.

It’s been hard not to notice the influx of tech blog posts over the past week or so covering all things Semantic Web. I’ve not had a decent chance to talk about the Semantic Web since I wrote my thesis last year, so I thought I’d take this as a good opportunity to do so and collect some of the best of those links in the process.

It’s staggering how much the Semantic idea has grown since I wrote my dissertation. In it I discussed mainly the social, technical and theoretical concepts of the Semantic Web – when I wrote it there was little else around to write about. There were no public start-ups or beta platforms – the community was small and extended little outside a designated group at the W3C and a handful of tech bloggers – their efforts too, were mostly spent on translating Tim Berners-Lee’s very technical, comparatively abstract, web dream across to the mainstream reader.

At that point the W3C were rapidly beginning to develop technologies like SPARQL and OWL, while others were under varied debate, such as RDF and Microformats. Not having a background in hard computer sciences, my interest was more in exploring the ubiquity, connectivity, of a Semantic Web, investigating what forces we would need to drive the paradigm – how our ideas of the Web were already changing with coming to grips with terms like (the then brand-new) ‘Web 2.0′ and embracing the Social Web phenomenon.

Nova Spivack is a leading voice, CEO of Radar Networks, founder of Semantic start-up Twine – an information storage and knowledge sharing service. He writes frequently at his Minding the Planet blog, full of optimism and colourful metaphor. He recently gave a talk at the Bonnier GRID ‘08 conference in Stockholm – basically ‘the TED conference of Scandinavia’, about what he terms the future of the Web, ‘the Semantic Web and the Global Brain‘. Whilst worrying me by using many of the buzzing memes and science fiction references that I think actually harm those trying to invest in his optimism and adopt the Semantic idea – an all-knowing, understanding, artificially intelligent Web just sounds too good – it’s exciting how immediate he predicts the true impact of the Semantic Web impending.

Then it’s even more so to read popular ‘mainstream’ – or at least the more general – tech blogs giving more and more coverage to Semantic web, that the technology and abstract concepts are becoming commonplace and frequently in normal interest.

ReadWriteWeb, ever popular for polls and predictions favour Semantic technologies in various recent top ten-style looks into future web trends (and more here), but too, see the oncoming breakthrough as more immediate. Richard MacManus puts Semantic apps at number one on his hit-list of Web Predictions for 2008.

But while predictions continue to be made, the true killer-app remains allusive. Some do extremely well though. Freebase went public around the end of 2007, essentially a semantic Wikipedia, hasn’t gained the popularity I though it would’ve by now. True Knowledge natural language search is still in beta, though the platform I’ve tested so far is as impressive as their promotional video.

But then kinda out of the blue for me came two Yahoo! developments. SearchMonkey, not exclusively a Semantic Web app, is a search engine that promotes semantic data standards by making use of Microformats and embedded RDF as searchable metadata – definitely read the FAQ – and at the recent Web 3.0 Conference and Expo, announced the consumer release of Yahoo! Open Strategy (Y!OS), ‘blowin’ the doors wide open’ to the ‘open source, hacker attitude’ – basically, ‘rewiring’ Yahoo! to make all data service-wide openly available to developers and consumers alike, granting the opportunity for complete data portability – with the intent to extend even further in the future.

I concluded my thesis in suggesting a new drive would be necessary for consumers Web-wide to understand and willingly adopt the Semantic Web change. At that point the Web Science Research Initiative (WSRI) had just been launched, a joint venture between MIT and the University of Southampton to teach the literal academic science of the web – it was unclear whether this would be enough. Molly Holzschlag wrote a recent article at A List Apart, believing the ‘Ivory tower’ perception of the W3C to be discouraging to everyone independent of the organisation – that they’ve no real outreach. I agree, but think efforts like the newly founded World Wide Web Foundation are a direct result of their awareness of that, but they will fulfil their principle objective and speed the technological advancement faster than she may expect – I hope they do, at least.

For even more reading (and listening) material, subscribe to the Nodalities blog and podcasts. There’s a good interview with David Provost I recommend, discussing many of the things I’ve spoken about here, but also his recently published report on the Semantic Web industry as a whole.

It’s called, ‘On the cusp’. :)

Chrome wheeled, fuel injected.