Some Google usenet fun. Go to the main Google Groups page [1].
Use the search functionality to search for "tim smith csh callan". You get one result, which is a 2007 post from comp.os.linux.advocacy where someone is quoting a 1984 post of mine that was in net.unix-wizards. Note that my 1984 post is not found.
Now go to the Google Groups version of net.unix-wizards [2].
Search there for "tim smith csh callan". Now the above mentioned 1984 post is found, along with another 1984 post.
Lest you think that there is some problem when searching from the main page, click on the "Search all groups" link on the net.unix-wizards search results page, and it only finds the 2007 COLA post that quoted my 1984 post.
A search from the main Google search page, as opposed to the search within groups, finds the first 1984 post as the first result.
I've seen vast numbers of posts become unfindable by search, and then weeks or months later become findable again. For instance, there was a long time when if you searched for "Bill Gates" in Google's usenet archive, it would only return something like a dozen posts.
To put it bluntly, Google's handling of the usenet archives has been negligent and/or incompetent.
I've seen vast numbers of posts become unfindable by search, and then weeks or months later become findable again.
I think this is an effect of the way Google searches/indices things; I am equally frustrated by pages that disappear from Google's web search which may or may not come back eventually (although I've seen more disappear than come back...) Remember that they're running a huge distributed system, and so consistency/completeness is probably relaxed in order to optimise other things they believe are more important. It's the same reason why even if Google says there are X results in a search, you often cannot view them all.
(Not that I'm actually agreeing with this behaviour, however. It's less noticed on the web where there tends to be a lot of redundant/similar information, but still not desirable at all.)
It was once possible access Google's archive of Usenet without Javascript. And there were "heavy" and "light" versions of the messages. The heavy versions have an enormous amount of Javascript, CSS and HTML cruft.
However, later they switched to HTTPS and #! URLs. Around this time I remember getting $CLASSPATH errors. Perhaps this is evidence to support your incompetence argument?
The "content" here is nothing but some plain ASCII Usenet posts. How difficult is it to serve plain text?
Anyway, today the same URL has been converted to this:
As I said in an earlier thread, Google itself developed a proposal to deal with this #! URL problem and advises webmasters to revise these AJAX URL's to "escaped_fragment" style URL's:
But apparently when the webmaster is Google, the specification does not apply.
Years ago, I made my own archives of some important comp and net groups. Google is not reliable. This stuff should be placed with the Internet Archive.
Use the search functionality to search for "tim smith csh callan". You get one result, which is a 2007 post from comp.os.linux.advocacy where someone is quoting a 1984 post of mine that was in net.unix-wizards. Note that my 1984 post is not found.
Now go to the Google Groups version of net.unix-wizards [2].
Search there for "tim smith csh callan". Now the above mentioned 1984 post is found, along with another 1984 post.
Lest you think that there is some problem when searching from the main page, click on the "Search all groups" link on the net.unix-wizards search results page, and it only finds the 2007 COLA post that quoted my 1984 post.
A search from the main Google search page, as opposed to the search within groups, finds the first 1984 post as the first result.
I've seen vast numbers of posts become unfindable by search, and then weeks or months later become findable again. For instance, there was a long time when if you searched for "Bill Gates" in Google's usenet archive, it would only return something like a dozen posts.
To put it bluntly, Google's handling of the usenet archives has been negligent and/or incompetent.
[1] https://groups.google.com/forum/#!overview
[2] https://groups.google.com/forum/#!forum/net.unix-wizards