Google Releases Web APIs

Follow Slashdot blog updates by subscribing to our blog RSS feed

Google Releases Web APIs 144

Posted by Hemos on Friday April 12, 2002 @07:57AM from the developing-with-google dept.

skunkeh writes "Google have released the first beta of their Web APIs package. Used in conjunction with a free license key this SOAP based web service allows developers to execute up to 1000 automated queries a day, but is currently available for non-commercial use only. The download comes with Java and .NET code examples and includes a WSDL description for use with other SOAP supporting languages." There's also a write up about uses on Userland.

This discussion has been archived. No new comments can be posted.

Google Releases Web APIs

Load All Comments

Search 144 Comments Log In/Create an Account

Comments Filter:

Web API Implementations (Score:5, Informative)

by skunkeh ( 410004 ) writes: on Friday April 12, 2002 @08:00AM (#3328731)

A list of implementations of the Google Web API can be found on SoapWare:
http://www.soapware.org/directory/4/services/googl eApi/implementations [soapware.org]
At the time of posting languages catered for were for AppleScript, Frontier/Radio, Perl, Python and Visual Basic. I've written a basic implementation in PHP which has yet to be added to the list - you can find it here:
http://toys.incutio.com/php/php-google-web-api.htm l [incutio.com]
This is a very cool toy.

Share
twitter facebook
- Re:Web API Implementations (Score:2, Interesting)
  
  by Anonymous Coward writes:
  
  Sorry for hijacking this thread, but in the last story about "google" [slashdot.org] someone mentioned [slashdot.org] that there should be a google topic on slashdot, and someone [slashdot.org] offered up these icons [sunyit.edu]. Why isn't there a google topic on slashdot?
- Re:Web API Implementations - Ruby missing (Score:2, Informative)
  
  by pong ( 18266 ) writes:
  
  I just submitted a request to have Ruby/Google added to the list of implementations. Until then you can find it at http://www.caliban.org/ruby/
- TCL Implementation (Score:1)
  
  by azaroth42 ( 458293 ) writes:
  
  As there isn't one in the above list, I put together a TCL implementation at:
  http://gondolin.hist.liv.ac.uk/~cheshire/tclgoogle . tml [liv.ac.uk]
  Enjoy!
  -- Azaroth
This story refers to (Score:3, Informative)

by sydneyfong ( 410107 ) writes: on Friday April 12, 2002 @08:08AM (#3328748) Homepage Journal

The old one here - "Google to Offer API" [slashdot.org]

Share
twitter facebook
- Unlimited (Score:1)
  
  by phyxeld ( 558628 ) writes:
  
  So, you can execute 1,000 searches a day through their API, OR you can code your program to do a normal google search and parse the results out of the returned html (like people have had to do until now)...
  
  How many projects can't afford the overhead of a little html parsing but CAN afford to be limited to 1,000 searches/day? I'm sure they'll offer higher limits for a fee, but I think the DIY-html-parsing google "api" is going to keep on working just fine (and for free).
AppleScript for Google API (Score:2, Informative)

by Lysander Luddite ( 64349 ) writes:

http://radio.weblogs.com/0100012/stories/2002/04/1 1/applescriptForGoogleApi.html

has some Applescript for your use
- Re:AppleScript for Google API? (Score:1)
  
  by zaren ( 204877 ) writes:
  
  I get the following when I load that URL:
  
  -----
  Not Found
  
  The requested URL /0100012/stories/2002/04/1 was not found on this server.
  -----
  
  Did you drop something out of that URL, or did it lose something when it got posted?
  
  Aww, FSCK [cafepress.com]!
  - Re:AppleScript for Google API? (Score:2)
    
    by Lysander Luddite ( 64349 ) writes:
    
    It must have most something when posted. Sonmebody posted the "correct" url in a reply, but it looks exactly the same as my post with the space between the 1s
    
    *shrug*
- Re:AppleScript for Google API (Score:4, Informative)
  
  by bodin ( 2097 ) writes: on Friday April 12, 2002 @08:59AM (#3328894) Homepage
  
  And the URL was broken. Here is the right one:
  
  http://radio.weblogs.com/0100012/stories/2002/04/1 1/applescriptForGoogleApi.html [weblogs.com]
  
  Parent Share
  twitter facebook
  - Re:AppleScript for Google API (Score:1)
    
    by Captain Large Face ( 559804 ) writes:
    
    Is there a problem with the slashdot script? Spaces seem to appear in the middle of long unbroken lines. Is this an anti-lamer tactic or something?
    - Re:AppleScript for Google API (Score:1)
      
      by ethereal ( 13958 ) writes:
      
      It prevents "page-widening" attacks, by breaking up any string that would display in such a way as to force the browser window to be too wide. You'll notice that the extra space is only inserted in the displayed text; the actual link itself is OK.
w00t! (Score:1, Funny)

by Anonymous Coward writes:

Now I can write programs to generate Google-whacks!
- Re:w00t! (Score:1, Offtopic)
  
  by igrek ( 127205 ) writes:
  
  Here you go: Perl GoogleWhack module [cpan.org]
- But no searching images or newsgroups... (Score:1)
  
  by casio282 ( 468834 ) writes:
  
  Sadly, though, there is no support in the API for queries on their image or usenet "groups" indices, according to the API FAQ [google.com]:
  
  2. Can Google APIs be used to access Google Groups? Image search? Directory search?
  
  No. The Google Web APIs service can only be used to search Google's main index of 2 billion web pages.
  
  I'll have to keep on parsin'...
  Maybe some day.
A great corporate move (Score:5, Insightful)

by shankark ( 324928 ) writes: on Friday April 12, 2002 @08:16AM (#3328770)

Other than being a really cool idea, this is a great tactical move from Google. On the one hand, by restricting the number of queries made to Google, they ensure that their APIs aren't misused/compromised, it also gives companies an initiative to purchase Google products and deploy this API (probably an unrestricted-query API) on their own network. Furthermore, an API such as this will easily muscle out any sniff of a competition from other search engine wannabes. Google has managed to do all this and yet be as compliant
to an Open Source initiative as possible. Remarkable.

Share
twitter facebook
- Re:A great corporate move (Score:5, Informative)
  
  by skunkeh ( 410004 ) writes: on Friday April 12, 2002 @08:21AM (#3328786)
  
  From the FAQ [google.com]:
  
  2. Does Google have any plans to sell Google Web APIs as a service?
  
  Not at this time.
  Which seems very strange seeing as this could be a huge money spinner. Surely a license system which allows commercial users to subscribe to a certain number of queries a day, or just buy queries in bulk would generate a lot of income for Google and provide a valuable service to the internet business community at large.
  
  Parent Share
  twitter facebook
  - Re:A great corporate move (Score:5, Insightful)
    
    by gowen ( 141411 ) writes: <gwowen@gmail.com> on Friday April 12, 2002 @08:53AM (#3328872) Homepage Journal
    
    2. Does Google have any plans to sell Google Web APIs as a service?
    
    Not at this time.
    
    Sure, thats what they say, but what they mean is :
    
    "of course we do. You think we're doing this out of love? But we don't know what they are just yet, and we want to get things right. So go away, and we'll put out a press release as and when we're ready."
    
    Which is fair enough.
    
    Parent Share
    twitter facebook
    - In fact, farther down in the FAQ. (Score:1)
      
      by mbauser2 ( 75424 ) writes:
      
      Question 15 of the Technical FAQ [google.com] is:
      
      15. What if I want to pay Google for the ability to issue more than 1,000 queries per day?
      
      Google is only offering the free beta service at this time. If you would like to see Google develop a commercial service, let us know at api-support@google.com.
      
      So, yeah, they're definitely interested in it if the developer community is interested in it.
  - Re:A great corporate move (Score:2, Interesting)
    
    by ergo98 ( 9391 ) writes:
    
    I wonder if Google is already providing some sort of paid service to large corporations. On my website on day I actually got a hit coming from a chap at the Redmond campus of Microsoft, and he was searching via http://www.google.com/microsoft [google.com] : BTW, at the time I'm quite sure that that page actually displayed the Microsoft logo as well.
    - Re:A great corporate move (Score:2)
      
      by arkanes ( 521690 ) writes:
      
      Google makes these really, really cool little search appliance things that you set up on your network. Slashdot has a story about it, and I'm sure you can find some stuff on the google site. They also provide site indexing services.
    - Re:A great corporate move (Score:1)
      
      by mini me ( 132455 ) writes:
      
      Google has sections for various things.
      
      Such as
      Linux [google.com]
      BSD [google.com]
      Uncle Sam [google.com]
      
      And I'm sure there are plenty of others...
      - Re:A great corporate move (Score:1)
        
        by jeffphil ( 461483 ) writes:
        
        And I'm sure there are plenty of others...
        
        Mac [google.com], of course.
        
        -----
    - Re:A great corporate move (Score:1)
      
      by syates21 ( 78378 ) writes:
      
      If you look here [google.com] you can see a whole bunch of "special" Google URL that automatically limit searches to certain domains. Most of these are actually universities.
  - - Re:A great corporate move (Score:1)
      
      by sparx ( 25164 ) writes:
      
      Boy, you've really shown them then, haven't you? All they've gotta do is change up their output a bit and you just as screwed. Conspiracy theories aside, getting data without having to worry about the -presentation- of that data is a good thing for everyone.
- Re:A great corporate move (Score:4, Insightful)
  
  by circletimessquare ( 444983 ) writes: <circletimessquar ... m minus language> on Friday April 12, 2002 @08:29AM (#3328810) Homepage Journal
  
  ...not to mention that the 1,000 query limit/ day is only whetting an appetite. Any wagers on whether or not there will be a sweet little pricetag on 10,000 queries/ day or unlimited queries/ day? A pricetag corporate clients will gobble up? Remarkable indeed!
  
  Parent Share
  twitter facebook
Google groups may require a Google account.. (Score:1, Interesting)

by Anonymous Coward writes:

"...The initial service available with your Google Account is:

Google Web APIs - a tool for software developers to automatically query Google

In the future, your Google account will provide access to all Google programs requiring sign in including: Google Groups, Google AdWords, Google Store, Google in Your Language program...."

Does that mean that accessing the Google Groups is now going to need me to create an account? Hmm..
- Re:Google groups may require a Google account.. (Score:2)
  
  by skunkeh ( 410004 ) writes:
  
  No, it means posting on Google Groups through the web interface (which already requires an account) will be possible using your single Google account as opposed to a seperate one for Google Groups and the web API.
- Re:Google groups may require a Google account.. (Score:4, Informative)
  
  by AVee ( 557523 ) writes: <slashdotNO@SPAMavee.org> on Friday April 12, 2002 @08:34AM (#3328823) Homepage
  
  Well this is what it told me:
  In the future, your Google account will enable login access to all Google services, including Google Groups posting, Google AdWords, the Google Store, the Google in Your Language program, and more.
  (My emphasis)
  
  Notice the difference?
  
  Parent Share
  twitter facebook
Example of use (Score:5, Informative)

by dtr20 ( 442135 ) writes: on Friday April 12, 2002 @08:23AM (#3328795)

I just had a go with this and some example output is displayed below. Basically you can do a search of their main web pages, request a cached page or use their spellchecker.

Dave

$ java -cp googleapi.jar com.google.soap.search.GoogleAPIDemo XXmykeyXX search "british empire"
Parameters:
Client key = XXmykeyXX
Directive = search
Args = british empire
Google Search Results:
======================
{
TM = 0.117071
Q = "british empire"
CT = ""
TT = ""
CATs =
{
{SE="", FVN="Top/Regional/Europe/United_Kingdom/Society_an d_Culture/History"}
}
Start Index = 1
End Index = 10
Estimated Total Results Number = 688000
Document Filtering = true
Estimate Correct = false
Rs =
{

[
URL = "http://www.btinternet.com/~britishempire/empire/e mpire.htm"
Title = "The British Empire"
Snippet = "| Introduction | Articles | Biographies | Timelines
| Discussio
n | Map Room | Armed Forces | Art ... "
Directory Category = {SE="", FVN=""}
Directory Title = ""
Summary = ""
Cached Size = "5k"
Related information present = true
Host Name = ""
],
...

Share
twitter facebook
- Re:Example of use (Score:1)
  
  by imnoteddy ( 568836 ) writes:
  
  Nice. Tried your example using my brand new key and it works just fine. Now to figure out what to do with my new toy...
Trying to /. google???? (Score:1, Funny)

by Anonymous Coward writes:

Won't work, evil doers!!
O'Reilly has some good code and stuff (Score:5, Informative)

by bodin ( 2097 ) writes: on Friday April 12, 2002 @08:25AM (#3328799) Homepage

O'Reilly has a good article here with some code as well in both Java and Perl.

http://www.oreillynet.com/cs/weblog/view/wlg/1283 [oreillynet.com]

Share
twitter facebook
Google Terms of Service (Score:4, Funny)

by AVee ( 557523 ) writes: <slashdotNO@SPAMavee.org> on Friday April 12, 2002 @08:29AM (#3328809) Homepage
To create an Google account [google.com] you have te agree with the Google Terms of Service [google.com]. These state the following:

No Automated Querying

You may not send automated queries of any sort to Google's system without express permission in advance from Google. Note that "sending automated queries" includes, among other things:
- using any software which sends queries to Google to determine how a website or webpage "ranks" on Google for various queries;
- "meta-searching" Google; and
- performing "offline" searches on Google.
Now, how can I use the web API?!
Note that this is not in the Google Api TOS [google.com] wich you must agree to before downloading [google.com] the api. But in the Google Terms of Service [google.com] wich you must agree to before creating a Google account needed to use the Google Api.

Still, it's fun and i'll play with it!
Share
twitter facebook
- Re:Google Terms of Service (Score:5, Funny)
  
  by Hektor_Troy ( 262592 ) writes: on Friday April 12, 2002 @08:54AM (#3328876)
  
  Considdering you get Googles express permission [google.com] for it:
  
  Your Google Account and license key entitle you to 1,000 automated queries per day.
  
  I happen to have a pair of spare glasses lying around, (+2,75 on each eye) - wanna borrow them?
  
  Parent Share
  twitter facebook
slashdotted, i think (Score:2, Insightful)

by Ermyf Jym ( 567787 ) writes:

Here's a copy of the write up. My machine barely made it to the site :)
Google is just the juice
Thu, Apr 11, 2002; by Dave Winer.
Good afternoon

A very quick piece today, a story, a question, an answer and a pointer.

The story -- 1995. A new release of Netscape. Can't get through to their servers. This thing is exploding. A mind bomb every minute. Wow. I love this. End of story.

The question: Can it happen again?

The answer..

Yes!

This afternoon Google opened a public SOAP 1.1 interface.

Now, from scripts, we can call Google as if it were a script running locally.

What comes back? Data.

What questions should we ask?

That's where the mind bombs will come from.

In the loop

We've been in the loop with Google, privately, for the last few weeks, so we've had a chance to play with ideas and actually have some.

Yesterday, as a tease, I put a Google Box on Weblogs.Com. Every hour it recalcs, showing the top 10 hits on Google for the term weblog. To my surprise, it changes, it's not constant. And it took me to places I didn't know about. The serendipity of queries that run for a long time. That, imho, is where the juice is in the Google API; and probably many or most of the APIs that are sure to follow; because Google is so popular.

Google hits the ball over the net, then we return the volley. Finally, once again, signs of life. Let's hope we learn from the past -- and keep the spark going -- welcoming competition and learning from it instead of snuffing it out. The intoxication of a new idea every day is too good to not want to be there once again.

Maybe the dark ages are over? I hope so.

Google is just the juice

It's happening in real time. As I write this I'm waiting for the embargo to lift. As soon as that happens, we'll start releasing new parts and samples for Radio and Frontier users that connect to Google's SOAP interface, with simple but geekish instructions for getting started.

Later today Google Boxes will start showing up on Radio weblogs, which you can follow through Weblogs.Com. You'll see SOAP developers, on all platforms, getting to work, creating and publishing the glue that turns the Internet, finally, into a fantastic scripting environment. Google is just the juice we need.

Dave Winer
Staggering Potential (Score:5, Insightful)

by Captain Large Face ( 559804 ) writes: on Friday April 12, 2002 @08:36AM (#3328829) Homepage
Whilst the potential of a regular Google search is large enough, when you consider the Google search modifiers, the potential becomes staggering. Imagine using the following features:
- Business Address Lookup
- File Type Specific Search (.PDF etc..) (filetype:)
- Stock Quotes
- Cached Links (/. Favourite) (cache:)
- Similar Pages (related:)
- Linked Sites (link:)
- Site Specific (site:)
- Maps
Does anyone happen to know if you can use the other sections of Google (e.g. news, images etc.)?

Is Google the best company ever or what?!
Share
twitter facebook
- Re:Staggering Potential (Score:2, Funny)
  
  by selderrr ( 523988 ) writes:
  
  Is Google the best company ever or what?!
  
  Nope. Microsoft is.
  - Re:Staggering Potential (Score:2)
    
    by dimator ( 71399 ) writes:
    
    You're both wrong. It's these guys. [vividvideo.com]
- Re:Staggering Potential (Score:2, Informative)
  
  by darkpurpleblob ( 180550 ) writes:
  
  Does anyone happen to know if you can use the other sections of Google (e.g. news, images etc.)?
  
  From the FAQ:
  
  Can Google APIs be used to access Google Groups? Image search? Directory search?
  
  No. The Google Web APIs service can only be used to search Google's main index of 2 billion Web pages.
  
  --
todays tasks are apparent (Score:1)

by wishiwascool ( 60898 ) writes:

Well,

I know what I'll be doing when I get to work today. Just my additional 2 cents to this marvelous addition to Google.
Pigeonrank anyone? (Score:1, Funny)

by Anonymous Coward writes:

I wanna program some pigeons!
- Re:Pigeonrank anyone? (Score:1)
  
  by dynoman7 ( 188589 ) writes:
  
  public class Pigeon
  {
  public short weight;
  public String name;
  public void Pigeon(){...}
  public void feedPigeon{...}
  public gSearchResult searchPigeon{...}
  ...
  }
  - Re:Pigeonrank anyone? (Score:4, Funny)
    
    by tb3 ( 313150 ) writes: on Friday April 12, 2002 @10:14AM (#3329226) Homepage
    
    So, I guess
    
    public void Pigeon()
    
    is what makes them crap on your shoulder?
    
    Parent Share
    twitter facebook
Anyone care to explain... (Score:1, Interesting)

by Anonymous Coward writes:

how this is different than using a script to parse Google's output?

White Hat Research.net [whitehatresearch.net]

Geek Clothes - Including a shirt with the (in)famous Ben Franklin quote! [cafepress.com]
- Re:Anyone care to explain... (Score:1)
  
  by jmacgill ( 547996 ) writes:
  
  The old terms and conditions prohibited any automated 'page scraping' so I guess this a leagal and much cleaner way of processing google output.
- Re:Anyone care to explain... (Score:5, Informative)
  
  by Software ( 179033 ) writes: on Friday April 12, 2002 @12:50PM (#3330128) Journal
  
  OK, your script parses Google's HTML output today, but what about a year from now when Google changes its output, to say, XHTML or plain text or something. How well will your script work then? Although the Google API could change tommorow like some companies' [microsoft.com], in general APIs are more stable. I haven't looked at their API, but I'm guessing it's also easier to develop against their API, and it should be less processor- and network-intensive.
  
  Parent Share
  twitter facebook
More Advanced Features? (Score:5, Interesting)

by Captain Large Face ( 559804 ) writes: on Friday April 12, 2002 @08:53AM (#3328871) Homepage
I think I speak for most when I ask if you can have your results back in the "interesting" language sets:
- Swedish Chef [google.com]
- Elmer Fudd [google.com]
- Pig Latin [google.com]
- H4x0r [google.com]
- Klingon [google.com]
Share
twitter facebook
why not just use plain http (Score:3, Insightful)

by sanermind ( 512885 ) writes: on Friday April 12, 2002 @08:54AM (#3328877)

If they are going to limit you to only 1000 queries, I fail to see the point. It wouldn't be hard at all to write a simple API on your own to, say, a c++ class that spits out the necessary url's [like http://www.google.com/search?hl=en&q=example]
or the like, dispatch them to google port 80, and then parse the results into easily program readable data sets/results? A third party could write this sort of thing easily enough if there was demand for it. I mean, esentially the google search API isn't going to be offering anything not available in the standard forms, is it? Except their spell checker, I believe. [Which you could use via html too, actually, "Did you mean: ______" ]

Share
twitter facebook
- Re:why not just use plain http (Score:3, Insightful)
  
  by beebware ( 149208 ) writes:
  
  Except if Google does notice a disproportional number of search from an IP address (even a netblock) they do reserve the right to block your IP/netblock. This way, at least, you are querying their database without breaching their TOS...
  - Re:why not just use plain http (Score:2)
    
    by gorilla ( 36491 ) writes:
    
    And the API calls are reasonably guaranteed to be stable, while HTML parsing can break at any time if Google decides to change their output format.
- Re:why not just use plain http (Score:1, Insightful)
  
  by Anonymous Coward writes:
  
  If google changes the way their results are displayed then your simple API breaks, that's why.
What about slashdot? (Score:3, Interesting)

by aozilla ( 133143 ) writes: on Friday April 12, 2002 @08:54AM (#3328878) Homepage

How long until slashdot offers this service?

Share
twitter facebook
- They already do... (Score:3, Interesting)
  
  by xtermz ( 234073 ) writes:
  
  ... sort of.. they have a XML file out there off of the main site that you can query to get the latest headlines...
  
  i've actually used it before with a simple VB app...
  
  email me if you want the code...
  - Re:They already do... (Score:2)
    
    by aozilla ( 133143 ) writes:
    
    That's just the headlines... Pretty useless...
- access to postings (Score:1, Interesting)
  
  by Anonymous Coward writes:
  
  I would love a computer-friendly way to access postings. The limited UI offered by a web browser is not the right way to read these huge, nested discussions! I've been playing with better interfaces, and have kludgey code for parsing slashdot HTML pages, but it would be wonderful to have something cleaner and less brittle.
NNTP tunneling ? (Score:3, Insightful)

by Bert Peers ( 120166 ) writes: on Friday April 12, 2002 @08:57AM (#3328882) Homepage

In case the engineers at google are bored now that it's released, here's an idea ;) Open up groups.google.com via a similar API so that an application can get the latest Usenet info even through proxies blocking NNTP and/or newsservers. Showing the latest threads/posts etc on a webpage could be useful too.

It's not something you have to go to google for, but it'd be nice :)

Share
twitter facebook
- Re:NNTP tunneling ? (Score:4, Interesting)
  
  by km790816 ( 78280 ) writes: <wqhq3gx02@@@sneakemail...com> on Friday April 12, 2002 @09:58AM (#3329152)
  
  This is where things get interesting.
  
  Companies have become happy blocking ports to restrict no-nos: messaging, newsgroups, etc.
  
  I'm wondering how long it will be until we start seeing firewalls that can filter/block SOAP calls for the very reasons you mention. SOAP just forces network admins to move up from ports and protocals to sniffing HTTP requests to keep people from having too much fun.
  
  Enjoy it while it lasts.
  
  Parent Share
  twitter facebook
  - Re:NNTP tunneling ? (Score:1)
    
    by malakai ( 136531 ) writes:
    
    SOAP standard makes it easy to filter/block SOAP calls. But the key is, you can do it per-interface and per-method.
    SOAP clients send their data using M-POST, which mandates the server understand the Interface URI header, and the method-name header.
    This should allow network admins to restrict/allow specifically what they desire, and not force them to have to turn off SOAP through a firewall as a whole.
  - Re:NNTP tunneling ? (Score:1)
    
    by khuber ( 5664 ) writes:
    
    There will always be a way around that kind of stuff (besides moving to a better company). Imagine encoding your request and sending it through a proxy decoder outside your firewall. Surf the web using email? Or get a wireless carrier? The persistent hackers will never be stopped, because they have more knowledge and interest than a network admin who's just working their 40.
    
    -Kevin
- Oh yes, porn. (Score:1)
  
  by iamr00t ( 453048 ) writes:
  
  And don't forget to fix the multipart merger.
only 10 results per search?? (Score:2, Insightful)

by ShaggusMacHaggis ( 178339 ) writes:

The 1000 searches a day is very nice....I know I would never need that many (if results were unlimited anyway).

HOWEVER...you only get 10 results per search??
- Re:only 10 results per search?? (Score:1)
  
  by Mr M ( 120740 ) writes:
  
  I thought this was strange at first until I realised this was based on their existing search.
  
  Perform a search on Google. By default you'll get a list of ten results. The "Next" link shows a parameter called "start" that on the first result is 10, the second 20, and so on. So you can get more than 10 results by using multiple queries. This means the maximum number of results per day is 10,000.
Question (Score:2, Funny)

by loconet ( 415875 ) writes:

Anyone else feels that if google ever dissapears, they will become very unproductive?
Re:Since when do we like Google? (Score:1)

by aozilla ( 133143 ) writes:

FYI:

We like them Monday, Wednesday, and Friday.
We love them Tuesday, Thursday, and Saturday.
And we alternate Sundays.

Get with the program.
--
---
- Re: Re:Since when do we like Google? (Score:2)
  
  by Accipiter ( 8228 ) writes:
  
  Sorry, I don't remember hating Google.
Synthesis (Score:5, Insightful)

by Asprin ( 545477 ) writes: <(moc.oohay) (ta) (dlonrasg)> on Friday April 12, 2002 @10:07AM (#3329189) Homepage Journal

Ummmmm. Ok, check this out.

This morning on /. we have an article about Google releasing their SOAP 1.1 API followed immediately by an article from a guy that set up a spambot trap on his web site, and in the margin a poll about giving spammers what they deserve. Putting 2 and 2 and 2 together, I got 4, popped open a google box and started playing.

All I did was ask google to search for "mailto" and "@msn.com" and lo and behold, she spit back 111,000 hits - hits that contain what look like legit email addresses IN THE THREE LINE SUMMARIES.

The point is, now that google can be automated, what's to stop spammers from SOAPing their way into Google to do their harvesting? Would there be any point over what they're doing now? It might be cheaper, because you only have to run over the google results not the whole sites and since Google caches pages, you can even grab addresses from the past, somewhat.

IT ALSO DEFEATS SPAMBOT TRAPS.

Doesn't this give spammers whole new avenues to exploit?

Worse, are webmasters going to have to put a halt to Google crawls?

Share
twitter facebook
- Re:Synthesis (Score:2, Informative)
  
  by Pvt_Waldo ( 459439 ) writes:
  
  &ltsarcasm&gt
  LOL maybe we should just dismantle the whole internet, as clearly the internet is the channel used by spammers! Oh wait. The internet has many many positive uses. Gee!
  &lt/sarcasm&gt
  
  LOL a 4 for Interesting? Oh come on, this is ignorance, not information.
  
  Horrors! Spammers can use this!
  
  Uh 'scuse me but I can write a 10 line perl script that does the same thing. All I have to do is craft a query to google, and put a bunch of work into parsing out the real content from the HTML that comes back. Kind of a pain, but nothing a few regexp can't handle. This API is nothing new, it's just something handy. I'm seriously thinking I can replace a component of a research project here at our research facility with this. Why reinvent the wheel after all?
  
  Worse, are webmasters going to have to put a halt to Google crawls?
  
  It's called robots.txt. Ever run a web server? All this API does is let you do searches to google. Google is google is always searching. That's what robots.txt is for. You are not going to get crawled by this! This is not a BOT, just a QUERY TOOL.
  - Re:Synthesis (Score:1)
    
    by Asprin ( 545477 ) writes:
    
    I hadn't considered that Google's already crawling you anyway (point taken - oops - someone mod me down!), but again, this defeats spambot traps. Further, some of the more useful tech-support-web-forum postings would be somewhat filtered out if webadmins restricted Google crawls by using robot.txt. (I think the bot trap article had some better policy-oriented ideas about how to accomplish this.)
    
    Regarding the PERL scripting, ten years ago, I actually joked with my friends about sending emails to people with batch/script/program attachments that deleted files with a message that sez "run this c00l program d00dez!" but it didn't occur to me that anyone would actually fall for it and that's what the human-engineering-virus "revolution" (Melissa, ILOVEYOU, et. al.) was all about.
    
    (I've also decided on rereading my comments and yours that I should have gone with my first instinct and posted this under the spambot traps article -- in retrospect, it would have been far more appropriate there. Oh, well I had a 50-50 shot and lost - thanks for keeping me honest!)
    - Re:Synthesis (Score:2)
      
      by Tony-A ( 29931 ) writes:
      
      ...ten years ago, I actually joked with my friends about sending emails to people with batch/script/program attachments that deleted files with a message that sez "run this c00l program d00dez!" but it didn't occur to me that anyone would actually fall for it and that's what the human-engineering-virus "revolution" (Melissa, ILOVEYOU, et. al.) was all about.
      That's the problem with systems/applications that think they are smarter than the user and hide things from the user. Not showing file extensions, even the DOS batch file @ECHO OFF is a bad idea.
      - Re:Synthesis (Score:2)
        
        by sparkz ( 146432 ) writes:
        
        The DOS @echo off command is no different from the bourne [again] shell >dev/null 2>&1 ... is that bad, too?
Example from Python (Score:1)

by f3e2 ( 235629 ) writes:

PyGoogle [diveintomark.org] allows you to access the web API from Python. Download here [diveintomark.org]. Python has no SOAP support in the standard library, but a working SOAP library is included with PyGoogle.

-Mark

Dive Into Python [diveintopython.org] - a free Python book for experienced programmers
- Re:Example from Python (Score:1)
  
  by haluness ( 219661 ) writes:
  
  Hi,
  tried using the code. The page mentions that I need SOAP.py 0.9.7.1, but the included version is 0.9.7
  
  On trying a google search SOAP.py gives me a traceback
  
  Any suggestions?
Google Stock Exchange (Score:2)

by tringstad ( 168599 ) writes:

It seems there is still time to enter the Google Programming Contest [google.com] and although I have neither the time nor the skill to do it, I do have an interesting idea if someone else wants to take a shot at it.

Years ago, The Hollywood Stock Exchange [hsx.com] was a somewhat popular game (maybe it still is, but it doesn't really interest me). The general idea being that you could "Buy shares of your favorite actors, movies, and music artists and watch their values rise or fall based on the success of their careers and personal life."

It would be interesting to see a similar game based on the popularity of queries. It's clear from the Google Zeitgeist [google.com] that certain search terms do gain and lose popularity on a regular basis, and for someone tapped in to mainstream culture, it may not be too hard to predict.

I suppose you could do the same thing with the other info there (Browsers, OSs, Current Events, etc.) but I don't think it would be as interesting. Although... Anime searches might be neat.

Anyhow, just an idea I'd love to see someone run with.

-Tommy
Grammar nits (Score:1)

by jdavidb ( 449077 ) writes:

Google is singular. Even though the term represents an organization of many people, it is just one organization, and so the word is singular. You don't say, "The class have learned the material from lesson 5"; you say, "The class has learned the material from lesson 5."

Trust me on this one. It's not like the word "data" where we monkeyed around and changed the semantics.

So it's not, "Google Release Web APIs," it's "Google Releases Web APIs"; and it's not "Google have released ... ," it's "Google has released ... ." I know, it doesn't matter. That doesn't keep it from bothering me.
- Re:Grammar nits (Score:2)
  
  by GTRacer ( 234395 ) writes:
  
  Except maybe in Europe. And some other places, too. I know when I was in London I noticed that collective nouns were treated as plural: IBM have released... and so on.
  Same's true if you watch enough Britcoms or other British imports (damn do they make good crime dramas!).
  My 2 cents...
  GTRacer
  - Should be returning to England in a year or so...
  - Re:Grammar nits (Score:2, Informative)
    
    by ianmacd ( 46518 ) writes:
    
    True. Plural usage of a company name is correct in British English.
  - Re:Grammar nits (Score:2)
    
    by jdavidb ( 449077 ) writes:
    
    Didn't know that. Cool. I stand corrected.
Interactive Suck-Rules-O-Meter Now? (Score:2)

by GeekLife.com ( 84577 ) writes:

Maybe now Kuro5hin can redo their Interactive Sucks-Rules-O-Meter [kuro5hin.org].
Umm ok. (Score:1)

by Pvt_Waldo ( 459439 ) writes:

<sarcasm>
LOL maybe we should just dismantle the whole internet, as clearly the internet is the channel used by spammers! Oh wait. The internet has many many positive uses. Gee!
</sarcasm>
LOL a 4 for Interesting? Oh come on, this is ignorance, not information.

Horrors! Spammers can use this!

Uh 'scuse me but I can write a 10 line perl script that does the same thing. All I have to do is craft a query to google, and put a bunch of work into parsing out the real content from the HTML that comes back. Kind of a pain, but nothing a few regexp can't handle. This API is nothing new, it's just something handy. I'm seriously thinking I can replace a component of a research project here at our research facility with this. Why reinvent the wheel after all?

Worse, are webmasters going to have to put a halt to Google crawls?
Huh? What does this API have to do with being crawled by google? All this API does is let you do searches to google. Google is google is always searching. That's what robots.txt is for. You are not going to get crawled by this! This is not a BOT, just a QUERY TOOL.
- Re:Umm ok. (Score:1)
  
  by Pvt_Waldo ( 459439 ) writes:
  
  Preview you fool! Slapslapslapslapslap....
they own spell check results? (Score:2)

by Dr. Awktagon ( 233360 ) writes:

The Google Rights include rights to the following:......(3) the search results and spell checking you obtain when you use Google Web APIs.

I never thought I'd read the words "Google Rights" in a legal document, but anyway, how can Google own the rights to "spell checking".. what exactly do they own? The words that come back? The association of misspelled words to spelled words? How could you abuse that??

I must say this is incredibly cool though.. however I would much rather see a generic "Search Engine API" that isn't owned by Google, and can be implemented by anyone.
- Re:they own spell check results? (Score:1)
  
  by yasth ( 203461 ) writes:
  
  I must say this is incredibly cool though.. however I would much rather see a generic "Search Engine API" that isn't owned by Google, and can be implemented by anyone.
  
  It could well become a defacto standard, as they are the first major search engine to do this, and other search engines will want to at the minimum provide a tool for migrating from Google powered applications to thier own, which of course means that one should be able to write to the google API and then use thier tools to port it to whatever one wants.
Notice how Google avoided use of CORBA (Score:1, Insightful)

by Anonymous Coward writes:

Notice how Google avoided use of CORBA for these APIs. That speaks volumes in favour of SOAP's robustness over CORBA knowing how these Google dudes are perfectionists.
Not needed (Score:3, Interesting)

by NineNine ( 235196 ) writes: on Friday April 12, 2002 @01:12PM (#3330263)

Why is this needed? I've been using Google programatically for a while now. What does this offer that I can't use on my own?

Share
twitter facebook
- Re:Not needed (Score:3, Informative)
  
  by skunkeh ( 410004 ) writes:
  Legality [google.com] for one thing:
  
  No Automated Querying
  
  You may not send automated queries of any sort to Google's system without express permission in advance from Google. Note that "sending automated queries" includes, among other things:
  
  using any software which sends queries to Google to determine how a website or webpage "ranks" on Google for various queries;
  
  "meta-searching" Google; and
  
  performing "offline" searches on Google.
  
  It also stops your scripts from breaking every time Google redesign their results page.
  - Re:Not needed (Score:1)
    
    by Vulture_ ( 106594 ) writes:
    That's really pretty unenforceable. How is Google supposed to know if a query is automated? How is a query defined to be "automated"?
    For the latter, we have the following scenarios which could be interpreted as being automated:
    
    You are using a computer to query Google. You're not actually twiddling the electrons in Google's servers with your fingers to perform the search.
    
    If your browser has a built-in "Web Search" or "Google Search" function, the browser is automatically sending the query to Google and parsing the results before displaying them to you. You might also be using some standalone program that does this.
    
    Some IRC bots (particularly infobot) have a Google search function, wherein someone (on a channel or in private) asks the bot to do a Google search. The bot does the search and displays the results to the user.
    
    A user instructs their computer to do a Google search and save the results when a (dialup) Internet connection is established. Thus, a delayed search is performed. (This could be very useful for those who have to pay per minute for phone time to connect, and get lower rates during evenings or such. The connection would be established at a time when rates are low, the search performed, and the connection broken.)
- Re:Not needed (Score:1)
  
  by rbeattie ( 43187 ) writes:
  
  I was going to moderate you down, but decided to just respond to you bluntly instead.
  
  You're a fucking idiot.
  
  -Russ
- Re:Not needed (Score:1)
  
  by tshak ( 173364 ) writes:
  
  There's a difference between a "hack" and a solid implementation.
  - Re:Not needed (Score:2)
    
    by NineNine ( 235196 ) writes:
    
    It's not a hack. It's simply programatically browsing, reading the results, etc. It's very fucking simple in VB, PERL, whatever.
- Re:Not needed (Score:1)
  
  by WWWWolf ( 2428 ) writes:
  
  HTML is great to make pages readable for users.
  SOAP is great to get stuff from web in format programs understand.
  Yes, it's possible to use HTML parsing and stuff for any site (I'm using such tools in Everything2.com), but they all need atrocitious amount of HTML parsing. Thank God for visual-regexp. =)
  However, if E2 would use SOAP, I'd just ask them "Give me IDs of writeups matching title "don't force your gray philosophy on me" and it gives them, and I have them in an @array. Period.
  (Praying the security bug in SOAP::Lite will be fixed and Nate will make good use of mod_soap =)
Looking for something similar to this (Score:1)

by homestead ( 573077 ) writes:

Does anyone know of any consumer retail sites that have a similar api to their online catalogs?

I am a grad student looking to avoid html scrapping for one of my projects.

Thanks
1000 queries per day? Dumb. (Score:1)

by Vulture_ ( 106594 ) writes:

So, Google limits automated queries, but allows unlimited interactive queries, even though the automated queries consume vastly less bandwidth and CPU than the interactive queries? Does this seem just a tad stupid to anyone else?
Also, the limit on results per query severely limits the usefulness of this API.
Finally, the requirement for a license key sounds a little Microsoftish to me. Since Google is not Microsoft, this is unlikely to work in their favor.
For these reasons, I suspect that the release of this API may hurt Google more than it helps them.
- Re:And you thought Microsoft was spying? (Score:4, Insightful)
  
  by Spackler ( 223562 ) writes: on Friday April 12, 2002 @08:18AM (#3328777) Journal
  
  DOH. And I hit submit before the good part:
  
  Your program must include your license key with each query you submit to the Google Web APIs service.
  
  Parent Share
  twitter facebook
  - Re:And you thought Microsoft was spying? (Score:2, Funny)
    
    by Anonymous Coward writes:
    
    You are so right... this is a shocking way of doing buisness. These guys arn't even giving you root access to their servers... Shame on them.
    
    What I find even more amazing is that you seem to expect this unrestricted acces.
- Re:And you thought Microsoft was spying? (Score:1, Flamebait)
  
  by flipflapflopflup ( 311459 ) writes:
  
  > now Google will log every search from your
  > automated application
  
  And why shouldn't they?
  - Re:And you thought Microsoft was spying? (Score:4, Insightful)
    
    by TechnoVooDooDaddy ( 470187 ) writes: on Friday April 12, 2002 @08:59AM (#3328897) Homepage
    
    ummm... yeah... that's partly how they do their whole weighting thing to determine hot websites for search criteria.. without that in place, you'd have to search through tons of crap to find what you want.. I regularly use the "I feel lucky" button on google, because their algorithms manage to pull up what i'm looking for first hit..
    
    Parent Share
    twitter facebook
    - Re:And you thought Microsoft was spying? (Score:1)
      
      by ethereal ( 13958 ) writes:
      
      You're missing the point - it's not that they keep logs of searches (pretty much any web server anywhere is keeping similar logs) and use those searches to sort their database, it's that your searches are specifically traceable to you via your license key if you search through this API. So don't be searching for anything particularly private, unless of course you trust Google more than the average corporation.
      I don't see why the ancestor got modded down as "flamebait" - this is a good point. Apparently it's "flamebait" to say anything good about Microsoft (not that I've tried that one) or bad about Google around here :)
- Re:And you thought Microsoft was spying? (Score:1)
  
  by fr2ty ( 557571 ) writes:
  
  Ha, you are being funny.
  
  Ever thought of forbidding people to remember letters they receive?
  
  Everyone can log http requests to hos or her machine, so do you.Microsoft spying on us is different: They sell programs to you that establish internet connections without your consent, or send additional data along with your requests.
  
  Can you see the difference?
- Re:And you thought Microsoft was spying? (Score:2)
  
  by anthony_dipierro ( 543308 ) writes:
  
  Gee, and now Google will log every search from your automated application.
  
  And now you can create your own peer-to-peer google search which eliminates the logging altogether.

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

Web API Implementations (Score:5, Informative)

Re:Web API Implementations (Score:2, Interesting)

Re:Web API Implementations - Ruby missing (Score:2, Informative)

TCL Implementation (Score:1)

This story refers to (Score:3, Informative)

Unlimited (Score:1)

AppleScript for Google API (Score:2, Informative)

Re:AppleScript for Google API? (Score:1)

Re:AppleScript for Google API? (Score:2)

Re:AppleScript for Google API (Score:4, Informative)

Re:AppleScript for Google API (Score:1)

Re:AppleScript for Google API (Score:1)

w00t! (Score:1, Funny)

Re:w00t! (Score:1, Offtopic)

But no searching images or newsgroups... (Score:1)

A great corporate move (Score:5, Insightful)

Re:A great corporate move (Score:5, Informative)

Re:A great corporate move (Score:5, Insightful)

In fact, farther down in the FAQ. (Score:1)

Re:A great corporate move (Score:2, Interesting)

Re:A great corporate move (Score:2)

Re:A great corporate move (Score:1)

Re:A great corporate move (Score:1)

Re:A great corporate move (Score:1)

Re:A great corporate move (Score:1)

Re:A great corporate move (Score:4, Insightful)

Google groups may require a Google account.. (Score:1, Interesting)

Re:Google groups may require a Google account.. (Score:2)

Re:Google groups may require a Google account.. (Score:4, Informative)

Example of use (Score:5, Informative)

Re:Example of use (Score:1)

Trying to /. google???? (Score:1, Funny)

O'Reilly has some good code and stuff (Score:5, Informative)

Google Terms of Service (Score:4, Funny)

Re:Google Terms of Service (Score:5, Funny)

slashdotted, i think (Score:2, Insightful)

Staggering Potential (Score:5, Insightful)

Re:Staggering Potential (Score:2, Funny)

Re:Staggering Potential (Score:2)

Re:Staggering Potential (Score:2, Informative)

todays tasks are apparent (Score:1)

Pigeonrank anyone? (Score:1, Funny)

Re:Pigeonrank anyone? (Score:1)

Re:Pigeonrank anyone? (Score:4, Funny)

Anyone care to explain... (Score:1, Interesting)

Re:Anyone care to explain... (Score:1)

Re:Anyone care to explain... (Score:5, Informative)

More Advanced Features? (Score:5, Interesting)

why not just use plain http (Score:3, Insightful)

Re:why not just use plain http (Score:3, Insightful)

Re:why not just use plain http (Score:2)

Re:why not just use plain http (Score:1, Insightful)

What about slashdot? (Score:3, Interesting)

They already do... (Score:3, Interesting)

Re:They already do... (Score:2)

access to postings (Score:1, Interesting)

NNTP tunneling ? (Score:3, Insightful)

Re:NNTP tunneling ? (Score:4, Interesting)

Re:NNTP tunneling ? (Score:1)

Re:NNTP tunneling ? (Score:1)

Oh yes, porn. (Score:1)

only 10 results per search?? (Score:2, Insightful)

Re:only 10 results per search?? (Score:1)

Question (Score:2, Funny)

Re:Since when do we like Google? (Score:1)

Re: Re:Since when do we like Google? (Score:2)

Synthesis (Score:5, Insightful)

Re:Synthesis (Score:2, Informative)

Re:Synthesis (Score:1)

Re:Synthesis (Score:2)

Re:Synthesis (Score:2)

Example from Python (Score:1)

Re:Example from Python (Score:1)

Google Stock Exchange (Score:2)

Grammar nits (Score:1)

Re:Grammar nits (Score:2)

Re:Grammar nits (Score:2, Informative)

Re:Grammar nits (Score:2)

Interactive Suck-Rules-O-Meter Now? (Score:2)

Umm ok. (Score:1)