Saturday, April 4, 2009

Most of the use-ful information is for free.

Firstly let me clarify that this post is not for a high end geek, considering the need for the data there are lot many ways to get them, One of the ways to achieve the ends to means is the same old google. You might be booing that you have already know this from your kindergarten but what you might not have known so far is how to effectively use the site.

Starting with the way the google works, as far as i know, when you hit some text into the search bar and hit the return key you are displayed with almost precise result that you are looking for and now how google does it is by ranking the webpages. The web-pages are ranked depending on how many other web-pages link to this site, when ever some other webpage includes its link in to its own webpage the rank of the linked web-page increases, and you might be wondering on google gets to find a webpage some where remotely located. This job is accomplished by a google-bot called the spider bot (also known as crawlers) crawl in to a site when ever it finds a link to that site and from the crawled site to another site in this way the bot covers the entire internet web pages which are approximately 220 million by now.
So consider your self lucky when ever you have the google webpage in your browswer consider you have access to all those 220 million pages in a single click (thanks to google).

That is as far as the history lesson is considered and now coming to practicality and applications how can data efficiently be mined out. The answer to the question actually depends on what sort of data you are looking for, Ok for time being lets categorize them broadly into Education and Entertainment, Lets talk about entertainment first :) so how do you download music, you locate the site and go to the site, go through all the fuss and then comes the download page (This might even be asking you to register and all sort of things). The simplest way open the google webpage type in the text as shown below

intitle:"index.of"(mp3) linkin.park
Take a look...

What you get is a whole directory access to the songs by linkinpark explaining in detail about the format and how it works, the " index.of " serves as the actual key which helps in searching and listing the directories as they are, now all you have to do is to search your song from the list and download lolz, easy aint it? The format in the braces (mp3) can be replaced with any extension you want to find the data about and also if you want to include multiple extensions it goes like (mp3|swf|wav) etc etc.

That is by far most efficient method i use, people out there if you find any feel free to share it.
Google search allows you to use symbols like +,- inorder to eliminate or include the terms in search eg goes here
suppose you want to search the pages which are .php and ignore all the pages startng with .asp or .html or any other format
All you got to do is to type some simple text where ' abc ' is the content you wish to search for,

abc -html -asp +php
Take a look...

Next comes the ;) education ( No offense ), Now suppose you are interested in all the academic stuffs etc etc, all you have to do is the prefix your search with this

site:.edu <content you want to search>

what you get is all the sites from the educational institues and the best thing search the assignment your proffessor gives you , who knows you might find it. The sites you might want to search may vary, how about trying some like .org (the content mostly in this sort of sites would be free), .net.

Next is the filetypes you want to search for, lets say that you needed a white paper publishd by so and so author you would definitly find it in IEEE why not try some other means to get that with out logging in or some thing like that, do this..

filetype:pdf <content you want to search>

And again the .pdf can be replaced with .doc or .ppt (most of the slides which your lecturer teaches from are here give it a try).

And lastly try applying the combinations of the above methods to find even more useful methods like and a .pdf or a .ppt from the site:.edu

site:.edu filetype:ppt <content to search>

That would all be for now in this post and i would really be glad if any one points out any mistakes or issues in this (if you can) :P.
see you again with some other information.

Introduction

The name of the site says it all, the reason and the sole purpose of the site is to Increase, Share, Spread the knowledge that is accumulated by the real life experiences all my life and remember the site includes mainly the ideas of the tech-world, though i myself am not a tech-guru, I would try and contribute as much as to the world for a noble cause.

That is all for the formal introduction of what this site is about, I would try and accumulate as much information , useful information that is, here in a regular basis so viewers are welcomed to post comments and advice on improvements or any thing as for now.