Tag Archives: search engines
To help unify and standardize collaborative work, data management and business models across the enterprise in secure SSL cloud environments, Google Storage has been rolling out some changes (read below). This also gives you more options on the day Amazon goes, ahem, down (cough cough) because they didn’t think someone in their data environment could be sympathetic to free data.
And now to the actual update.
We’re making some changes to Google Storage for Developers to make team-based development easier. As part of this work, we are introducing the concept of a project. In preparation for this feature, we will be creating projects for every user and migrating their buckets to it.
What does this mean for you?
Everything will continue to work as it always has. However, you will notice that if you perform a get-acl operation on any of your buckets, you will see extra ACL entries. These entries correspond to project groups. Each group has only one member – the person who owned the buckets before the bucket migration; no additional rights have been granted to any of your buckets or objects. You should preserve these new ACL grants if you modify bucket ACLs.
An example entry for a modified ACL would look like this:
We’ll be rolling out these changes over the next few days.
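The advice to preserve the new project-group grants when you modify a bucket ACL boils down to "read, merge, write back" rather than "overwrite". Here is a minimal sketch of the merge step only. It assumes the ACL has already been fetched and parsed into a list of (scope, permission) pairs. That representation, the helper name `merge_acl` and the example scopes are all hypothetical; this is not the ACL entry format from the announcement, nor a Google Storage API call.

```python
# Hypothetical in-memory ACL: a list of (scope, permission) pairs.
# This is NOT the Google Storage wire format; it only illustrates
# merging new grants into an ACL without dropping existing ones.

def merge_acl(existing, additions):
    """Return the existing grants followed by any additions not already present."""
    merged = list(existing)
    seen = set(existing)
    for grant in additions:
        if grant not in seen:
            merged.append(grant)
            seen.add(grant)
    return merged

# Made-up placeholder entries, including one standing in for a project group.
current = [
    ("user:owner@example.com", "FULL_CONTROL"),
    ("group:example-project-owners", "FULL_CONTROL"),
]
wanted = [("user:teammate@example.com", "READ")]

new_acl = merge_acl(current, wanted)
for scope, permission in new_acl:
    print(scope, permission)
```

The point of the design is that the project-group entry survives the edit because the new ACL is built from the old one instead of from scratch.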
Detailed Note on GS-
Google Storage for Developers is a RESTful service for storing and accessing your data on Google’s infrastructure. The service combines the performance and scalability of Google’s cloud with advanced security and sharing capabilities. Highlights include:
Fast, scalable, highly available object store
- All data replicated to multiple U.S. data centers
- Read-your-writes data consistency
- Objects of hundreds of gigabytes in size per request with range-get support
- Domain-scoped bucket namespace
Easy, flexible authentication and sharing
- Key-based authentication
- Authenticated downloads from a web browser
- Individual- and group-level access controls
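The range-get support listed in the highlights above means a very large object can be fetched in pieces instead of one request. Here is a minimal sketch of how those pieces could be planned, assuming the service honors the standard HTTP `Range: bytes=start-end` request header with inclusive offsets. The helper name `byte_ranges` and the sizes are illustrative, and no network request is made.

```python
def byte_ranges(total_size, chunk_size):
    """Yield HTTP Range header values covering total_size bytes in chunk_size pieces."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    start = 0
    while start < total_size:
        # HTTP byte ranges are inclusive, so the last byte is end, not end + 1.
        end = min(start + chunk_size, total_size) - 1
        yield f"bytes={start}-{end}"
        start = end + 1

# Plan the requests for a 10-byte object fetched 4 bytes at a time.
plan = list(byte_ranges(10, 4))
for header_value in plan:
    print("Range:", header_value)
```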
In addition, Google Storage for Developers offers a web-based interface for managing your storage and GSUtil, an open source command line tool and library. The service is also compatible with many existing cloud storage tools and libraries. With pay-as-you-go pricing, it’s easy to get started and scale as your needs grow.
Google Storage for Developers is currently only available to a limited number of developers. Please sign up to join the waiting list.
On a whim, I took the all-time stats of my blog posts (more than 1000 posts) and tried to plot their distribution.
Basically I copied and pasted all the data into a Google Docs spreadsheet, and I created dummy codes (like URL1, URL2…. URL500).
Next I downloaded the….
I wasn’t in the mood for downloading and uploading stuff, so I decided to use ggplot through Jeroen’s application at http://www.stat.ucla.edu/~jeroen/
I used the mirror server that Dataspora provides, as I have had latency issues with Jeroen’s website.
I got this error while trying to connect the Dataspora app to my Google spreadsheet:
The page you have requested cannot be displayed. Another site was requesting access to your Google Account, but sent a malformed request. Please contact the site that you were trying to use when you received this message to inform them of the error. A detailed error message follows:
The site “http://dataspora.com” has not been registered.
Oh dear! Back to Jeroen’s/UCLA’s page.
I get this warning, but it still manages to log in:
This website has not registered with Google to establish a secure connection for authorization requests. We recommend that you continue the process only if you trust the following destination:
Wow, it works! That’s cloud computing now, so I wonder why Google and Amazon continue to ignore rApache and Jeroen’s cloud app. Surely Google Fusion Tables can always be improved or tweaked. Not to mention the next-gen version of R, which will have its own server.
Pretty cool screenshot (but click to see more)
I get the following pretty graph. Hadley Wickham would be ashamed of me by now.
What went wrong? Well, one page has 36,000 views. Scale is the key to graphical coherence. So I redo it: delete the home page in the Google spreadsheet, reimport, replot. (I didn’t know how to modify data in the cloud app; maybe we need a cloud plyr.) I redo it again as I have a big outlier: the Top 10 Statistical GUI article, which ironically has only 5 GUIs in it (but hush, don’t tell that to the high-quality search engine).
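To see why one 36,000-view page wrecks the plot, here is a minimal sketch comparing equal-width bins with order-of-magnitude bins. Only the 36,000 figure comes from my stats; the other view counts are made up for illustration, and the helper names `linear_bins` and `log10_bins` are hypothetical.

```python
import math
from collections import Counter

# Hypothetical page-view counts: ten ordinary pages plus one 36,000-view outlier.
views = [12, 25, 40, 55, 80, 120, 150, 300, 450, 900, 36000]

def linear_bins(values, n_bins):
    """Count values in n_bins equal-width bins between 0 and max(values)."""
    width = max(values) / n_bins
    counts = [0] * n_bins
    for v in values:
        # The maximum itself would land one past the end, so clamp it.
        idx = min(int(v / width), n_bins - 1)
        counts[idx] += 1
    return counts

def log10_bins(values):
    """Count positive values by order of magnitude (floor of log10)."""
    counts = Counter(math.floor(math.log10(v)) for v in values)
    return dict(sorted(counts.items()))

print("equal-width bins:", linear_bins(views, 5))
print("log10 bins:      ", log10_bins(views))
```

With equal-width bins the outlier stretches the axis so far that every other page piles into the first bin. Binning by order of magnitude (or dropping the outlier, as I did) spreads the ordinary pages back out.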
So, again. Belatedly I discover something called a layer in ggplot.
I give up. I rather prefer hist(). I go to my favorite GUI, Rattle, but it has some dating issues with the GTK+ DLL.
So I go to John Fox’s simple GUI, R Commander. It is the best GUI if you use Occam’s Razor, and I am using Occam’s Chainsaw now.
I get the analysis I want in 12 secs.
Summary: ggplot is more complicated than the base graphics engine.
The Deducer GUI is not as simple either.
R Commander is the best GUI because it retains simplicity.
Ignore the long tail of the internet only at your peril.
Almost two-thirds of my daily traffic of 400+ comes from old archived content. That is why Search Engine Optimization and Alerts for Keywords are CRITICAL for any poor soul trying to write on a blog (which has neither journal-like prestige nor rewards).
If you make life easier for the search engine, it, being a fair chap, rewards you well.
Existing web traffic estimates like Comscore and Google Trends ignore this long tail
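The long-tail claim in the summary above is easy to check from a two-column export like mine. Here is a minimal sketch, assuming rows of (dummy URL code, views). The numbers are invented for illustration, and `tail_share` is a hypothetical helper name.

```python
def tail_share(rows, top_n):
    """Fraction of total views coming from pages outside the top_n most viewed."""
    counts = sorted((views for _, views in rows), reverse=True)
    total = sum(counts)
    if total == 0:
        return 0.0
    return sum(counts[top_n:]) / total

# Made-up rows in the same shape as the spreadsheet (dummy code, views).
rows = [("URL1", 500), ("URL2", 300), ("URL3", 100), ("URL4", 60), ("URL5", 40)]

share = tail_share(rows, 2)
print(f"Share of views outside the top 2 pages: {share:.0%}")
```

Run on the real 500 rows, the same function would show how much of the traffic sits outside the handful of headline posts.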
Comments are welcome (Data is pasted below of 500 rows X 2 columns if you can come up with a better analysis)
Since SAS has ignored web analytics and Google Analytics is hmm hmm, this could be an area of opportunity for R developers as well to create a web analytics package.
My annual traffic to this blog was almost 99,000. Add in additional views on networking sites plus the 400-plus RSS readers, and I can say traffic was 120,000 for 2010. Nice. Thanks for reading, and I hope it was worth your time. (This is a long post and will take almost 440 secs to read, but the summary has just been given.)
My intent is either to inform you, give you something useful, or at least something interesting.
Sandro Saitta from http://www.dataminingblog.com/ just named me for an award on his blog (but my surname is ohRi; Sandro left me without an R. What would I be without R?).
Aw! I am touched. Google for “Data Mining Blog” and you will see that Sandro is as good as it gets in data mining writing.
DMR People Award 2010
There are a lot of active people in the field of data mining. You can discuss with them on forums. You can read their blogs. You can also meet them in events such as PAW or KDD. Among the people I follow on a regular basis, I have elected:
He has been very active in 2010, especially on his blog. Good work Ajay and continue sharing your experience with us!”
What did I write in 2010? Stuff.
What did you read on this blog? Well, that’s the top posts list.
2009-12-31 to Today
So how do people come here?
Well, I guess I owe Tal G for almost 9000 views (incidentally, I withdrew my blog from R-Bloggers and AnalyticBridge due to SEO keyword reasons and some spam I was getting; see below).
http://r-bloggers.com is still the cat’s whiskers and I read it a lot.
I still don’t know who linked my blog to a free sex movie site with 400 views, but I have a few suspects.
Still reading this post? Gosh, let me sell you some advertising. It is only $100 a month (yes, it’s a recession).
Advertisers are treated on a First In, Last Out (FILO) basis.
I have been told I am obsessed with SEO, but I don’t care much for search engines apart from Google, and yes, SEO is an interesting science (they should really rename it GEO, or Google Engine Optimization).
Apparently Hadley Wickham and Donald Farmer are big keywords for me so I should be more respectful I guess.
Search Terms for 365 days ending 2010-12-31 (Summarized)
| Search term | Views |
|---|---|
| test drive a chrome notebook | 467 |
| test drive a chrome notebook. | 215 |
| wps sas lawsuit | 158 |
| google maps jet ski | 123 |
| test drive chrome notebook | 96 |
| sas wps lawsuit | 85 |
| chrome notebook test drive | 83 |
| best statistics software | 74 |
| google maps jetski | 72 |
| donald farmer microsoft | 51 |
| best statistical software | 49 |
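Several of the search terms above are really the same query with a trailing period or a different word order. Here is a minimal sketch of merging them, assuming it is acceptable to ignore case, punctuation and word order. The counts are copied from the table; `normalize` is an illustrative helper, not something the stats page provides.

```python
import string
from collections import Counter

# Counts copied from the search-terms table above.
search_terms = {
    "test drive a chrome notebook": 467,
    "test drive a chrome notebook.": 215,
    "wps sas lawsuit": 158,
    "google maps jet ski": 123,
    "test drive chrome notebook": 96,
    "sas wps lawsuit": 85,
    "chrome notebook test drive": 83,
    "best statistics software": 74,
    "google maps jetski": 72,
    "donald farmer microsoft": 51,
    "best statistical software": 49,
}

def normalize(term):
    """Lowercase, strip punctuation, and sort the words so word order is ignored."""
    cleaned = term.lower().translate(str.maketrans("", "", string.punctuation))
    return " ".join(sorted(cleaned.split()))

totals = Counter()
for term, count in search_terms.items():
    totals[normalize(term)] += count

for key, count in totals.most_common(3):
    print(count, key)
```

This deliberately simple rule still keeps "jet ski" and "jetski", or "statistics" and "statistical", apart. Merging those would need stemming or fuzzy matching, which is beyond this sketch.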
What about outgoing links? Apparently I need to find a way to ask Google to pay me for the free advertising I gave their Chrome notebook launch. But since their search engine and browser are free to me, I guess we are even steven.
Clicks for 365 days ending 2010-12-31 (Summarized)
so in 2010,
SAS remained top daddy in business analytics,
R made revolutionary strides in terms of new packages,
JMP launched a new version,
SPSS got integrated with Cognos,
Oracle sued Google and did build a great Data Mining GUI,
LibreOffice gave you a non-Oracle OpenOffice (or an even more open office).
2011 looks like a fun year. Have a safe party.