Using Windows Azure Machine Learning as a service with R #rstats

A Brief Tutorial I wrote by playing with the software at manage.windowsazure.com

Happy July 4th

To all my American friends.


Great Way to learn Git easily

A great way to learn Git easily is here: https://try.github.io/

Screenshot 2014-06-24 19.23.59

This is a much better designed code school project than the one for R


However, Swirl is a great way to learn R in an interactive way. Its only drawback is that it needs to be integrated with something like http://www.r-fiddle.org/#/ for a true automated browser-only version.

Why do I favor automated elearning solutions now? Because teaching the same thing again and again can be boring for the teacher, and videos can be boring for the students. Note how the potential student is given positive reinforcement to boost his morale, something any good teacher knows.

The Day My Web Analytics Went Nuts

179 views 1 Visitor 15 different countries.

What is this?

Screenshot 2014-06-21 19.26.58


Fixing Search for Jobs and Resumes

When we search for websites on Google and Bing, we get relatively efficient results for what we are searching for based on keywords alone. However, for both candidates and companies, searching across jobs and resumes is tougher because most job portals do not have the chops to invest in algorithmic unstructured text search. Instead, an entire industry of recruitment agencies and consultants exists so that manual intervention reduces the inefficiency of this particular case of search. Even recruitment agencies have a checklist of questions to ask, and they store the data in CRM software.

Why is this possible? Economics is the study of incentives and a big chunk of paying customers for Job Portals is recruitment consultants. Making Job and Resume search much more efficient would enable both candidates and companies to bypass the traditional model of going via agencies and consultants.

Perhaps the only company with a strong enough database is LinkedIn, with Google and Facebook close behind. This is a billion-dollar industry, it is ripe for disruption, and the bits and pieces for fixing this basic mathematical search problem already exist. Perhaps what is needed is a database with enough data, with activity captured through large enough agencies or HCRM software, integrated with algorithms.

The basic problem is spammy or outdated results in both resume search and job search. Spam should be fixed in a cutting way, don't you think?

Screenshot 2014-06-19 08.06.35

Cloud versions of Latex

I work with LyX http://www.lyx.org/, the GUI for LaTeX http://en.wikipedia.org/wiki/LaTeX, for writing my books. After 18 years of writing in MS Word, yes, I have rightly been criticized for my bad formatting. I hope to do a better job for R for Cloud Computing. Someday I will learn LaTeX and Sweave http://www.stat.uni-muenchen.de/~leisch/Sweave/ as well (sighs).

Sweave is a tool that allows to embed the R code for complete data analyses in latex documents. The purpose is to create dynamic reports, which can be updated automatically if data or analysis change. Instead of inserting a prefabricated graph or table into the report, the master document contains the R code necessary to obtain it. When run through R, all data analysis output (tables, graphs, etc.) is created on the fly and inserted into a final latex document. The report can be automatically updated if data or analysis change, which allows for truly reproducible research.

Where can I get it?

The Sweave software itself is part of every R installation
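
For the curious, a master document looks roughly like the sketch below. This is my own minimal illustration, not something taken from the Sweave site: the file name report.Rnw, the chunk labels, and the choice of R's built-in cars data set are all assumptions made for the example.

```latex
\documentclass{article}
\begin{document}

% R code chunk: runs when the file is processed, but is not echoed
<<fit-model, echo=FALSE>>=
# Fit a simple linear model on R's built-in cars data set
fit <- lm(dist ~ speed, data = cars)
@

The data set has \Sexpr{nrow(cars)} rows, and the estimated slope is
\Sexpr{round(coef(fit)[2], 2)}.

% Figure chunk: the plot is generated on the fly and included here
<<scatter, fig=TRUE, echo=FALSE>>=
plot(cars)
abline(fit)
@

\end{document}
```

Assuming the file is saved as report.Rnw, running R CMD Sweave report.Rnw should produce report.tex, which can then be compiled with LaTeX like any other document (Sweave.sty, which ships with R, has to be on LaTeX's search path). If the data or the model changes, rerunning the same two steps updates the numbers and the plot.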

But alternatives to Lyx for a browser only version of Latex do exist.

There are three of them right now:

1) https://www.sharelatex.com/ ShareLaTeX is now open source! ShareLaTeX is an online real-time collaborative LaTeX editor, and you can now run your own local version where you can host, edit, collaborate in real-time, and compile your LaTeX documents. You can run the hosted version at http://www.sharelatex.com.

Screenshot 2014-06-12 22.01.51

2) http://fiduswriter.org/ Fidus Writer is an online collaborative editor especially made for academics who need to use citations and/or formulas. The editor focuses on the content rather than the layout, so that with the same text, you can later on publish it in multiple ways: On a website, as a printed book, or as an ebook. In each case, you can choose from a number of layouts that are adequate for the medium of choice.

Screenshot 2014-06-12 22.16.38

3) https://www.writelatex.com/

Screenshot 2014-06-13 01.40.54

All are equally good and equally nascent. I like that WriteLaTeX has an API.


I like the Fidus Writer interface more, but ShareLaTeX has a bigger set of templates. I think WriteLaTeX is more evolved than Fidus Writer but will still need to catch up with ShareLaTeX.

Fidus Writer and ShareLaTeX are both available on GitHub for tinkering.

https://github.com/fiduswriter/fiduswriter and

https://github.com/sharelatex/sharelatex


Maybe I will have to wait for Google Docs to create an application for LaTeX typesetting. In the meantime, we shall LyX.

(Hat tip – S Boucher for pointing me to write latex)

Hacks for Travian

What is Travian?

As per Jimmy?


Travian is a persistent, browser-based, massively multiplayer, online real-time strategy game developed by the German software company Travian Games.[1] It was originally written and released in June 2004 by Gerhard Müller. Set in classical antiquity, Travian is a predominantly militaristic real-time strategy game.

Travian has been translated into over 40 languages from the original German version,[1][2] and has over 5 million players on over 300 game servers worldwide.[3][4][5] In 2006, it won the Superbrowsergame Award, in the large games category.[1][4][6][7]

Travian is programmed in PHP and runs in most modern browsers. Its creators may have drawn from an earlier German board game, The Settlers of Catan, for layout[8] and the resource development theme.


Get onto gettertools.com/ts8.travian.com.3; it's awesome for calculations.

Read from http://travian.wikia.com/wiki/Travian_Wiki when you are waiting in the game (yes there is waiting)

Early Game

1) Use Gold to take 25% bonus in Lumber and Clay

2) Use crop finder to locate your second city away from your enemy alliance and near a safe area for your alliance

3) Put troops on evasion and kill and plunder with Hero but Hero bonus should be for killing not crops

4) Try and Capture an Oasis Early and Try and Get your best troops early. This means doing more Academy and Hero upgrades.

5) Choose your capital wisely if you have reached multiple cities. Unselecting a city as capital knocks all buildings above level 10 back down to level 10 and destroys some buildings (Mason) completely.

6) Try and hit 50 points every day in daily tasks

Mid Game

1) Use Gold to take 25% bonus in Lumber, Iron and Crops

2) Rally Point should be way up

3) Try and hit 75 points every day in daily tasks.


End Game

1) Spare some gold for this part

2) Be fickle and lie low.

3) You will need a lot of crops or you will die

(to be continued)


