Press the Windows key + R. The Windows key is the one with the Windows logo.
You will see this.
Now type the name of the program you want to edit and press Enter.
ESPN Cricinfo is the best site for cricket data (you can see an earlier detailed post on the database here: https://decisionstats.com/2012/04/07/cricinfo-statsguru-database-for-statistical-and-graphical-analysis/ ), and using the XML package in R we can easily scrape and manipulate the data.
Here is the code.
library(XML)
url="http://stats.espncricinfo.com/ci/engine/stats/index.html?class=1;team=6;template=results;type=batting"
#Note I can also break the url string and use paste command to modify this url with parameters
tables=readHTMLTable(url)
tables$"Overall figures"
#Now see this- since I only got 50 results in each page, I look at the url of next page
table1=tables$"Overall figures"
url="http://stats.espncricinfo.com/ci/engine/stats/index.html?class=1;page=2;team=6;template=results;type=batting"
tables=readHTMLTable(url)
table2=tables$"Overall figures"
#Now I need to join these two tables vertically
table3=rbind(table1,table2)

Note: I can also automate the web scraping. Now that the data is within R, we can use something like Deducer to visualize it.
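Here is a rough sketch of how that automation could look. It assumes the site still serves these pages in the same layout, and that asking for page=1 returns the same table as the first URL above (I have not verified either). The helper names statsguru_url and scrape_pages are made up for this illustration; only paste0, lapply, rbind and readHTMLTable are real functions.

```r
# Build the Statsguru URL for one results page.
# The query string mirrors the URLs used above; only `page` (and `team`) vary.
statsguru_url <- function(page = 1, team = 6) {
  paste0("http://stats.espncricinfo.com/ci/engine/stats/index.html?",
         "class=1;page=", page, ";team=", team,
         ";template=results;type=batting")
}

# Fetch several pages and stack them vertically with rbind.
# `fetch` is a parameter so the loop can be tried offline with a stand-in;
# the default uses readHTMLTable from the XML package, as in the code above.
scrape_pages <- function(pages,
                         fetch = function(u) XML::readHTMLTable(u)[["Overall figures"]]) {
  tables <- lapply(pages, function(p) fetch(statsguru_url(p)))
  do.call(rbind, tables)
}

# Needs a network connection and the XML package:
# table_all <- scrape_pages(1:2)
```

Passing the fetch step in as an argument is just a convenience: it keeps the paging logic separate from the download, so the loop can be checked without hitting the site.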
Twitter Bootstrap is a free collection of tools for creating websites and web applications. It contains HTML and CSS-based design templates for typography, forms, buttons, charts, navigation and other interface components, as well as optional JavaScript extensions.
It is the most popular project on GitHub and is used by NASA and MSNBC, among others.
———————-
If you, like me, hate to get down and dirty in HTML, CSS, and jQuery (not to mention the excellent Codecademy HTML/CSS tutorials and jQuery track) and want to create a pretty simple website for yourself, Jetstrap helps you build websites in the popular (and very minimalistic) Twitter Bootstrap design.
And it’s free! Just point, click, and paste your content to get awesome CSS and HTML. It also lets you download the HTML to paste into your existing site!
Here is one I created in 5 minutes!
So lose your old website! Because not every website needs WordPress!
Try Jetstrap for Bootstrap!
UPDATED- Here are three great examples of a visualization making a process easy to understand. Please click on the images to read them clearly.
1) It visualizes CRISP-DM and is made by Nicole Leaper (http://exde.wordpress.com/2009/03/13/a-visual-guide-to-crisp-dm-methodology/)
2) KDD (Knowledge Discovery in Databases): a visualization by Fayyad, whom I have interviewed here at http://www.decisionstats.com/interview-dr-usama-fayyad-founder-open-insights-llc/
and work by Gregory Piatetsky-Shapiro, interviewed by this website here:
https://decisionstats.com/2009/08/13/interview-gregory-piatetsky-kdnuggets-com/
3) I am also attaching a visual representation of SEMMA from http://www.dataprix.net/en/blogs/respinosamilla/theory-data-mining
Inspired by true events at
1) Huge variety of courses from the best professors in the world (see the Gamification course from Coursera below, or Machine Learning, or Human-Computer Interaction)
2) They are free (which is a mistake)! Time is not free.
Also, signature courses at Coursera now offer credible tracks for $39, and they have more support.
Why do you as a student need support? Because sometimes you get stuck, and sometimes you need human interaction to stay motivated.
3) Coursera: I love these things:
Can play the course videos faster, at 1.75x speed (because seriously, I get distracted otherwise)
Can turn on closed captions (CC) in multiple languages; reading is so much faster
Best feature: in-video quizzes
Most number of courses
Free!
Codecademy:
Makes learning fun
Makes it easy to learn a language
I wish someone could mash more of Coursera's content with Codecademy's gamification and teach hacking and data sciences to the next generation of hackers!!
The rest of the websites are good, but I stick to Coursera and Codecademy!
5) Education empowers! Every person who learns R or JMP through a free MOOC will create more value for themselves, their customers, their society, and their country than if they had remained uneducated because they could not afford the training.
The 3.0 Era for R starts today! Changes include better Big Data support.
Read the NEWS here
install.packages() has a new argument quiet to reduce the amount of output shown.
cite() and citeNatbib() have been added, to allow generation of in-text citations from "bibentry" objects. A cite() function may be added to bibstyle() environments.
merge() works in more cases where the data frames include matrices. (Wish of PR#14974.)
sample.int() has some support for n >= 2^31: see its help for the limitations. A different algorithm is used for (n, size, replace = FALSE, prob = NULL) for n > 1e7 and size <= n/2. This is much faster and uses less memory, but does give different results.
list.files() (aka dir()) gains a new optional argument no.. which allows to exclude "." and ".." from listings.
Rprof() now optionally records information at the statement level, not just the function level.
available.packages() gains a "license/restricts_use" filter which retains only packages for which installation can proceed solely based on packages which are guaranteed not to restrict use.
The breaks argument in hist.default() can now be a function that returns the breakpoints to be used (previously it could only return the suggested number of breakpoints).

This section applies only to 64-bit platforms.
[...] anyDuplicated() and match()) or because other limits (e.g. of character strings or matrix dimensions) would be exceeded or the operations would be extremely slow.
length() returns a double for long vectors, and lengths can be set to 2^31 or more by the replacement function with a double value.
[...] svd()).
dist() can produce dissimilarity objects for more than 65536 rows (but for example hclust() cannot process such objects).
serialize() to a raw vector is unlimited in size (except by resources).
R_alloc can now allocate 2^35 or more bytes.
agrep() and grep() will return double vectors of indices for long vector inputs.
Calls to .C() have been replaced by .Call() to allow long vectors to be supported (now or in the future). Regrettably several packages had copied the non-API .C() calls and so failed.
.C() and .Fortran() do not accept long vector inputs. This is a precaution as it is very unlikely that existing code will have been written to handle long vectors (and the R wrappers often assume that length(x) is an integer).
[...] sort() work for long vectors.
rank(), sort.list() and order() support long vectors (slowly except for radix sorting).
sample() can do uniform sampling from a long vector.

[...] This has been done for packages base, methods, splines and tcltk: it was already in place for the other standard packages.
Since these entry points are always accessed by the R entry points they do not need to be in the load table which can be substantially smaller and hence searched faster. This does mean that .C / .Fortran / .Call calls copied from earlier versions of R may no longer work, but they were never part of the API.
.Call() calls in package base have been migrated to .Internal() calls.
solve() makes fewer copies, especially when b is a vector rather than a matrix.
eigen() makes fewer copies if the input has dimnames.
A foreign function call (.C() etc) in a package without a PACKAGE argument will only look in the first DLL specified in the ‘NAMESPACE’ file of the package rather than searching all loaded DLLs. A few packages needed PACKAGE arguments added.
The @<- operator is now implemented as a primitive, which should reduce some copying of objects when used. Note that the operator object must now be in package base: do not try to import it explicitly from package methods.
[...] NA, sometimes with a warning. They are now legal for long vectors so there is no longer a warning, and x[2^31] <- y will now extend the vector on a 64-bit platform and give an error on a 32-bit one.
[...] ulimit in a bash shell, limit in csh), to set limits on overall memory consumption of an R process, particularly in a multi-user environment. A number of packages need a limit of at least 4GB of virtual memory to load.
64-bit Windows builds of R are by default limited in memory usage to the amount of RAM installed: this limit can be changed by command-line option --max-mem-size or setting environment variable R_MAX_MEM_SIZE.
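A couple of the new features listed above are easy to try for yourself. The snippet below is only a small sketch, assuming R 3.0.0 or later; the directory name, the file name and the toy data are made up for the illustration.

```r
# Work in a fresh temporary directory so the listing is predictable.
d <- file.path(tempdir(), "news-demo")
dir.create(d, showWarnings = FALSE)
file.create(file.path(d, "a.txt"))

# With all.files = TRUE the listing normally includes "." and "..";
# the new no.. argument drops those two entries.
with_dots    <- list.files(d, all.files = TRUE)
without_dots <- list.files(d, all.files = TRUE, no.. = TRUE)

# breaks can now be a function that returns the breakpoints themselves,
# not just a suggested number of cells.
x <- c(1, 2, 2, 3, 7, 9)
h <- hist(x, breaks = function(v) seq(0, 10, by = 5), plot = FALSE)
h$breaks
h$counts

# The new quiet argument (not run here; "somePackage" is a placeholder):
# install.packages("somePackage", quiet = TRUE)
```

Using plot = FALSE keeps the histogram example to the computed breaks and counts, so it can be run in a plain console without opening a graphics device.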