Backedup or Hacked Up

Decisionstats.com was undergoing some hacking attacks this past two weeks.

Backing up WordPress Blog-

  • use export feature in wordpress to create files.
  • If your number of articles is more than 70, then create multiple usernames, export using bulk apply to seperate wordpress xml files.
  • then in wordpress.com site, import the various xml files ( note multiple file method avoid corruption and using bulk apply -change author – makes it very fast)

Hackedup

  1. once your wordpress.com blog is updated, use a wild card redirect so as to preserve your search engine traffic. Your Backup is now online even if your original site is hacked
  2. Use the server to access your .htaccess file to check whether rouge redirects happened.
  3. Use logs of server access ( painful but true) to pinpoint IP addresses of hack attacks ( note hackers WOULD use relay servers to disguise IP addresses)
  4. To prevent domain name hijacking, make sure your who.is information is private
  5. Change your email passwords, security questions, server passwords. Use random password generators to create secure passwords.
  6. To prevent rogue malware from infecting your laptop create a dual boot Ubuntu/Windows laptop using a 10 minute tutorial. Use the Ubuntu Linux boot to do all the above operations.
  7. Inform the Federal authorities in cyber crime division with the server logs and a SPECIFIC complaint ( no rambling sob stories)
  8. Pray to God, Matt ( both Cutts and Mullenweg), and if all above steps fail ask Donncha O Caoimh at http://ocaoimh.ie/about/ to step in. 

Biz Stone finally talks business

Twitter co-founder, and creator of Blogger Mr Biz Stone finally set out a short brief email ( or twemail) on the changes in Twitter’s terms of service.The very concise email is below and an excellent interview with the man is at http://therumpus.net/2009/04/the-rumpus-interview-with-twitter-co-founder-biz-stone/

Incidentally Biz Stone’s inventions are kind of revolutionary in social media – he also founded Blogger ( blogging and micro blogging have done more to confound LarryRank algorithm at Google Search than anyone else).

What does an analytic, data whining blog have to do with social media. Plenty. If you have ever designed a propensity scoring model for targeting customers based on their behavior , more clean data that is identifiable an individual level is always a boon. The current trend for sentiment analysis is simply addition of text keywords ( or categorical variables if you insist) to the existing customer database.

Can adding keywords from blogs, tweets, web searches, TO existing data about you (credit bureau, demographic, purchase behavior)- can this lead to a better lift in the models. Yes.

Will this lead to more privacy debates. Yes. Given the huge volume of text variables, as well as the huge number of potential customers- privacy debates are quite statistically irrational ( but we digress into economics here).

No one is interested in selling just 1 more product. They use people (nicknamed Numerati) for writing queries to append, manipulate data so as to AGGREGATE and then build a model. Only after the models are built are the scores disaggregated AND scored individually- usually in automated manner.

No company is interested in selling to one consumer so they dont stoop at a privacy invasive search of individuals.

Advertsing is not an evil way of making money, Mr Stone. Just Trust Google and the guys who could not complete their Phd because they WERE making money.

What if all maths grads did that- ..and that’s an interesting thought.

Hi,

We’d like to let you know about our new Terms of Service. As Twitter
has evolved, we’ve gained a better understanding of how folks use the
service. As a result, we’ve updated the Terms and we’re notifying
account holders.

We’ve posted a brief overview on our company blog and you can read the
Terms of Service online. If you haven’t been by in a while, we invite
you to visit Twitter to see what else is new.

Overview: http://blog.twitter.com
Terms: http://www.twitter.com/tos
Twitter: http://www.twitter.com

These updates complement the spirit of Twitter. If the nature of our
service changes, we’ll revisit the Terms as necessary. Comments are
welcome, please find the “feedback” link on the Terms of Service page.

Thanks,
Biz Stone, Co-founder
Twitter, Inc.


Interview Stephen Baker Author The Numerati

Here is an interview with Stephen Baker, the author of the famous and remarkable book The Numerati. Stephen is the senior editor at Businessweek and his remarkable book made the world sit up and pay attention because for the first time, anyone wrote of the increasingly quant driven lives we lead thanks to the internet and the analytical brains that power the stimulus, design and targeting of it. Increasing amounts of data is collected about consumers than at any previous point of time in human history and the number crunchers or the quant jocks are the ones who increasingly help with decision making and decision management. Steve calls these people “The Numerati” or the new math people who help shape our lives.

There will always be lawyers and financiers who make loads of money. But they will have quantitative experts on their teams- Stephen L Baker

Ajay- Describe your career journey from high school to a technology writer to author of The Numerati.

Steve- I was always interested in history and in literature, and in college I fell in love with Spanish. So after college, I moved to Ecuador, taught English, and wrote fiction. I saw early on that I wasn’t going to be able to make a living with fiction. So I went into journalism. My goal was to become a correspondent in Latin America. Through my 20s, I worked in Vermont, Madrid, Argentina, Venezuela, Washington DC, and El Paso, Texas. And I finally got the job I was looking for, bureau chief for the Mexico bureau of BusinessWeek magazine.

After Mexico, my family and I moved to Pittsburgh. It appeared that the magazine was losing interest in heavy industry in the mid-90s, so I began to write about software and robotics coming out of at Carnegie Mellon University. That was my transition into technology. A year later, BusinessWeek offered me a job covering technology in Europe. I moved with my family to Paris, where we lived for four years. I focused largely on mobile communications. It seemed to me that the combination of mobility and the Internet would fundamentally change communications.

I returned to New York in 2002. I focused on big picture stories. One day in 2005, I proposed a story about the decline of the U.S. technology industry. I argued that we were behind in wireless and in broadband, we were graduating fewer scientists and engineers than other great powers, especially in Asia. One editor pointed out that mathematics was critical for these competitive issues. The editor in chief, Steve Adler, called for a cover story on math, and he assigned it to me. I didn’t know much about math at the time, and I still don’t. But this gave me the chance to dive into the world of data analysis. I wrote a cover story, Math will Rock Your Business, and later got the contract to write the Numerati.
Ajay- How do you think the government can motivate more American students to science careers?

Steve- I think focusing on the science that kids find cool–robotics, space and ocean exploration, would help. Funding basic research would be useful. But I don’t think it’s entirely a governmental issue. Parents, companies, universities, they all have to participate.
Ajay- What are the top  tips you would give to aspiring technology writers and bloggers (like myself)?

Steve-
1) Learn about non tech subjects, such as history, literature, art and psychology
2) Work on writing clearly for non experts. Avoid jargon.
3) Do reporting
4) Do more reporting

Ajay-The Numerati portrays a math elite which breaks the stereotype of the lonely, nerdy geek. How important do you think is that common people be more educated in math so they are more aware of marketing operations and credit offers?

Steve- I think it’s important for common people, as you call them, to understand basic statistics. More and more of our lives are going to be analyzed and communicated to us statistically. Those who do not understand this will not know to ask the right questions, and will be easily fooled. This is also true within companies. CEOs can be fooled by numbers, just like anyone else.
Ajay- Asia delivers a disproportionate number of science graduates. Yet one generation ago American and European heritage scientists made the trip to the moon with very basic computers. As our lives get increasingly shaped by the Numerati, how important are geo-cultural influences in its membership?

Steve- Most of the Numerati I met in the United States were born outside the U.S. The US has long relied on foreign brains, especially for its technology industry. As the Numerati study people’s lives, the quantitative experts will increasingly need to work closely with linguists, anthropologists, and psychologists. And they’ll need to understand different global cultures and languages. In this sense, the international nature of the Numerati is an advantage.
Ajay- Do you think the shift in money and influence from lawyers and financiers to scientists and mathematicians is temporary or is it here to stay?

Steve-I think it’s here to stay. There will always be lawyers and financiers who make loads of money. But they will have quantitative experts on their teams.

Ajay- What influenced your decision to be associated with Predictive Analytics world?

Steve- I had the privilege of interviewing Eric Siegel as I was researching the book. We’ve kept in touch since then. I think he’s very bright and does excellent work.
Ajay- What does Stephen Baker do when not writing books or articles or observing the world go around him?

Steve- I like to ride bicycles, I like to travel. I love Spanish and French and baseball and music

Biography-

Stephen L. Baker is the author of The Numerati and a senior writer at BusinessWeek, covering technology. Previously he was a Paris correspondent. Baker joined BusinessWeek in March, 1987, as manager of the Mexico City bureau, where he was responsible for covering Mexico and Latin America. He was named Pittsburgh bureau manager in 1992. Before BusinessWeek, Baker was a reporter for the El Paso Herald-Post. Prior to that, he was chief economic reporter for The Daily Journal in Caracas, Venezuela. Baker holds a bachelor’s degree from the University of Wisconsin and a master’s from the Columbia University Graduate School of Journalism.

You can read more about the Numerati at http://thenumerati.net/index.cfm?catID=18 Stephen L Baker is the keynote speaker at Predictive Analytics World and you can check the details here http://www.predictiveanalyticsworld.com/register.php if you want to listen to  him at the event.

You can follow Steve on twitter at http://twitter.com/stevebaker and follow his blog here http://www.businessweek.com/the_thread/blogspotting/

Hidden Agenda- Teach the Kids

Win $25K in cold hard cash by creating a better way to teach a high school subject – on Facebook! Get creative. Blow away the competition! The best example of fun and effective “social learning” gets the juicy prize.

Here is the official link and page-

In 2003, a small nonprofit foundation ran the first ever educational video game development contest. With the help of a powerful and exciting advisory board, college students around the country worked throughout the school year to build the best games possible for middle school kids with two key goals – their games had to be FUN. And they had to teach a middle school subject. The prize was $25,000 cash – practically the foundation’s entire operating budget for the year.

The games these students built in 2003 and in the years since then were revolutionary. The winners are available for free at http://www.hagames.com, and more about the contest can be found at http://www.hiddenagenda.com. Several thousand copies of the games have also been distributed on computers donated to underprivileged middle school children in urban areas. And more is being done with them, and the games they’ve inspired, every day.

http://www.facebook.com/pages/Hidden-Agenda/114956308585

ha

As someone who studies besides , socializes and tries to teach American children ( my coursemates are a decade younger than me)- this is one cool cool way of making the world a better place.

http://www.facebook.com/pages/Hidden-Agenda/114956308585

Here comes the Cloud- VMWare

Cloud

From http://vcloudexpress.terremark.com

From the fascinating umbrella group at http://twitter.com/cloudcomp_group

If you are a Desktop software expert who thinks the Cloud is nothing to worry, remember what happened to Punching Tapes, Mainframes, Vaccuum tubes.

With the current pricing, even smaller software package creators can offer Statistics as a Service ( STaS).

Great move by VMware !

Journal of Statistical Software

Here is a good open content Journal for people wanting to keep track of latest in statistical software.

It is called Journal of Statistical Software.

Citation: http://www.jstatsoft.org/

Established in 1996, the Journal of Statistical Software publishes articles, book reviews, code snippets, and software reviews on the subject of statistical software and algorithms.  The contents are freely available on-line.  For both articles and code snippets the source code is published along with the paper.

Implementations can use languages such as C, C++, S, Fortran, Java, PHP, Python and Ruby or environments such as Mathematica, MATLAB, R, S-PLUS, SAS, Stata, and XLISP-STAT.

E.g Book Reviews of  A Handbook of Statistical Analyses Using SAS (Third Edition)

and Statistics and Data with R: An Applied Approach Through Examples

jss

It is really cutting edge stuff for someone who wants to keep up with the latest and fast moving tech trends in statistical software and has convenient RSS feeds as well announce alerts for emails.

Note- Various Journals can be ranked using a quantitative index called Impact Factor

Citation http://in-cites.com/research/2007/august_27_2007-2.html

E.G For Statistics

In these columns, total citations to a journal’s published papers are divided by the total number of papers that the journal published, producing a citations-per-paper impact score over a five-year period (middle column) and a 26-year period (right-hand column).

Journals Ranked by Impact:
Statistics & Probability

Rank

2006
Impact Factor

Impact
2002-06

Impact
1981-2006
1 Bioinformatics
(4.89)
Bioinformatics
(9.87)
Econometrica
(52.93)
2 Biostatistics
(3.01)
J. Royal Stat. Soc. B
(6.75)
J. Royal Stat. Soc. B
(27.32)
3 Chemom. Intell. Lab.
(2.45)
Biostatistics
(6.56)
J. Am. Stat. Assoc.
(25.11)
4 Econometrica
(2.40)
J. Computat. Biology
(6.49)
Biometrika
(22.75)
5 J. Royal Stat. Soc. B
(2.32)
Econometrica
(5.82)
Annals of Statistics
(21.31)
6 IEEE ACM T Comp. Bi.
(2.28)
J. Chemometrics
(5.08)
Biometrics
(20.32)
7 J. Am. Stat. Assoc.
(2.17)
J. Am. Stat. Assoc.
(4.95)
Technometrics
(17.74)
8 Multivar. Behav. Res.
(2.10)
Statistical Science
(4.19)
Multivar. Behav. Res.
(16.62)
9 J. Computat. Biology
(2.00)
Annals of Statistics
(3.94)
Bioinformatics
(16.37)
10 Annals of Statistics
(1.90)
Stat. in Medicine
(3.62)
J. Royal Stat. Soc. A
(14.46)

Statistics: Code and Poetry

Dstats

As you can see, my poem on Michael Jackson got more page views

than any code I wrote or any interview I did. which reminds me to be

more humble than be humbled :)) More Details are on Advertise page above.