I got featured as a part of series on A Day In The Life Of A Data Scientist at https://www.kdnuggets.com/2017/11/day-life-data-scientist.html
My typical day begins at 9 AM with a scrum call. Our methodology of project working is to divide tasks into two week goals or sprints. This is basically the agile development method for software and it is different from CRISP-DM or KDD methodologies.
What do I do on a daily basis? It could be many things – including not just emails and meetings. I could be using Hive to pull data, using it to merge data (or using Impala), I could be using PySpark (Mllib) to make churn models or do k means clustering. I could be pulling data in an excel file to make summaries and I could be making data visualizations. Some days I prototype in R using some machine learning packages. When I code Big Data, I could be using the GUI for Hadoop HUE or I could be using command line programming including batch submitting of code.