In Pig:
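-- Load the 'nyse' table through HCatalog (start Pig with -useHCatalog)
a = LOAD 'nyse' USING org.apache.hcatalog.pig.HCatLoader();
-- Keep only the IBM rows
b = FILTER a BY stock_symbol == 'IBM';
-- Put all remaining rows into a single group so AVG sees every row
c = GROUP b ALL;
-- Average the stock_volume column over that group
d = FOREACH c GENERATE AVG(b.stock_volume);
DUMP d;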
In SQL (Hive):
SELECT AVG(stock_volume) FROM nyse WHERE stock_symbol = 'IBM';
(from the Hortonworks Sandbox example for HDP 2.0)
Also see:
http://www.quora.com/How-can-R-and-Hadoop-be-used-together