I like learning and using new technologies and open-source software. For instance, I use Apache Camel and MongoDB to download and analyze Twitter data. Here you can download a sample file with some attributes of 1 million Twitter users.