- June 2021: Our paper "Transfer learning for topic labeling: Analysis of the UK House of Commons speeches 1935-2014" (with Hannah Béchara, Slava Jankin Mikhaylov, and Peter John) is now published in Research & Politics.
- April 2021: Our paper "Transfer learning for topic labeling: Analysis of the UK House of Commons speeches 1935-2014" (with Hannah Béchara, Slava Jankin Mikhaylov, and Peter John) has been accepted for publication in Research & Politics.
- Oct. 16: Our paper "Are abstracts enough for hypothesis generation?" (with Justin Sybrandt, Angelo Carrabba and Ilya Safro) has been accepted to the 2018 IEEE International Conference on Big Data.
- Sept.: Our book chapter "'More than Words': a quantitative text analysis of the treaty debates" (with Liam Weeks, Mícheál Ó Fathartaigh and Slava Jankin Mikhaylov) is now available as part of the book "The Treaty: Debating and Establishing the Irish State", edited by Liam Weeks and Mícheál Ó Fathartaigh.
- Sept.: I will be serving on the program committee for the 3rd Workshop on Open Science in Big Data (OSBD), held in conjunction with the 2018 IEEE International Conference on Big Data.
- July 27: Our paper "Signals to their parliaments? Governments' use of votes and policy statements in the EU Council" (with Stefanie Bailer and Sara Hagemann) has been accepted for publication in the Journal of Common Market Studies (JCMS).
- July 14: Our paper "Deep voting: measuring ideology and predicting votes from bill texts using neural networks" (with Nick Beauchamp) has been accepted for presentation at the 9th Annual Conference on New Directions in Analyzing Text as Data (TADA 2018).
- Apr. 19: I'm happy to announce that I have received a "National Scholars Program Award of Distinction", for which I was nominated by my research student Grace Glenn. From Clemson's National Scholars Program website: "This recognition goes to individuals for their tireless commitment to the intellectual, professional and personal development of each graduating Scholar." Pictures here.
- Apr. 13: A first version of our new working paper "Are abstracts enough for hypothesis generation?" (with Justin Sybrandt, Angelo Carrabba and Ilya Safro) is now available on arXiv.
- Jan. 29: Our paper "Addressing the challenges of executing a massive computational cluster in the cloud" (with Brandon Posey, Christopher Gropp, Boyd Wilson, Boyd McGeachie, Sanjay Padhi and Amy Apon) has been accepted to the 18th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid).
- Nov. 29: Our record for producing the largest high-performance cluster in the cloud (1,100,000 vCPUs!) was mentioned during the AWS re:Invent 2017 Keynote. Link to YouTube video.
- Oct. 15: A new version of our working paper "Scalable dynamic topic modeling with clustered latent Dirichlet allocation (CLDA)" is available on arXiv.
- Oct. 13-14: I'm at the amazing New Directions in Analyzing Text as Data (Text As Data 2017) conference at Princeton University.
- Oct. 9: Two of our papers were accepted to the 2017 IEEE International Conference on Big Data: "Representativeness of latent Dirichlet allocation topics estimated from data samples with application to Common Crawl" and "Detecting and summarizing emergent events in microblogs and social media streams by dynamic centralities".
- Oct. 8: Our paper "Automated cluster provisioning and workflow management for parallel scientific applications in the cloud" has been accepted to the 10th Workshop on Many-Task Computing on Clouds, Grids, and Supercomputers (MTAGS), which is held in conjunction with the 2017 International Conference for High Performance Computing, Networking, Storage and Analysis.
- Sept. 28: Our work on provisioning high-performance computing clusters on Amazon Web Services (AWS) is featured on the AWS Blog.
- Sept. 12: Our paper "Database of Parliamentary Speeches in Ireland, 1919-2013", which introduces one of the largest repositories of legislative speeches for quantitative text analysis, has been accepted to the 1st IEEE International Conference on the Frontiers and Advances in Data Science. More information about the data is available under Data on this website.
- Sept.: I will be serving on the program committee for the 2nd Southern Data Science Conference (SDSC) in Atlanta.
- Aug.: I will be serving on the program committee for the 2017 Open Science in Big Data (OSBD) workshop, held in conjunction with the 2017 IEEE International Conference on Big Data.