Machine learning with fossil data: analyzing environmental and climate change

A tutorial to be presented at ECMLPKDD'17, Skopje on September 22, 2017


The tutorial is scheduled for Friday afternoon. It's a good opportunity to hear about quite an exotic area of data science, welcome! 


Tentative schedule (Friday)

14:00 - 14:40 Analysis of the fossil record: research questions, challenges and trends
14:40 - 14:50 Short break
14:50 - 15:40 Overview of machine learning tasks and existing approaches
15:40 - 16:00 Afternoon coffee break
16:00 - 16:40 Fossil data and fossil databases
16:40 - 16:50 Short break
16:50 - 17:40 Modeling relationships between organisms and their environments 


Slides (last update 2017 09 22):  part Ia   part Ib   part II   part III   bibliography


Presenter: Indrė Žliobaitė

Abstract: Global fossil databases have been growing rapidly in the last decade. They aggregate and accumulate findings and knowledge that palaeobiologists acquired over many years. These datasets are big data in their essence - compiled from different sources, to an extent subjective, include specific biases and uncertainties, data sparseness and quality varies over time and space. In addition, to understand relations between organisms and climate high volume and large velocity satellite observations some into play that require scalability in computing. Databases of this kind offer an excellent ground for interdisciplinary machine learning research. This tutorial will outline research questions that could be addressed using computational methods, discuss characteristics of fossil data and computational tasks for machine learning and data mining, overview existing computational approaches, and discuss what more could be done from the machine learning and data mining perspective. 


Job opening in Helsinki

I'm looking for a PhD student in machine learning for evolving data in biogeosciencesApply here by 31 October, 2017.