Machine learning with evolving data

Traditional machine learning and data mining methods rely on the assumption that data distribution stays the same during model training and operation. In reality this is is often not the case. As the world is continuously changing, so evolve data that describe it. My research background is in machine learning methods that can diagnose themselves and adapt to changing data distribution over time. 

The main outcome of my PhD thesis research was pointing out a distinction between the point in time that change takes place, and optimal training data history for updating predictive models. I have also made contributions in forming optimization and evaluation criteria for predictive models with evolving streaming data, (inter)active learning for streaming data and adaptive data preprocessing.

Selected publications

Position papers and surveys

  • An overview of concept drift applications by Žliobaitė et al. 2016, a book chapter. DOI  PDF 
  • A Survey on Concept Drift Adaptation by Gama et al. 2014 in ACM Computing SurveysDOI  PDF
  • Open Challenges for Data Stream Mining Research by Krempl et al. 2014 in SIGKDD Explorations. DOI  PDF
  • Next challenges for adaptive learning systems by Žliobaitė et al. 2012 in SIGKDD Explorations. DOI  PDF
  • Learning under Concept Drift: an Overview by Žliobaitė 2010 in arXiv.

Evaluation methodologies

  • Evaluation methods and decision theory for classification of streaming data with temporal dependence by Žliobaitė et al. 2015 in Machine Learning. DOI  PDF
  • Controlled Permutations for Testing Adaptive Learning Models by Žliobaitė 2014 in Knowledge and Information Systems. DOI  PDF
  • How good is the Electricity benchmark for evaluating concept drift adaptation by Žliobaitė 2013 in arXiv.

Active learning

  • Active Learning with Drifting Streaming Data by Žliobaitė et el. 2014 in IEEE Transactions on Neural Networks and Learning SystemsDOI  PDF

Cost-sensitive adaptation

  • Towards cost-sensitive adaptation: when is it worth updating your predictive model? by Žliobaitė et al. 2015 in NeurocomputingDOI  PDF

Adaptive reprocessing

  • Adaptive Preprocessing for Streaming Data by Žliobaitė and Gabrys 2014 in IEEE Transactions on Knowledge and Data Engineering. DOI  PDF

Handling missing values

  • Optimizing regression models for data streams with missing values by Žliobaitė and Hollmén 2015 in Machine Learning.  PDF  DOI

Handling concept drift

  • Adaptive Training Set Formation by Žliobaitė, PhD thesis PDF
  • Combining similarity in time and space for training set formation under concept drift by Žliobaitė 2011 in Intelligent Data AnalysisDOI  PDF
  • Theoretical Window Size for Classification in the Presence of Sudden Concept Drift by Žliobaitė and Kuncheva 2010, Technical Report PDF
  • On the Window Size for Classification in Changing Environments by Kuncheva and Žliobaitė in 2009 in Intelligent Data AnalysisDOI  PDF
  • Determining the Training Window for Small Sample Size Classification with Concept Drift by Žliobaitė and Kuncheva 2009 in IEEE ICDM workshopsDOI  PDF
  • Expected Classification Error of the Euclidean Linear Classifier under Sudden Concept Drift by Žliobaitė 2008 in IEEE FSKDDOI  PDF

Change detection

  • BLPA: Bayesian Learn-Predict-Adjust Method for Online Detection of Recurrent Changepoints by Maslov et al. 2017 in IJCNNPDF 
  • Modelling Recurrent Events for Improving Online Change Detection by Maslov et al. 2016 in SIAM SDM. DOI  PDF
  • Change with Delayed Labeling: when is it detectable? by Žliobaitė 2010 in IEEE ICDM workshopsDOI  PDF