Yukino Baba, Ph.D. (Information Science and Technology)

Assistant Professor
Machine Learning and Data Mining Research Laboratory,
Department of Intelligence Science and Technology,
Graduate School of Informatics, Kyoto University

News

August 6, 2017
Our paper on Hyper Questions: Unsupervised Targeting of a Few Experts in Crowdsourcing has been accepted to CIKM 2017.
  • Hyper Questions: Unsupervised Targeting of a Few Experts in Crowdsourcing
    Jiyi Li, Yukino Baba, Hisashi Kashima
    26th ACM International Conference on Information and Knowledge Management (CIKM), 2017
November 16, 2016
Our paper on Pairwise HITS: Quality Estimation from Pairwise Comparisons in Creator-Evaluator Crowdsourcing Process has been accepted to AAAI 2017.
November 16, 2016
Our paper on Predicting Fuel Consumption and Flight Delays for Low-cost Airlines has been accepted to IAAI 2017.

Research Interests

My current research topic is statistical quality control in crowdsourcing. I'm also broadly interested in:

  • Combining human brainpower and computer power, i.e., Human Computation and crowdsourcing [KDD 2013][IAAI 2013][IJCAI 2013]
  • Extracting knowledge from users' implicit behaviors on the Web, e.g., social tagging [ECAI 2010] and spelling errors [ACL 2012]

Keywords:
Data Mining, Crowdsourcing, Human Computation, Web Mining

Professional Experience

Assistant Professor, September 2015 to present
Machine Learning and Data Mining Research Laboratory,
Department of Intelligence Science and Technology,
Graduate School of Informatics,
Kyoto University
Program-Specific Assistant Professor, April 2015 to September 2015
Machine Learning and Data Mining Research Laboratory,
Department of Intelligence Science and Technology,
Graduate School of Informatics,
Kyoto University
Project Research Associate, April 2014 to March 2015
Global Research Center for Big Data Mathematics, National Institute of Informatics
Project: JST, ERATO, Kawarabayashi Large Graph Project
Project Researcher, June 2012 to March 2014
Information-Theoretic Machine Learning and Data Mining Group, Dept. of Mathematical Informatics, Graduate School of Information Science and Technology, The University of Tokyo
Project: Development of the Fastest Database Engine for the Era of Very Large Database and Experiment and Evaluation of Strategic Social Services Enabled by the Database Engine, FIRST Program
Project Researcher, April 2012 to May 2012
National Institute of Informatics
Research Intern, June 2011 to September 2011
Microsoft Research, Redmond, USA
Mentor: Dr. Hisami Suzuki (Natural Language Processing Group)
Research Intern, September 2010 to February 2011
Microsoft Research Asia, Beijing, China
Mentors: Dr. Xian-Sheng Hua (Media Computing Group) and Dr. Lei Zhang (Web Search and Mining Group)
Research Intern, August 2009 to October 2009
Fujitsu Laboratories of America, Sunnyvale, USA
Mentor: Dr. Alex Gilman
Research Assistant, April 2007 to August 2010, April 2011 to June 2011, October 2011 to March 2012
National Institute of Informatics

Education

Ph.D. in Information Science and Technology, June 2012
School of Information Science and Technology, The University of Tokyo
Thesis title: Acquiring Word Denotations as Real-World Data from Social Tagging [dissertation (in Japanese)]
Supervisor: Prof. Shinichi Honiden
Ph.D. candidate, from April 2009 to March 2012
School of Information Science and Technology, The University of Tokyo
Supervisor: Prof. Shinichi Honiden
Master of Information Science and Technology, March 2009
School of Information Science and Technology, The University of Tokyo
Thesis title: Extracting Spatial Concepts Labeled by Tags in Folksonomy
Supervisor: Prof. Shinichi Honiden
Bachelor of Engineering, March 2007
Electrical Engineering, Tokyo University of Science
Thesis title: Secret Sharing Scheme Suitable for Memory and Database
Supervisor: Prof. Keiichi Iwamura

Tutorials

Academic Services

Conference Organizer

  • Publicity/Social Co-chair, 40th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2017)
  • Program Committee, NAACL 2013 Student Research Workshop

Misc.

Skills

  • Languages: Japanese (Native), English (Advanced: TOEIC 915)
  • Programming Languages: Python, Ruby, PHP, C, C++, Java, SQL
  • Platforms: Mac OS X, Linux (Fedora)
  • Web Design: HTML, CSS, JavaScript

Datasets

  • Spelling-Correction Data
    A collection of strings containing before-and-after spelling-correction pairs in English and Japanese, derived automatically from keystroke logs collected through Amazon’s Mechanical Turk. See our paper for details on how this data was generated.

Publications

Refereed journal articles

Refereed conference papers

    • Hyper Questions: Unsupervised Targeting of a Few Experts in Crowdsourcing
      Jiyi Li, Yukino Baba, Hisashi Kashima
      26th ACM International Conference on Information and Knowledge Management (CIKM), 2017
    • Atomic Distance Kernel for Material Property Prediction
      Hirotaka Akita, Yukino Baba, Hisashi Kashima and Atsuto Seko
      24th International Conference on Neural Information Processing (ICONIP), 2017
    • Quality Control for Crowdsourced Multi-Label Classification using RAkEL
      Kosuke Yoshimura, Yukino Baba and Hisashi Kashima
      24th International Conference on Neural Information Processing (ICONIP), 2017
    • Distributed Multi-task Learning for Sensor Network
      Jiyi Li, Tomohiro Arai, Yukino Baba, Hisashi Kashima, Shotaro Miwa
      European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD), 2017
    • Learning to Enumerate
      Patrick Jörger, Yukino Baba, Hisashi Kashima
      25th International Conference on Artificial Neural Networks (ICANN), 2016
    • Crowdordering
      Toshiko Matsui, Yukino Baba, Toshihiro Kamishima, Hisashi Kashima
      18th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), 2014

Refereed workshop papers

Misc.

    • Crowdsourcing Data Understanding: A Case Study using Open Government Data
      Yukino Baba, Hisashi Kashima
      3rd AAAI Conference on Human Computation & Crowdsourcing (HCOMP), Works-In-Progress, 2015
      [poster]
    • Making Legacy Open Data Machine Readable by Crowdsourcing
      Satoshi Oyama, Yukino Baba, Ikki Ohmukai, Hiroaki Dokoshi, Hisashi Kashima
      3rd AAAI Conference on Human Computation & Crowdsourcing (HCOMP), Works-In-Progress, 2015
    • Performance Evaluation between Crowdworkers and Biocurators towards Constructing a CrowdR&D Platform
      Eli Kaminuma, Yukino Baba, Takatomo Fujisawa, Asao Fujiyama, Hisashi Kashima and Yasukazu Nakamura
      25th International Conference on Genome Informatics (GIW/ISCB-Asia), Poster Track, 2014
    • Automatically Mapping Flickr Images to WordNet
      Yukino Baba, Shinichi Honiden
      5th joint NII-LIP6 WorkShop on Multi-Agent and Distributed Systems, 2010
    • Extracting Locations Related to Tags on Folksonomy
      Yukino Baba, Fuyuki Ishikawa, Shinichi Honiden
      4th joint NII-LIP6 WorkShop on Multi-Agent and Distributed Systems, 2009
    • Extracting and Utilizing Event-Context Relationships in Blogsphere
      Yukino Baba, Fuyuki Ishikawa, Shinichi Honiden
      6th International Semantic Web Conference and the 2nd Asian Semantic Web Conference (ISWC+ASWC), 2007 (Poster/Demo Track)
      [poster]