Yukino Baba, Ph.D. (Information Science and Technology)

Associate Professor
Graduate School of Arts and Sciences, University of Tokyo

News

April 13, 2019
Our paper has been accepted to EDM 2019.
  • Probabilistic Modeling of Peer Correction and Peer Assessment
  • Takeru Sunahase, Yukino Baba, Hisashi Kashima
  • 12th International Conference on Educational Data Mining (EDM), 2019
February 1, 2019
Our paper has been accepted to ICASSP 2019.
  • CrowNN: Human-in-the-loop Network with Crowd-generated Inputs
  • Yusuke Sakata, Yukino Baba, Hisashi Kashima
  • 44th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019
July 18, 2018
I have been invited to give a talk at the IJCAI 2018 Early Career Spotlight.

Research Interest

My current research topic is statistical quality control in crowdsourcing. I'm also broadly interested in:

  • Combining human brainpower and computer power, i.e., human computation and crowdsourcing [KDD 2013] [IAAI 2013] [IJCAI 2013]
  • Extracting knowledge from users' implicit behaviors on the Web, e.g., social tagging [ECAI 2010] and spelling errors [ACL 2012]

Keywords:
Data Mining, Crowdsourcing, Human Computation, Web Mining

Professional Experience

Associate Professor, April 2018 to present
Faculty of Engineering, Information and Systems,
University of Tsukuba
Assistant Professor, September 2015 to March 2018
Machine Learning and Data Mining Research Laboratory,
Department of Intelligence Science and Technology,
Graduate School of Informatics,
Kyoto University
Program-Specific Assistant Professor, April 2015 to September 2015
Machine Learning and Data Mining Research Laboratory,
Department of Intelligence Science and Technology,
Graduate School of Informatics,
Kyoto University
Project Research Associate, April 2014 to March 2015
Global Research Center for Big Data Mathematics, National Institute of Informatics
Project: JST, ERATO, Kawarabayashi Large Graph Project
Project Researcher, June 2012 to March 2014
Information-Theoretic Machine Learning and Data Mining Group, Dept. of Mathematical Informatics, Graduate School of Information Science and Technology, The University of Tokyo
Project: Development of the Fastest Database Engine for the Era of Very Large Database and Experiment and Evaluation of Strategic Social Services Enabled by the Database Engine, FIRST Program
Project Researcher, April 2012 to May 2012
National Institute of Informatics
Research Intern, June 2011 to September 2011
Microsoft Research, Redmond, USA
Mentor: Dr. Hisami Suzuki (Natural Language Processing Group)
Research Intern, September 2010 to February 2011
Microsoft Research Asia, Beijing, China
Mentor: Dr. Xian-Sheng HUA (Media Computing Group) and Dr. Lei Zhang (Web Search and Mining Group)
Research Intern, August 2009 to October 2009
Fujitsu Laboratories of America, Sunnyvale, USA
Mentor: Dr. Alex Gilman
Research Assistant, April 2007 to August 2010, April 2011 to June 2011, October 2011 to March 2012
National Institute of Informatics

Education

Ph.D. in Information Science and Technology, June 2012
School of Information Science and Technology, The University of Tokyo
Thesis title: Acquiring Word Denotations as Real-World Data from Social Tagging [dissertation (in Japanese)]
Supervisor: Prof. Shinichi Honiden
Ph.D. candidate, from April 2009 to March 2012
School of Information Science and Technology, The University of Tokyo
Supervisor: Prof. Shinichi Honiden
Master of Information Science and Technology, March 2009
School of Information Science and Technology, The University of Tokyo
Thesis title: Extracting Spatial Concepts Labeled by Tags in Folksonomy
Supervisor: Prof. Shinichi Honiden
Bachelor of Engineering, March 2007
Electrical Engineering, Tokyo University of Science
Thesis title: Secret Sharing Scheme Suitable for Memory and Database
Supervisor: Prof. Keiichi Iwamura

Invited Talks and Tutorials

Academic Services

Conference Organizer

  • Publicity/Social Co-chair, 40th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2017)
  • Program Committee, NAACL 2013 Student Research Workshop

Misc.

Skills

  • Languages: Japanese (Native), English (Advanced: TOEIC 915)
  • Programming Languages: Python, Ruby, PHP, C, C++, Java, SQL
  • Platforms: Mac OS X, Linux (Fedora)
  • Web Design: HTML, CSS, JavaScript

Datasets

  • Spelling-Correction Data
    Collection of strings including before-and-after spelling-correction pairs in English and Japanese, derived automatically by processing keystroke logs collected through Amazon’s Mechanical Turk. See our paper for the details about how this data is generated.

Publications

Refereed journal articles

Refereed conference and workshop papers

    • HumanGAN: Generative Adversarial Network with Human-based Discriminator and its Evaluation in Speech Perception Modeling
    • Kazuki Fujii, Yuki Saito, Shinnosuke Takamichi, Yukino Baba, Hiroshi Saruwatari
    • 45th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020
    • Active Learning Strategies for Hierarchical Labeling Microtasks
    • Kousuke Uo, Masaki Kobayashi, Masaki Matsubara, Yukino Baba, and Atsuyuki Morishima
    • 3rd IEEE Workshop on Human-in-the-loop Methods and Human Machine Collaboration in BigData, 2019
    • Distributed Multi-task Learning for Sensor Network
    • Jiyi Li, Tomohiro Arai, Yukino Baba, Hisashi Kashima, Shotaro Miwa
    • European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD), 2017
    • Learning to Enumerate
    • Patrick Jörger, Yukino Baba, Hisashi Kashima
    • 25th International Conference on Artificial Neural Networks (ICANN), 2016
    • Crowdordering
    • Toshiko Matsui, Yukino Baba, Toshihiro Kamishima, Hisashi Kashima
    • 18th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), 2014

Misc.

    • Synthetic Accessibility Assessment Using Auxiliary Responses
    • Shun Ito, Yukino Baba, Tetsu Isomura and Hisashi Kashima
    • 6th AAAI Conference on Human Computation & Crowdsourcing (HCOMP), Works-In-Progress, 2018
    • Making Legacy Open Data Machine Readable by Crowdsourcing
    • Satoshi Oyama, Yukino Baba, Ikki Ohmukai, Hiroaki Dokoshi, Hisashi Kashima
    • 3rd AAAI Conference on Human Computation & Crowdsourcing (HCOMP), Works-In-Progress, 2015
    • Performance Evaluation between Crowdworkers and Biocurators towards Constructing a CrowdR&D Platform
    • Eli Kaminuma, Yukino Baba, Takatomo Fujisawa, Asao Fujiyama, Hisashi Kashima and Yasukazu Nakamura
    • 25th International Conference on Genome Informatics (GIW/ISCB-Asia), Poster Track, 2014
    • Automatically Mapping Flickr Images to WordNet
    • Yukino Baba, Shinichi Honiden
    • 5th joint NII-LIP6 WorkShop on Multi-Agent and Distributed Systems, 2010
    • Extracting Locations Related to Tags on Folksonomy
    • Yukino Baba, Fuyuki Ishikawa, Shinichi Honiden
    • 4th joint NII-LIP6 WorkShop on Multi-Agent and Distributed Systems, 2009
    • Extracting and Utilizing Event-Context Relationships in Blogsphere
    • Yukino Baba, Fuyuki Ishikawa, Shinichi Honiden
    • 6th International Semantic Web Conference and the 2nd Asian Semantic Web Conference (ISWC+ASWC), 2007 (Poster/Demo Track)