Chan Hee (Luke) Song송찬희

I am a senior computer science student at the University of Notre Dame graduating in May 2020. From Seoul, Republic of Korea, I've been blessed to live in the beautiful Notre Dame campus and gain experience as a computer science researcher. I am also part of Notre Dame Natural Language Processing Group.

Email  /  Github  /  LinkedIn

News

  • 10/2019: Chinese Glyph NER paper accepted to AAAI-20, Student Abstract Track!

Research

I am interested in machine learning, especially Natural Language Processing. I have experience in machine translation and word embeddings, and looking to explore problems in NLP before committing myself to a specific field. This summer I have been at the HLTCOE working on improving neural Named Entity Recognition systems.

prl

Using Chinese Glyphs for Named Entity Recognition
Chan Hee Song, Arijit Sehanobish
09/2019
paper, to appear in AAAI-20 Student Abstract Track

Used Chinese glyph information to augment neural named entity system. Introduced a convolutional neural network (CNN) architecture that achieves a state-of-art performance on Chinese Weibo NER dataset and sets a new baseline F1 score on Chinese OntoNotes v5.0 dataset. Easy to implement, train and uses less data. Robust to non-Chinese characters in the dataset.

prl

Gazetteer Generation for Neural Named Entity Recognition
Chan Hee Song, Dawn Lawrie, Tim Finin, Jim Mayfield
09/2019

Introduced a simple method to generate gazetteers from the Web. Generated gazetteer is used as a feature to augment neural named entity recognition system, showing improvement over baseline. Developed a method to create new training data using gazetteer entity replacement. Presented new Russian NER corpus gathered from Reddit. Under review FLAIRS-33.

prl

Sentence Representation for Neural Machine Translation Systems
Chan Hee Song, David Chiang
06/2018 -> 05/2019

Independent undergraduate researcher under supervision of professor David Chiang. I investigated the impact of sentence representation on neural machine translation to develop an optimal encoding mechanism that has a better trade-off between time and accuracy. I am using C, Python and PyTorch library to build encoder/decoder, modify neural machine translation system, and implement new equations.

prl

Verba Volant, Scripta Manent: Automatic Transcription of Medieval Latin Manuscripts
Undergraduate Research Assistant
01/2018 -> 05/2018
poster

Research assistant to the project which aims to find a novel method to process Vatican Secret Archives to transfer early Medieval Latin handwritten papers into a computer readable format. I analyzed various character recognition algorithm and used Python to build an auto analyzer tool for Optical Character Recognition systems

Work Experience
prl

Human Language Technology Center For Excellence, Johns Hopkins University
Visiting NLP/ML Researcher
05/2019 -> 08/2019

I implemented various state-of-the-art neural architectures and adapted to named entity recognition task. Introduced a lexical feature generated from gazetteers to the named entity recognition (NER) system. As an independent project, developed a novel named entity recognition system achieving state-of-the-art F1 score on Chinese datasets using glyph features generated from convolutional neural network.

prl

Kyndi
Software Engineering Intern
01/2019 -> 05/2019

I migrated in-memory graph to external database by building a GraphQL server to interact with the external database using Apollo and Typescript. Also improved accuracy and representation of the knowledge graph by writing unique graph operations on a large knowledge graph.

Teaching
ND

Undergraduate Teaching Assistant, Fundamentals of Computing Fall 2018


Updated 11/2019
(borrowed from here)