/ プログラム/ 発表一覧/ 著者一覧/ 企業展示一覧/ jsai2010ホーム /

1D1-3 CGMからの自己教師あり学習と条件付確率場を用いた人間行動マイニング

06月09日(Wed) 09:00〜10:20 D会場(3F-会議室4)
1D1 Webマイニング「知識獲得」

演題番号	1D1-3
題目	CGMからの自己教師あり学習と条件付確率場を用いた人間行動マイニング
著者	Nguyen MinhThe(電気通信大学大学院情報システム学研究科) 川村隆浩(電気通信大学大学院情報システム学研究科) 中川博之(電気通信大学大学院情報システム学研究科) 田原康之(電気通信大学大学院情報システム学研究科) 大須賀昭彦(電気通信大学大学院情報システム学研究科)
時間	06月09日(Wed) 09:40〜10:00
概要	In our definition, human activity can be expressed by five basic attributes: actor, action, object, time and location. The goal of this paper is to describe a method to automatically extract all of the basic attributes and the relationships (transition and cause) between activities derived from sentences in Japanese CGM (consumer generated media). Previous work had some limitations, such as high setup cost, inability of extracting all attributes, limitation on the types of sentences that can be handled, and insufficient consideration of interdependency among attributes. To resolve these problems, this paper proposes a novel approach that uses conditional random fields and self-supervised learning. This approach treats the activity extraction as a sequence labeling problem, and has advantages such as domain-independence, scalability, and unnecessary hand-tagged data. Since it is unnecessary to fix the positions and the number of the attributes in activity sentences, this approach can extract all attributes and relationships between activities by making only a single pass over its corpus. Additionally, by converting to simpler sentences, the approach can deal with complex sentences retrieved from Japanese CGM. In an experiment, this approach achieves high precision (activity: 88.84%, attributes: over 90%, relationships: over 84%).
論文	PDFファイル