Supported browsers: Edge, Chrome, Safari, Firefox
UniDicのロゴ コーパス開発センターのロゴ
Usage Notes

Whether you are CC-BY-NC-SA or free of licence , if you wish to use the dictionaries for profit, contact/consult the following contact points.

In addition, when publicising the results of a study conducted using the following dictionary for analysis, clearly state this. Refer to the literature in the references as needed. It will be used to calculate the usage of UniDic.

For licensing reasons, dictionaries for analysis prior to Ver. 2.x and 1603 are not subject to distribution and support at the Corpus Development Centre and this site. If you have already downloaded and you have any questions, please contact the person in charge who is listed in the licence file that is packed together with the dictionary. We apologise for any inconvenience caused and thank you for your understanding.

現代語用UniDicS (UniDic for Contemporary Japanese)
  1. 現代書き言葉UniDic (UniDic fo Contemporary Written Japanese)
  2. 現代話し言葉UniDic (UniDic for Spoken Japanese
古文用UniDicS (UniDic for Historical Japanese)
  1. 旧仮名口語UniDic (UniDic for Old Kana Colloquial Japanese)
  2. 近代文語UniDic (UniDic for Modern Literary Japanese)
  3. 近世口語(洒落本)UniDic (UniDic for Edo Period Colloquial Japanese)
  4. 中世口語(狂言)UniDic (UniDic for Muromachi Period Colloquial Japanese)
  5. 中世文語(説話・随筆)UniDic (UniDic for Kamakura Period Literary Japanese)
  6. 中古和文UniDic (UniDic for Heian Period Japanese)
  7. 上代(万葉集)UniDic (UniDic for Nara Period Japanese)
現代書き言葉UniDic (UniDic for Contemporary Written Japanese)
File nameRelease dateLicenceNote
unidic-cwj-3.1.0.zip 2021-04-01 GPL v2.0/LGPL v2.1/New BSD Download Lexicon size (UTF-8)
without matrix.def and model.def (530MB)
unidic-cwj-3.1.0-full.zip 2021-04-01 GPL v2.0/LGPL v2.1/New BSD Download Lexicon size (UTF-8) with matrix.def and model.def for training (1.6GB)
unidic-cwj-2.3.0.zip 2018-04-10 GPL v2.0/LGPL v2.1/New BSD Download 2.2GB
unidic-cwj-2.3.0_beta.zip 2018-03-29 GPL v2.0/LGPL v2.1/New BSD beta version
unidic-cwj-2.2.0.zip 2017-09-05 GPL v2.0/LGPL v2.1/New BSD Download
参考文献 (References)
  • 岡 照晃: 「CRF素性テンプレートの見直しによるモデルサイズを軽量化した解析用UniDic ― unidic-cwj-2.2.0 と unidic-csj-2.2.0 ― 」, 言語資源活用ワークショップ2017発表予稿集, pp.143-152 (2017).
References
謝辞 (Acknowledgements)

This work was supported by the NINJAL Project "Basic Research on Corpus Annotation - Extension, Integration and Machine-aided Approaches" (FY2016-2021).



File nameRelease dateLicenceNote
unidic-mecab-2.1.2_bin.zip 2013-03-14 GPL v2.0/LGPL v2.1/New BSD Download OSDN version
unidic-mecab-2.1.2_src.zip 2013-03-14 GPL v2.0/LGPL v2.1/New BSD Download OSDN version
unidic-mecab-2.1.2_model.zip 2013-03-14 GPL v2.0/LGPL v2.1/New BSD Download OSDN version
unidic-mecab_kana-accent-2.1.2_src.zip 2013-03-14 GPL v2.0/LGPL v2.1/New BSD Download OSDN version
unidic-mecab-211_bin.zip 2012-12-13 GPL v2.0/LGPL v2.1/New BSD Download OSDN version
unidic-mecab-211_windows.zip 2012-12-13 GPL v2.0/LGPL v2.1/New BSD Download OSDN version
unidic-mecab-211_src.zip 2012-12-13 GPL v2.0/LGPL v2.1/New BSD Download OSDN version
unidic-mecab-211_model.zip 2012-12-13 GPL v2.0/LGPL v2.1/New BSD Download OSDN version
File nameRelease dateLicenceNote
UniDic-gendai_1603.zip 2016-03 GPL v2.0/LGPL v2.1/New BSD Download Web Chamame version (2016/03)
参考文献 (References)
References
現代話し言葉UniDic (UniDic for Contemporary Spoken Japanese)
File nameRelease dateLicenceNote
unidic-csj-3.1.0.zip 2021-04-01 GPL v2.0/LGPL v2.1/New BSD Download Lexicon size (UTF-8)
without matrix.def and model.def (530MB)
unidic-csj-3.1.0-full.zip 2021-04-01 GPL v2.0/LGPL v2.1/New BSD Download Lexicon size (UTF-8)
with matrix.def and model.def for training(1.7GB)
unidic-csj-3.0.1.1.zip 2020-02-21 GPL v2.0/LGPL v2.1/New BSD Download Lexicon size (UTF-8)
(1.5GB)
diff from 3.0.1: the file size of matrix.def (4.3GB -> 3.6GB)
unidic-csj-3.0.1.zip 2019-12-17 GPL v2.0/LGPL v2.1/New BSD Download Lexicon size (UTF-8)
(1.6GB)
参考文献 (References)
  • 岡 照晃: 「言語研究のための電子化辞書」, コーパスと辞書, 講座 日本語コーパス 7, pp.1-28, 朝倉書店 (2019).
References
  • Yasuharu Den, Junpei Nakamura, Toshinobu Ogiso, Hideki Ogura. A Proper Approach to Japanese Morphological Analysis: Dictionary, Model, and Evaluation, In Proceedings of the sixth international conference on Language Resources and Evaluation (LREC 2008), pp.1019-1024 (2008).
謝辞 (Acknowledgements)

This work was supported by the NINJAL Project "Basic Research on Corpus Annotation - Extension, Integration and Machine-aided Approaches" (FY2016-2021).



File nameRelease dateLicenceNote
unidic-csj-2.3.0.zip 2018-04-10 GPL v2.0/LGPL v2.1/New BSD Download (2.2GB)
unidic-csj-2.3.0_beta.zip 2018-03-29 GPL v2.0/LGPL v2.1/New BSD beta version
unidic-csj-2.2.0.zip 2017-09-05 GPL v2.0/LGPL v2.1/New BSD Download
参考文献 (References)
  • 岡 照晃: 「CRF素性テンプレートの見直しによるモデルサイズを軽量化した解析用UniDic ― unidic-cwj-2.2.0 と unidic-csj-2.2.0 ― 」, 言語資源活用ワークショップ2017発表予稿集, pp.143-152 (2017).
References
  • Yasuharu Den, Junpei Nakamura, Toshinobu Ogiso, Hideki Ogura. A Proper Approach to Japanese Morphological Analysis: Dictionary, Model, and Evaluation, In Proceedings of the sixth international conference on Language Resources and Evaluation (LREC 2008), pp.1019-1024 (2008).
謝辞 (Acknowledgements)

This work was supported by the NINJAL Project "Basic Research on Corpus Annotation - Extension, Integration and Machine-aided Approaches" (FY2016-2021).



File nameRelease dateLicenceNote
UniDic-spoken_1603.zip 2016-03 クリエイティブ・コモンズ・ライセンス Download Web Chamame version (2016/03)
参考文献
References
  • Yasuharu Den, Junpei Nakamura, Toshinobu Ogiso, Hideki Ogura. A Proper Approach to Japanese Morphological Analysis: Dictionary, Model, and Evaluation, In Proceedings of the sixth international conference on Language Resources and Evaluation (LREC 2008), pp.1019-1024 (2008).
旧仮名口語UniDic (UniDic for Old Kana Colloquial Japanese)
File nameRelease dateLicenceNote
UniDic-qkana_1603.zip 2016-03 クリエイティブ・コモンズ・ライセンス Download Web Chamame Version (2016/03)
参考文献 (References)
  • 小木曽智信: 「旧仮名遣いの口語文を対象とした形態素解析辞書」, じんもんこん2012論文集, pp.25-32 (2012).
References
  • Toshinobu Ogiso, Mamoru Komachi and Yuji Matsumoto. Morphological Analysis of Historical Japanese Text, Journal of Natural Language Processing, Vol.20, No.5, pp.727-748 (2013). [in Japanese]
  • Tomoaki Kouno and Toshinobu Ogiso. Improving an Electronic Dictionary for Morphological Analysis of Japanese: Use of historical period information, In Proceedings of The 9th International Conference of ASIALEX (ASIALEX2015) (2015). [can not read online]
近代文語UniDic (UniDic for Modern Literary Japanese)
File nameRelease dateLicenceNote
UniDic-kindai_1603.zip 2016-03 クリエイティブ・コモンズ・ライセンス Download Web Chamame version (2016/03)
unidic-MLJ_14.zip 2014-03-31 not available Old version (2014/03/31)
参考文献 (References)
  • 小木曽 智信, 小町 守, 松本 裕治: 「歴史的日本語資料を対象とした形態素解析」, 自然言語処理, Vol.20, No.5, pp.727-748 (2013).
References
  • Toshinobu Ogiso, Mamoru Komachi and Yuji Matsumoto. Morphological Analysis of Historical Japanese Text, Journal of Natural Language Processing, Vol.20, No.5, pp.727-748 (2013). [in Japanese]
  • Tomoaki Kouno and Toshinobu Ogiso. Improving an Electronic Dictionary for Morphological Analysis of Japanese: Use of historical period information, In Proceedings of The 9th International Conference of ASIALEX (ASIALEX2015) (2015). [can not read online]
近世口語(洒落本)UniDic (UniDic for Edo Period Colloquial Japanese)
File nameRelease dateLicenceNote
UniDic-kinsei_1603.zip 2016-03 クリエイティブ・コモンズ・ライセンス Download Web Chamame version (2016/03)
参考文献 (References)
  • 小木曽 智信, 市村 太郎, 鴻野知暁: 「近世口語資料の形態素解析の試み」, 第4回コーパス日本語学ワークショップ予稿集, pp.145-150 (2013).
References
  • Toshinobu Ogiso, Mamoru Komachi and Yuji Matsumoto. Morphological Analysis of Historical Japanese Text, Journal of Natural Language Processing, Vol.20, No.5, pp.727-748 (2013). [in Japanese]
  • Tomoaki Kouno and Toshinobu Ogiso. Improving an Electronic Dictionary for Morphological Analysis of Japanese: Use of historical period information, In Proceedings of The 9th International Conference of ASIALEX (ASIALEX2015) (2015). [can not read online]
中世口語(狂言)UniDic (UniDic for Muromachi Period Colloquial Japanese)
File nameRelease dateLicenceNote
UniDic-kyogen_1603.zip 2016-03 クリエイティブ・コモンズ・ライセンス Download Web Chamame version (2016/03)
参考文献 (References)
  • 小木曽 智信, 鴻野 知暁, 市村 太郎: 「狂言台本の形態素解析」, 日本語学会2015年度春季大会 (2015). [can not read online]
References
  • Toshinobu Ogiso, Mamoru Komachi and Yuji Matsumoto. Morphological Analysis of Historical Japanese Text, Journal of Natural Language Processing, Vol.20, No.5, pp.727-748 (2013). [in Japanese]
  • Tomoaki Kouno and Toshinobu Ogiso. Improving an Electronic Dictionary for Morphological Analysis of Japanese: Use of historical period information, In Proceedings of The 9th International Conference of ASIALEX (ASIALEX2015) (2015). [can not read online]
中世文語(説話・随筆)UniDic (UniDic for Kamakura Period Literary Japanese)
File nameRelease dateLicenceNote
UniDic-wakan_1603.zip 2016-03 クリエイティブ・コモンズ・ライセンス Download Web Chamame version (2016/03)
参考文献 (References)
  • 小木曽 智信, 小町 守, 松本 裕治: 「歴史的日本語資料を対象とした形態素解析」, 自然言語処理, Vol.20, No.5, pp.727-748 (2013).
References
  • Toshinobu Ogiso, Mamoru Komachi and Yuji Matsumoto. Morphological Analysis of Historical Japanese Text, Journal of Natural Language Processing, Vol.20, No.5, pp.727-748 (2013). [in Japanese]
  • Tomoaki Kouno and Toshinobu Ogiso. Improving an Electronic Dictionary for Morphological Analysis of Japanese: Use of historical period information, In Proceedings of The 9th International Conference of ASIALEX (ASIALEX2015) (2015). [can not read online]
中古和文UniDic (UniDic for Heian Period Japanese)
File nameRelease dateLicenceNote
UniDic-wabun_1603.zip 2016-03 クリエイティブ・コモンズ・ライセンス Download Web Chamame version (2016/03)
unidic-EMJ_14.zip 2014-03-31 not available Old version (2014/03/31)
参考文献 (References)
  • 小木曽 智信, 小椋 秀樹, 田中 牧郎, 近藤 明日子, 伝 康晴: 「中古和文を対象とした形態素解析辞書の開発」, 情報処理学会研究報告 人文科学とコンピュータ, Vol.2010-CH-85, No.4, pp.1-8 (2010).
  • 小木曽智信: 「中古仮名文学作品の形態素解析」, 日本語の研究, Vol.9, No.4, pp.49-6 (2013).
  • 小木曽 智信, 小町 守, 松本 裕治: 「歴史的日本語資料を対象とした形態素解析」, 自然言語処理, Vol.20, No.5, pp.727-748 (2013).
References
  • Toshinobu Ogiso, Mamoru Komachi, Yasuharu Den and Yuji Matsumoto. UniDic for Early Middle Japanese: a Dictionary for Morphological Analysis of Classical Japanese, In Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012), pp.911-915 (2012).
上代(万葉集)UniDic (UniDic for Nara Period Japanese)
File nameRelease dateLicenceNote
UniDic-manyo_1603.zip 2016-03 クリエイティブ・コモンズ・ライセンス Download Web Chamame version (2016/03)
参考文献 (References)
  • 小木曽 智信, 小町 守, 松本 裕治: 「歴史的日本語資料を対象とした形態素解析」, 自然言語処理, Vol.20, No.5, pp.727-748 (2013).
References
  • Toshinobu Ogiso, Mamoru Komachi and Yuji Matsumoto. Morphological Analysis of Historical Japanese Text, Journal of Natural Language Processing, Vol.20, No.5, pp.727-748 (2013). [in Japanese]
  • Tomoaki Kouno and Toshinobu Ogiso. Improving an Electronic Dictionary for Morphological Analysis of Japanese: Use of historical period information, In Proceedings of The 9th International Conference of ASIALEX (ASIALEX2015) (2015). [can not read online]