.. Copyright (C) 2001-2012 NLTK Project .. For license information, see LICENSE.TXT ============================ Japanese Language Processing ============================ >>> from nltk import * ------------- Corpus Access ------------- KNB Corpus ---------- Currently, the interface returns objects of the wrong type. >>> from nltk.corpus import knbc Access the words: this should produce a list of strings: >>> type(knbc.words()[0]) Access the sentences: this should produce a list of lists of strings: >>> type(knbc.sents()[0][0]) Access the tagged words: this should produce a list of word, tag pairs: >>> type(knbc.tagged_words()[0]) Access the tagged sentences: this should produce a list of lists of word, tag pairs: >>> type(knbc.tagged_sents()[0][0]) JEITA Corpus ------------ >>> from nltk.corpus import jeita Access the tagged words: this should produce a list of word, tag pairs, where a tag is a string: >>> type(jeita.tagged_words()[0][1])