Index
A
B
C
- cardinal
- Cardinal number (CD) / Tagging with regular expressions
- CART
- categorized chunk corpus read
- categorized chunk corpus reader
- CategorizedChunkedCorpusReader class / How it works...
- categorized Conll chunk corpus reader
- categorized corpora
- CategorizedCorpusReader class
- CategorizedPlaintextCorpusReader
- CategorizedPlaintextCorpusReader class
- categorized tagged corpus reader
- categorized text corpus
- category file
- cess_cat corpora / The cess_esp and cess_cat treebank
- cess_esp corpora / The cess_esp and cess_cat treebank
- channel
- character encoding
- charade
- chart parser
- ChinkRule class / How it works...
- chinks
- chi_sq() function / There's more...
- choose_tag() method / How it works...
- Chrome / The Scrapy shell
- chunk
- chunked corpus
- ChunkedCorpusReader class / How to do it..., How it works...
- chunked phrase corpus
- chunker
- chunk extraction
- chunking
- chunk patterns
- ChunkRule class / How it works...
- ChunkRule pattern / How to do it...
- chunk rules
- chunks
- about / Introduction
- merging, with regular expressions / Merging and splitting chunks with regular expressions, How to do it..., How it works...
- splitting, with regular expressions / Merging and splitting chunks with regular expressions, How to do it..., How it works...
- rule descriptions, specifying / Specifying rule descriptions
- expanding, with regular expressions / Expanding and removing chunks with regular expressions, How it works..., There's more...
- removing, with regular expressions / Expanding and removing chunks with regular expressions, How it works..., There's more...
- ChunkScore metrics
- ChunkString / How it works...
- chunk transformations
- chunk transforms
- chunk tree
- chunk types
- chunk_tree_to_sent() function / How it works...
- class-imbalance problem / There's more...
- classification
- classification-based chunking
- classification probability
- classifier-based tagging
- ClassifierBasedPOSTagger class / Classifier-based tagging
- ClassifierBasedTagger class / Classifier-based tagging, How it works...
- ClassifierChunker class
- classifiers
- classify() method / How to do it...
- Cleaner class
- clean_html() function / How to do it...
- clear() method / How it works..., How it works...
- collocations
- complex matrix operations, NumPy
- concatenated corpus view
- conditional exponential classifier
- conditional frequency distribution
- Conditional Random Field (CRF)
- CoNLL
- CoNLL2000
- CoNLL2000 corpus
- Context Free Grammar (CFG) rules
- ContextTagger
- convert() function / How it works...
- convert_tree_labels() function / How to do it..., How it works...
- corpora / Creating POS-tagged corpora
- corpus / Creating POS-tagged corpora
- CorpusReader class / How to do it...
- corpus view
- corpus views / There's more...
- correct_verbs() function / How to do it..., How it works...
- cross-fold validation
- css() method
- CSV synonym replacement
- CsvWordReplacer class
- custom corpus
- custom corpus view
- custom feature detector
- CustomSpellingReplacer class / Personal word lists
- CYK chart parsing algorithm
D
E
- Earley chart parsing algorithm
- eigenvalues
- eigenvectors
- ELE (Expected Likelihood Estimate)
- ELEProbDist parameter
- Enchant
- enchant.list_languages() method / There's more...
- English words corpus / English words corpus
- entropy
- entropy_cutoff value
- en_GB dictionary
- error identification
- estimator
- evaluate() method / Evaluating accuracy
- execnet
- distributed tagging, used with / Distributed tagging with execnet, How to do it..., How it works..., Creating multiple channels, Local versus remote gateways
- about / Distributed tagging with execnet
- URL / Getting ready
- distributed chunking, used with / Distributed chunking with execnet, How to do it..., How it works..., There's more...
- parallel list processing, used with / Parallel list processing with execnet, How it works..., There's more...
- distributed word scoring, used with / Distributed word scoring with Redis and execnet, How to do it..., How it works..., See also
- ExpandLeftRule / How to do it...
- ExpandRightRule / How to do it...
- exploratory data analysis (EDA)
- extract() method
F
G
- gateway
- gateways, API documentation
- gensim
- geomap
- geo visualization
- Gibbs sampling
- GIS (General Iterative Scaling) / How it works...
- gis algorithm / How to do it...
- Good Turing
- Google news
H
I
J
K
L
M
- machine learning
- machine learning algorithm
- machine learning based extraction
- machine learning based tagger
- machine translation
- MapReduce
- Markov Chain Monte Carlo (MCMC) / Applying metropolis hastings in modeling languages
- masi distance / How to do it...
- matplotlib
- MaxentClassifier class
- maximum entropy (MaxEnt)
- maximum entropy classifier
- Maximum Entropy Classifier (MEC)
- max_iter variable / How it works...
- megam algorithm
- MergeRule class
- Message Passing Interface (MPI)
- metrics, based on lexical matching
- metrics, based on shallow semantic matching
- metrics, based on syntactic matching
- metropolis hastings
- min_lldelta variable / How it works...
- min_stem_length keyword
- ML (Machine learning)
- MLE model
- model
- MongoDB
- MongoDB-backed corpus reader
- MongoDBCorpusReader class / How it works...
- more informative features
- MorfoMelayu
- morphemes
- morphological analyzer
- morphology
- most informative features
- most_informative_features() method
- movie_reviews corpus / Getting ready
- multi-label classifier
- multilabel classifier / Classifying with multiple binary classifiers
- MultinomialNB / How it works...
- multiple binary classifiers
- multiple channels
- multi_metrics() function / How to do it...
N
- N-gram tagger
- Naive Bayes
- Naive Bayes algorithms
- Naive Bayes classifier
- NaiveBayesClassifier.train() method / How it works...
- NaiveBayesClassifier class / Training a Naive Bayes classifier
- NaiveBayesClassifier constructor / How it works...
- NAME chunker / How it works...
- named-entity recognition (NER)
- named entities
- named entity chunker
- named entity recognition
- Named Entity Recognition (NER)
- NamesTagger class / How to do it...
- names wordlist corpus / Names wordlist corpus
- National Institute of Standards and Technology (NIST) / How to do it...
- Natural Language Toolkit (NLTK) / Understanding stemmer
- ndarray
- negations
- negative feature sets / How it works...
- NER
- NER system
- NER tagger
- NetworkX
- ngram
- NgramTagger class / Quadgram tagger
- ngram taggers
- NLP
- NLP application
- NLP Systems
- NLP tools
- NLTK
- NLTK, on Hadoop
- NLTK-Trainer
- about / Training a tagger with NLTK-Trainer
- URL, for documentation / Training a tagger with NLTK-Trainer
- tagger, training with / Training a tagger with NLTK-Trainer, How to do it..., How it works...
- URL, for installation instructions / Training a tagger with NLTK-Trainer
- pickled tagger, saving / Saving a pickled tagger
- training, on custom corpus / Training on a custom corpus
- used, for training chunker / Training a chunker with NLTK-Trainer, How to do it..., How it works...
- used, for training classifier / Training a classifier with NLTK-Trainer, How it works...
- pickled classifier, saving / Saving a pickled classifier
- training instances, using / Using different training instances
- most informative features / The most informative features
- Maxent classifier / The Maxent and LogisticRegression classifiers
- LogisticRegression classifier / The Maxent and LogisticRegression classifiers
- SVM classifiers / SVMs
- classifiers, combining / Combining classifiers
- high information words / High information words and bigrams
- cross-fold validation / Cross-fold validation
- classifier, analyzing / Analyzing a classifier
- nltk.chunk functions / How it works...
- nltk.corpus
- nltk.corpus.treebank_chunk corpus / Treebank chunk corpus
- nltk.data.load() function / How it works...
- nltk.metrics module / See also
- nltk.metrics package
- nltk.sem.logic module
- nltk.tag.untag() function / There's more...
- NLTK functionality
- normalization
- noun cardinals
- Noun Phrase (NP)
- noun phrase (NP)
- Noun Phrase chunk rule / Developing a chunker using pos-tagged corpora
- Noun Phrases (NP)
- NumPy
- NumPy array
- NumPy package
- n_ii parameter / How it works...
- n_ix parameter / How it works...
- n_xi parameter / How it works...
- n_xx parameter / How it works...
O
- optical character recognition (OCR)
- ordered dictionary
P
- P(features) parameter / Training a Naive Bayes classifier
- P(features | label) parameter / Training a Naive Bayes classifier
- P(label) parameter / Training a Naive Bayes classifier
- P(label | features) parameter / Training a Naive Bayes classifier
- pandas
- paragraph block reader
- parallel list processing
- ParaMorfo
- parsed_docs() method / How it works...
- parser evaluation
- parsers
- parse trees
- parsing
- part-of-speech tagged word corpus
- part-of-speech tagging
- partial parsing
- part of speech (POS) tagging
- Part of Speech tagger (POS)
- part of speech tagging
- parts-of-speech tagging
- Path and Leacock Chordorow (LCH) similarity / Path and Leacock Chordorow (LCH) similarity
- Path Distance Similarity / Disambiguating senses using Wordnet
- pattern creation
- Penn Treebank
- Penn Treebank corpus
- Penn Treebank Project
- personal word lists / Personal word lists
- PersonChunker class / There's more...
- petabytes
- phi_sq() function / There's more...
- phonemes
- phrases
- phrase structure parsing
- pickle corpus view
- pickled chunker
- pickled tagger
- pivot point
- PlaintextCorpusReader class / How it works...
- plural nouns
- pmi() function / There's more...
- Polyglotis
- Porter stemmer
- PorterStemmer class
- Porter stemming algorithm
- pos-tagged corpora
- positive feature sets / How it works...
- POS tag
- pos tagged corpora
- pre-trained classifier
- Precision / Evaluation of IR system
- precision
- precision and recall, MaxentClassifier class
- precision and recall, NaiveBayesClassifier class
- precision_recall() function
- Prepositional Phrases (PP)
- PresuppositionDRS class
- probabilistic approach, parsing
- Probabilistic Context-free Grammar (PCFG)
- probabilistic context-free grammar (PCFG)
- probabilistic dependency parser
- probabilistic model
- projective dependency parser
- proper names
- proper noun chunks
- PunktSentenceTokenizer class / How it works..., There's more...
- PunktWordTokenizer
- pure function
- pure module
- PyEnchant library
- PyMongo documentation
- PySpark
- Python
- Python, on Hadoop
- Python subprocesses, distributed chunking / Python subprocesses
- PyYAML
Q
- Quadgram tagger
- question answering (QA) systems
- question answering system
R
- random forest
- rare word
- re() method
- recall
- Recall / Evaluation of IR system
- recursive descent parser
- Redis
- frequency distribution, storing / Storing a frequency distribution in Redis, How to do it..., How it works..., There's more...
- URL / Getting ready
- conditional frequency distribution, storing / Storing a conditional frequency distribution in Redis, How to do it..., How it works...
- ordered dictionary, storing / Storing an ordered dictionary in Redis, How to do it..., There's more..., See also
- distributed word scoring, used with / Distributed word scoring with Redis and execnet, How to do it..., How it works..., See also
- redis-py homepage
- Redis commands
- reference set
- regex parser
- RegexpParser class / How it works...
- RegexpReplacer class / How to do it...
- RegexpStemmer class
- RegexpTagger class / How to do it...
- RegexpTokenizer class / How it works...
- regex tagger
- regression
- regular expression
- Regular Expressions
- regular expressions
- used, for tokenizing sentences / Tokenizing sentences using regular expressions, There's more...
- words, tagging with / Tagging with regular expressions, How it works...
- used, for defining chunk patterns / Chunking and chinking with regular expressions, How to do it..., How it works..., There's more...
- used, for merging chunks / Merging and splitting chunks with regular expressions, How to do it..., How it works...
- used, for splitting chunks / How to do it..., How it works...
- used, for expanding chunks / Expanding and removing chunks with regular expressions, How it works..., There's more...
- used, for removing chunks / Expanding and removing chunks with regular expressions, How it works..., There's more...
- used, for partial parsing / Partial parsing with regular expressions, How it works...
- reinforcement learning
- relative link / There's more...
- remote gateway
- remove_line() function / How to do it..., How it works...
- repeating characters
- RepeatReplacer class
- replace() method / How to do it..., How it works..., How it works...
- replacement technique
- replace_negations() method / How it works...
- Resnik Score / Disambiguating senses using Wordnet
- reuters_high_info_words() function / How it works...
- reuters_train_test_feats() function / How it works...
- rule-based approach, parsing
- rule-based extraction
S
- sampling
- scatter plot
- scikit-learn
- scikit-learn classifiers
- scikit-learn model / There's more...
- SciPy
- score_ngrams(score_fn) function / Scoring ngrams
- score_words() function
- Scrapy
- Scrapy shell
- Script Applier Mechanism(SAM)
- semantic analysis
- semi-supervised learning
- sense disambiguation
- senses
- sentence
- Sentence level Construction, CFG
- sentences
- sentences, tokenizing into words
- sentence splitter
- sentence tokenizer
- sentiment analysis
- sent_tokenize function / How it works...
- SequentialBackoffTagger class / How it works..., How to do it..., How to do it...
- sequential taggers
- shallow parsing
- shallow tree
- shallow_tree() function / How to do it..., How it works...
- shift-reduce parser
- show_most_informative_features() method
- significant bigrams
- significant words
- similarity measures
- singularize_plural_noun() function / How to do it...
- singular value decomposition (SVD)
- Singular Value Decomposition (SVD)
- Sitemap spider
- SklearnClassifier class
- smoothing
- SnowballStemmer class
- Snowball stemmers
- SPANEW (Spanish ANEW) / Introducing sentiment analysis
- sparse matrix
- specific preprocessing
- speech recognition
- spell correction
- spelling issues
- SpellingReplacer class / Personal word lists
- SplitRule class
- split_label_feats() function / How to do it..., How it works...
- squared Pearson correlation coefficient
- Stanford parser
- Stanford tagger
- Stanford tools
- statistical machine translation (SMT)
- statistical modeling
- stem() method / How to do it...
- stemmer
- Stemmer I interface
- stemming
- Stochastic Finite State Automaton (SFSA)
- stochastic gradient descent (SGD)
- stop word removal
- stopwords
- stopwords corpus / How it works..., See also
- StreamBackedCorpusView class / Creating a custom corpus view
- string functions
- subplot
- subtrees
- sub_leaves() method / See also
- summarization
- supervised classification
- supervised learning
- Support Vector Machines (SVM)
- support vector machines (SVM)
- support_cutoff value
- swap_infinitive_phrase() function / How to do it...
- swap_noun_cardinal() function / How to do it...
- swap_verb_phrase() function / How to do it..., There's more..., How it works...
- synonyms
- Synset
- synset id
- syntactic matching
- syntactic parser
- syntactic transfer
T
U
- Udacity
- unambiguous antonyms / There's more...
- UnChunkRule pattern / How to do it...
- UnicodeDammit
- unigram
- unigram part-of-speech tagger
- UnigramTagger
- UnigramTagger class
- universal tags
- universal tagset
- unlabeled feature set
- unsupervised classification
- unsupervised learning
- URLs
- user defined function (UDF)
V
W
X
Y
- YAML file
- YAML synonym replacement
Z
..................Content has been hidden....................
You can't read the all page of ebook, please click
here login for view all page.