Primary analyzers available in Lucene
Analyzer            Steps taken
WhitespaceAnalyzer  Splits tokens at whitespace
SimpleAnalyzer      Divides text at nonletter characters and lowercases
StopAnalyzer        Divides text at nonletter characters, lowercases, and removes stop words
StandardAnalyzer    Tokenizes based on a sophisticated grammar that recognizes
                    e-mail addresses, acronyms, Chinese-Japanese-Korean characters,
                    alphanumerics, and more; lowercases; and removes stop words
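
To make the differences between the first three rows concrete, here is a minimal plain-Java sketch that imitates the steps listed in the table. It is not Lucene code and does not call the Lucene API: the class name AnalyzerSketch, its method names, and the abbreviated stop list are all hypothetical, chosen only for illustration. StandardAnalyzer is left out because its grammar-based tokenization cannot be approximated faithfully in a few lines.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Plain-Java approximation of three rows of the table; NOT Lucene's implementation.
class AnalyzerSketch {

    // Hypothetical, abbreviated stop list (an assumption for this sketch only).
    static final Set<String> STOP_WORDS =
        new HashSet<>(Arrays.asList("a", "an", "and", "the", "of", "to", "in", "is"));

    // WhitespaceAnalyzer row: split at whitespace, leave case and punctuation alone.
    static List<String> whitespace(String text) {
        List<String> tokens = new ArrayList<>();
        for (String t : text.trim().split("\\s+")) {
            if (!t.isEmpty()) tokens.add(t);
        }
        return tokens;
    }

    // SimpleAnalyzer row: split at every nonletter character, then lowercase.
    static List<String> simple(String text) {
        List<String> tokens = new ArrayList<>();
        for (String t : text.split("[^\\p{L}]+")) {
            if (!t.isEmpty()) tokens.add(t.toLowerCase(Locale.ROOT));
        }
        return tokens;
    }

    // StopAnalyzer row: same as simple(), then drop tokens found in the stop list.
    static List<String> stop(String text) {
        List<String> tokens = simple(text);
        tokens.removeIf(STOP_WORDS::contains);
        return tokens;
    }
}
```

Calling the three methods on "The quick brown fox jumped over XY&Z Corporation" shows the progression: whitespace keeps "The" and "XY&Z" intact, simple lowercases everything and splits "XY&Z" into "xy" and "z", and stop additionally drops "the".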