Skip to contents

Tools for Text Analytics

Tools that support “Natural Language Processing” for text analytics.

bitNLP bitNLP-package
3rd party package of Bit2R with Text Analytics function

텍스트 데이터 전처리 기능

filter_text()
Filter data based on string matches of text data
replace_text() concat_text() split_text() remove_text()
Replace/remove/join/separate strings in text data
append_userdic_meta()
Write to the user-defined noun dictionary file.
get_meta() set_meta()
Meta information processing for text data pre-processing
get_userdic_meta()
Query the user-defined person dictionary file.
get_spacing()
Korean automatic spacing
get_ngrams()
Tokenization with N-gram
tokenize_noun_ngrams()
N-gram Tokenizer
unnest_noun_ngrams()
Wrapper around unnest_tokens for n-grams of noun
collapse_noun()
Extract Collapsed Noun

텍스트 데이터 탐색 기능

explore_docs()
Text Data Explorer

형태소분석 기능

install_mecab_ko()
Installation of Eunjeonhan morpheme analyzer and dic
regist_mecab_ko()
Register the path where Mecab-Ko is installed
morpho_mecab()
part-of-speech tagger based on mecab-ko morphology analyzer

감성분석 기능

get_opinion()
KOSAC(Korean Sentiment Analysis Corpus) Sentiment Analysis
get_polarity()
KNU Korean Sentiment Dictionary Sentiment Analysis

공동발생 분석

collocate()
Calculate table for co-occurrence analysis
coll_scores()
Calculate t-score and mutual information score

기타 텍스트 처리 기능

has_final_consonant()
Test whether the final consonant of Korean terms

텍스트 데이터

buzz
Naver Cafe Post Scraping Data
rest_area
Highway rest area related buzz
president_speech
President's Speech
movie_ratings_train movie_ratings_test
Naver sentiment movie corpus v1.0
polarity
KOSAC(Korean Sentiment Analysis Corpus) sentiment dictionary
sentiment_dic
KNU Korean Sentiment Dictionary

형태소분석기 사전 관리

add_sysdic()
Add user-defined dictionary files to system dictionary.
edit_termcost()
Modify the word cost of a word in a dictionary definition file.
get_plan_cost()
Search for tokenizer plans based on word cost
append_userdic_meta()
Write to the user-defined noun dictionary file.
create_userdic()
create user dictionary with user-defined dictionary files.
get_userdic_meta()
Query the user-defined person dictionary file.
update_userdic()
update user dictionary with user-defined dictionary files.