대칭 조건부 확률과 TF-IDF 기반 텍스트 분류를 위한 N-gram 특질 선택

논문상세정보
' 대칭 조건부 확률과 TF-IDF 기반 텍스트 분류를 위한 N-gram 특질 선택' 의 주제별 논문영향력
논문영향력 선정 방법
논문영향력 요약
주제
  • 공학, 공업일반
  • feature selection
  • n-gram
  • symmetrical conditional probability
  • term frequency-inverse document frequency
  • text classification
동일주제 총논문수 논문피인용 총횟수 주제별 논문영향력의 평균
5,736 0

0.0%

' 대칭 조건부 확률과 TF-IDF 기반 텍스트 분류를 위한 N-gram 특질 선택' 의 참고문헌

  • 텍스트마이닝을 활용한 산업공학 학술지의 논문 주제어간 연관관계 연구
    조수곤 대한산업공학회지 38 (1) : 67 ~ 73 [2012]
  • Using TF-IDF to Determine Word Relevance in Document Queries
    Ramos, J. Proceedings of the First Instructional Conference on Machine Learning [2003]
  • UCI Machine Learning Repository
  • Translating Collocations for Bilingual Lexicons : A Statistical Approach
    Smadja, F. Computation Linguistics 22 (1) : 1 ~ 38 [1996]
  • Towards Automatic Building of Document Keywords
    Silva, J. Proceedings of the 23rd International Conference on Computational Linguistics : Posters : 1149 ~ 1157 [2010]
  • The Text Mining Handbook : Advanced Approaches in Analyzing Unstructured Data
    Feldman, R. Cambridge University Press [2007]
  • Text Classification from Labeled and Unlabeled Documents using EM
    Nigam, K. Machine Learning 39 (2-3) : 103 ~ 134 [2000]
  • Syntactic N-grams as Machine Learning Features for Natural Language Processing
    Sidorov, G. Expert Systems with Applications 41 : 853 ~ 860 [2014]
  • Scikit-learn : Machine Learning in Python
    Pedregosa, F. Journal of Machine Learning Research 12 : 2825 ~ 2830 [2011]
  • Python Language Reference, Version 2.7
  • Opinosis : A Graph Based Approach to Abstractive Summarization of Highly Redundant Opinions
    Ganesan, K. Proceedings of the 23rd International Conference on Computational Linguistics [2010]
  • N-Gram Feature Selection for Authorship Identification, Artificial Intelligence : Methodology, Systems, and Applications
    Houvardas, J. Springer : 77 ~ 86 [2006]
  • Modeling General and Specific Aspects of Documents with a Probabilistic Topic Model
    Chemudugunta, C. Smyth, P. Steyvers, M. Advances in Neural Information Processing Systems 19 : Proceedings of the 2006 Conference 19 [2007]
  • Learning to Classify Short and Sparse Text & Web with Hidden Topics from Large-scale Data Collections
    Phan, X. Proceedings of the 17th International Conference : 91 ~ 100 [2008]
  • Is Naïve Bayes a Good Classifier for Document Classification?
    Ting, S. L. International Journal of Software Engineering and Its Applications 5 (3) : 37 ~ 46 [2011]
  • Introduc- tion to Modern Information Retrieval
    Salton, G. McGraw-Hill Book Company [1983]
  • Improving Gender Classification of Blog Authors
    Mukherjee, A Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing : 207 ~ 217 [2010]
  • Improved Feature Selection Approach TFIDF in Text Mining
    Jing, L. Proceedings of the First International Conference on Machine Learning and Cybernetics : 944 ~ 946 [2002]
  • Discriminating Gender on Twitter
    Burger, J. D. Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing : 1301 ~ 1309 [2011]
  • Comparing and Combining Dimension Reduction Techniques for Efficient Text Clustering
    Tang, B. Proceeding of SIAM International Workshop on Feature Selection for Data Mining : 17 ~ 26 [2005]
  • Classification of Text Documents
    Li, Y. H. The Computer Journal 41 (8) : 537 ~ 546 [1998]
  • Class-Based N-gram Models of Natural Language
    Brown, P. F. Computational linguistics 18 (4) : 467 ~ 479 [1992]
  • A Local Maxima Method and a Fair Dispersion Normalization for Extracting Multiword Units
    da Silva, J. F Sixth meeting on the Mathematics of Language : 369 ~ 381 [1999]
  • A Hybrid Method N-Grams-TFIDF with Radial Basis for Indexing and Classification of Arabic Documents
    Zaki, T. International Journal of Software Engineering and Its Applications 8 (2) : 127 ~ 144 [2014]