nehchal/KeyphraseExtraction

KeyphraseExtraction

Keyphrase extraction using Word2vec and PageRank

Getting started

These instructions will get you a copy of the project up and running on your local machine for development and testing purposes.

Prerequisites

  • spaCy
  • NetworkX
  • Word2vec vectors trained on Wikipedia data
  • Stopword list and topic file (both already included in this repository)

Running

python SgRank.py
python SemRank.py

Algorithm description

Noun chunks are extracted with spaCy and used as the nodes of a graph. Edge weights are computed from syntax heuristics, from semantic similarity, or from a combination of the two. PageRank is then run on the weighted graph to score the noun chunks.
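
The ranking step can be illustrated with a small, dependency-free sketch. It is not the code in SgRank.py or SemRank.py. It assumes the noun chunks have already been extracted (here they are passed in as a plain list), uses toy vectors in place of Word2vec embeddings, substitutes a simple window co-occurrence count for the unspecified syntax heuristics, and uses a hand-written power iteration in place of NetworkX. The names `edge_weights`, `pagerank`, `window`, and `alpha` are hypothetical.

```python
import math
from collections import defaultdict


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def edge_weights(chunks, vectors, window=2, alpha=0.5):
    """Build undirected edge weights between distinct noun chunks.

    chunks  -- noun chunks in document order (assumed already extracted)
    vectors -- dict mapping each chunk to an embedding (toy stand-in for Word2vec)
    alpha   -- assumed mixing factor: 1.0 = proximity only, 0.0 = similarity only
    """
    nodes = sorted(set(chunks))
    # Assumed stand-in for "syntax heuristics": co-occurrence within a window.
    cooc = defaultdict(float)
    for i, a in enumerate(chunks):
        for b in chunks[i + 1:i + 1 + window]:
            if a != b:
                cooc[frozenset((a, b))] += 1.0
    max_count = max(cooc.values(), default=1.0)

    weights = {}
    for i, a in enumerate(nodes):
        for b in nodes[i + 1:]:
            proximity = cooc.get(frozenset((a, b)), 0.0) / max_count
            similarity = max(0.0, cosine(vectors[a], vectors[b]))
            w = alpha * proximity + (1 - alpha) * similarity
            if w > 0:
                weights[(a, b)] = w
    return nodes, weights


def pagerank(nodes, weights, damping=0.85, max_iter=100, tol=1e-9):
    """Weighted PageRank on an undirected graph via power iteration."""
    adj = {n: {} for n in nodes}
    for (a, b), w in weights.items():
        adj[a][b] = w
        adj[b][a] = w
    strength = {n: sum(adj[n].values()) for n in nodes}
    count = len(nodes)
    rank = {n: 1.0 / count for n in nodes}
    for _ in range(max_iter):
        # Rank held by isolated nodes is spread evenly over all nodes.
        dangling = sum(rank[n] for n in nodes if strength[n] == 0)
        new_rank = {}
        for n in nodes:
            incoming = sum(rank[m] * w / strength[m] for m, w in adj[n].items())
            new_rank[n] = (1 - damping) / count + damping * (incoming + dangling / count)
        delta = sum(abs(new_rank[n] - rank[n]) for n in nodes)
        rank = new_rank
        if delta < tol:
            break
    return rank


if __name__ == "__main__":
    # Toy input: chunk order and 2-d vectors are made up for illustration.
    chunks = ["keyphrase extraction", "word vectors", "graph ranking",
              "keyphrase extraction", "noun chunks"]
    vectors = {
        "keyphrase extraction": [0.9, 0.1],
        "word vectors": [0.7, 0.3],
        "graph ranking": [0.2, 0.8],
        "noun chunks": [0.6, 0.4],
    }
    nodes, weights = edge_weights(chunks, vectors)
    scores = pagerank(nodes, weights)
    for phrase in sorted(scores, key=scores.get, reverse=True):
        print(f"{scores[phrase]:.3f}  {phrase}")
```

Setting `alpha` to 1.0 or 0.0 in this sketch corresponds to using only one of the two weighting schemes. The intermediate values correspond to the "both" option described above.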
