Create Your Own LSA Text Summarizer in Python

If you need to read a long book or article but have no time, a summarizer can help. There are two ways to get a summary of a text. The first is a summary written by a human: you can google a summary of the book. But if you cannot find the summary you want, a summarization machine can help. I have already explained how to create a summarizer using Luhn and LexRank; in this article I will talk about the LSA method. LSA stands for Latent Semantic Analysis, a computer-based summarization algorithm.

In this article, you can learn how to create a summarizer with the LSA method. The process is automated with Python and sumy. Below I will walk through the three steps to create an LSA summarizer tool.

LSA summarization is one of the newer methods. An LSA-based summarizer uses the Latent Semantic Analysis algorithm to build a summary of a long text.
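To give a feel for what an LSA-based summarizer does under the hood, here is a minimal pure-Python sketch. It is not how sumy implements LSA: it assumes a naive sentence splitter and raw word counts, and it keeps only the single strongest latent topic, found by power iteration instead of a full singular value decomposition. All function names are made up for illustration.

```python
import math
import re

WORD = re.compile(r"[a-z']+")


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def term_sentence_matrix(sentences):
    """Rows are words, columns are sentences, cells are raw word counts."""
    vocabulary = sorted({w for s in sentences for w in WORD.findall(s.lower())})
    row_of = {word: i for i, word in enumerate(vocabulary)}
    matrix = [[0.0] * len(sentences) for _ in vocabulary]
    for column, sentence in enumerate(sentences):
        for word in WORD.findall(sentence.lower()):
            matrix[row_of[word]][column] += 1.0
    return matrix


def strongest_topic_weights(matrix, sentence_total, iterations=100):
    """Approximate the first right singular vector by power iteration.

    Each entry says how strongly one sentence expresses the dominant topic.
    """
    n = sentence_total
    weights = [1.0 / math.sqrt(n)] * n
    for _ in range(iterations):
        # Multiply by the matrix, then by its transpose.
        word_scores = [sum(row[j] * weights[j] for j in range(n)) for row in matrix]
        updated = [
            sum(row[j] * score for row, score in zip(matrix, word_scores))
            for j in range(n)
        ]
        norm = math.sqrt(sum(x * x for x in updated))
        if norm == 0.0:
            break
        weights = [x / norm for x in updated]
    return weights


def summarize(text, sentence_count=1):
    """Return the top-weighted sentences, kept in their original order."""
    sentences = split_sentences(text)
    if not sentences:
        return []
    weights = strongest_topic_weights(term_sentence_matrix(sentences), len(sentences))
    best = sorted(range(len(sentences)), key=lambda j: weights[j], reverse=True)
    return [sentences[j] for j in sorted(best[:sentence_count])]


sample = "Cats chase mice. Cats and mice share the house with cats. Bananas are yellow."
print(summarize(sample, 1))
```

The sentences that share the most vocabulary with the rest of the text end up with the largest weights, which is the intuition behind LSA summarization. A real implementation such as sumy works with several topics at once.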

How to make an LSA summary:

First, we have to install a programming language, Python.
Next, we install an open-source Python library, sumy.
Finally, we finish up with a small piece of code that is ready to run.
By reading this article, you get a tool that summarizes text with the LSA method. You can use it whenever you like, for whatever you like, for free.

Let's start with the first step: installing the Python programming language. Python is a programming language that lets us give a set of instructions to the computer.

Python is the programming language you need to create the LSA summarizer. You could build a summarizer in another language, but this tutorial uses Python. Why? Because I am more familiar with it than with other languages such as C++, PHP, Java, or JavaScript. To install Python, download the Python installer and run it. For more detail on installation, click here. This paragraph is not a detailed tutorial, but the link gives complete installation instructions. After you install Python, go to the next step.
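To confirm that the installation worked, you can run a short check like the one below in Python. It is only a sketch: the helper name python_is_recent_enough and the (3, 0) minimum are assumptions for illustration, so adjust the minimum to whatever your setup needs.

```python
import sys


def python_is_recent_enough(minimum=(3, 0)):
    """Return True if the running interpreter is at least the given version."""
    return sys.version_info >= minimum


print("Running Python", sys.version.split()[0])
if python_is_recent_enough():
    print("Python is installed and ready for the next step.")
else:
    print("This Python is older than expected; consider installing a newer one.")
```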

Next, we will install a Python library suited to summarization, and to LSA summarization in particular: sumy. Sumy is a Python library that lets you summarize text with several methods, including LexRank, Luhn, and LSA. We don't have to reinvent the wheel to program a summarizer; we can use sumy. It is one of several summarizers on GitHub. To install it, open a terminal (Linux/Mac) or command prompt (Windows) and type pip install sumy. Your computer will then install sumy, provided you have an internet connection.

If your computer cannot install it with pip from the command prompt or terminal, you can download it from … and then open the file in the command prompt.
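Whichever way you installed it, you may want to confirm that Python can actually find sumy before moving on. A minimal sketch, using only the standard library (the helper name is_installed is made up for this example):

```python
import importlib.util


def is_installed(module_name):
    """Return True if Python can locate a top-level module with this name."""
    return importlib.util.find_spec(module_name) is not None


if is_installed("sumy"):
    print("sumy is installed.")
else:
    print("sumy is missing; try running: pip install sumy")
```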

Once sumy is installed, the last step is to run the program with the short code below. Copy the code and paste it into Notepad, then save the file with a .py extension instead of the default .txt extension. For example, save it in a folder named “sumy folder”.

Next, create a blank Notepad file in the same folder and name it source.txt. The name matters because the code opens source.txt; if you want a different name, you have to change it in the code as well.
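If you prefer, Python itself can create source.txt instead of Notepad. The sketch below assumes UTF-8 text; the function name write_source and the sample sentences are placeholders for illustration. Note that it overwrites any existing file with the same name.

```python
from pathlib import Path


def write_source(text, path="source.txt"):
    """Write the text to be summarized into a plain-text file and return its path."""
    target = Path(path)
    target.write_text(text, encoding="utf-8")
    return target


if __name__ == "__main__":
    # Placeholder text; replace it with the long text you want to summarize.
    sample = "This is the first sentence. This is the second sentence."
    print("Wrote", write_source(sample))
```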

# Import the parts of sumy that we need
from sumy.parsers.plaintext import PlaintextParser  # plain-text parser; other parsers are available for HTML etc.
from sumy.nlp.tokenizers import Tokenizer
from sumy.summarizers.lsa import LsaSummarizer  # we're choosing LSA; other algorithms are also built in

file = "source.txt"  # name of the plain-text file
parser = PlaintextParser.from_file(file, Tokenizer("english"))
summarizer_lsa = LsaSummarizer()

summary_2 = summarizer_lsa(parser.document, 5)  # summarize the document in 5 sentences

for sentence in summary_2:
    print(sentence)  # print() with parentheses, as Python 3 requires
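If you want to reuse the summarizer with other files or a different number of sentences, the same steps can be wrapped in a function. This is a minimal sketch for Python 3: the name summarize_file and its parameters are my own choices for illustration, and the sumy imports sit inside the function only so that a missing installation shows up when the function is called.

```python
def summarize_file(path="source.txt", sentence_count=5, language="english"):
    """Return an LSA summary of a plain-text file as a list of strings."""
    if sentence_count < 1:
        raise ValueError("sentence_count must be at least 1")

    # Imported here so that a missing sumy install fails at call time.
    from sumy.parsers.plaintext import PlaintextParser
    from sumy.nlp.tokenizers import Tokenizer
    from sumy.summarizers.lsa import LsaSummarizer

    parser = PlaintextParser.from_file(path, Tokenizer(language))
    summarizer = LsaSummarizer()
    # The summarizer returns sentence objects; convert them to plain strings.
    return [str(sentence) for sentence in summarizer(parser.document, sentence_count)]
```

Calling summarize_file("source.txt", 3) should give you a list of three sentences that you can print or save. If the tokenizer reports missing NLTK data on the first run, the error message names the resource to download with NLTK's downloader.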

Then copy the long text that you want to summarize and paste it into source.txt. After that, open the .py file in Python IDLE and run it. I have a separate article about Python IDLE.
You then get a summary of the long text.

Those are the three steps to create an LSA summarizer tool with Python and sumy. Step one is installing Python. Step two is installing sumy once Python is in place. Step three is running the short code that uses sumy's LSA method on your text. After that, you can enjoy your LSA summarizer tool and get a summary of any long text you want to study.
