Abstract
All the news sites these days have RSS feeds which are used to publish flash news on their sites. The main objective of this website is to gather news from different sites and publish it in one location and intelligent ways of searching it. The major drawback of RSS feeds in general is that they are not organized in a proper manner. They tend to create more traffic and demands on the server. Most readers usually prefer a whole update and hence still access the site. Some users prefer email updates over subscribing to an RSS feed system. The system collects all the RSS feeds from different sites and organizes them in a database in a categorized manner. In this process I have implemented an online newspaper website. This site is intended to collect all the RSS feeds from different online news sites and put them in a staging area. Once they are put in the staging area we push the data onto a database. If the site does not have RSS feeds we create one for it. Once they are put in the database another process is executed and generates xhtml pages for different sections. The next process is to implement intelligent search technique to make a searchable database, by means of stemming. Stemming mechanism includes different algorithms, which would be used in this project. The language used is Advanced Java with SQL Server as backend. The categories considered in this project are National, World, Sports, Business, Religion, Regional, Horoscope and Politics. The project is implemented in three parts: The first part collects all the RSS feeds and stores them in a temporary staging area. The rss.xml file is collected from news provider's site and parsed by Rome RSS parsing library. The second part loads the database tables from the staging area. A database connection class named DatabaseConnection.java has been used for creating a connection with the database and storing the data to the database from the temporary staging area. The third part takes the data from database and generates html files for each category, implements and tests the searching algorithm. I have used stemming technique, which is a search algorithm as a part of advanced java for implementation purposes. It searches for similar keywords more specifically. We have a method named searchRoot() for searching the root of the word. Once we get the root we search for other forms of the root by using searchForm() method. We have a DAO (data access object) class for the news table. We can search for news from the database by using a given keyword of the DAO class.