998guide.com open source RSS search engine.
Building an RSS search engine with open source software.
Technical structure
The project is based on the following tools and software:
- Lucene: full-text indexing and search
- Heaton spider: base classes for building an Internet crawler
- JDOM: parses, manipulates, and outputs XML using standard Java constructs
- Rome: a set of Atom/RSS Java utilities that make it easy to work in Java with most syndication formats
- Java 2: Java virtual machine
- Apache: web server for serving the project's web pages
- MySQL: open source database
- Tomcat: web server for running JSP and Java applications
- Linux: open source OS
We will build the following applications:
- lib: base library for the project
- crawler: crawls the Internet for RSS files
- dao: database access utilities that manage database access and operations
- indexer: builds the full-text index
- searcher: executes full-text searches
- portal: web pages for end users, including HTML, JSP, XML, etc.
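As a rough illustration of how the crawler, indexer, and searcher applications listed above could hand data to one another, here is a minimal sketch. It is not the project's code: it uses only JDK classes, so the built-in DOM parser stands in for Rome/JDOM and a small in-memory word map stands in for Lucene. The class name RssPipelineSketch, its methods, and the example.com feed are all hypothetical, and the feed is assumed to be already downloaded as RSS 2.0 text.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Hypothetical sketch; JDK-only stand-ins for Rome/JDOM (parsing) and Lucene (indexing).
public class RssPipelineSketch {

    /** One feed entry; only the fields this sketch needs. */
    public static final class Item {
        public final String title;
        public final String link;
        public Item(String title, String link) { this.title = title; this.link = link; }
    }

    /** Crawler stand-in: parse RSS 2.0 text that has already been fetched. */
    public static List<Item> parse(String rssXml) throws Exception {
        Document doc = DocumentBuilderFactory.newInstance()
                .newDocumentBuilder()
                .parse(new InputSource(new StringReader(rssXml)));
        NodeList nodes = doc.getElementsByTagName("item");
        List<Item> items = new ArrayList<>();
        for (int i = 0; i < nodes.getLength(); i++) {
            Element e = (Element) nodes.item(i);
            items.add(new Item(text(e, "title"), text(e, "link")));
        }
        return items;
    }

    /** Text of the first child element with the given tag, or "" if absent. */
    private static String text(Element parent, String tag) {
        NodeList list = parent.getElementsByTagName(tag);
        return list.getLength() == 0 ? "" : list.item(0).getTextContent().trim();
    }

    /** Indexer stand-in: map each lower-cased title word to the positions of items containing it. */
    public static Map<String, Set<Integer>> index(List<Item> items) {
        Map<String, Set<Integer>> index = new HashMap<>();
        for (int i = 0; i < items.size(); i++) {
            for (String word : items.get(i).title.toLowerCase(Locale.ROOT).split("\\W+")) {
                if (!word.isEmpty()) {
                    index.computeIfAbsent(word, k -> new TreeSet<Integer>()).add(i);
                }
            }
        }
        return index;
    }

    /** Searcher stand-in: single-term, case-insensitive lookup. */
    public static List<Item> search(String term, List<Item> items, Map<String, Set<Integer>> index) {
        List<Item> hits = new ArrayList<>();
        Set<Integer> positions = index.get(term.toLowerCase(Locale.ROOT));
        if (positions != null) {
            for (int i : positions) {
                hits.add(items.get(i));
            }
        }
        return hits;
    }

    public static void main(String[] args) throws Exception {
        // Made-up feed content for demonstration only.
        String xml = "<rss version=\"2.0\"><channel><title>Demo feed</title>"
                + "<item><title>Lucene indexing tips</title><link>http://example.com/1</link></item>"
                + "<item><title>Tomcat and JSP setup</title><link>http://example.com/2</link></item>"
                + "</channel></rss>";
        List<Item> items = parse(xml);
        Map<String, Set<Integer>> idx = index(items);
        for (Item hit : search("lucene", items, idx)) {
            System.out.println(hit.title + " -> " + hit.link);
        }
    }
}
```

In a real build, each stage would presumably live in its own application and pass data through the database (dao) and an on-disk Lucene index rather than through in-memory collections as shown here.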
The project is still in progress. If you are interested, you can contact us and work with us (contact: wyyruyan@gmail.com).
You can see the demo version at: http://search.998guide.com/rss.jsp
We plan to release it on 01/11/2006.