hello every one
i've aproject about mini_search_engine
and this is the what's doctor wrote to us
please i want some ideas about that and the planner of the project
thanks alot
i've aproject about mini_search_engine
and this is the what's doctor wrote to us
You are given a folder of input text or HTML files (say 50 files) and a set of keywords (say 40 keyword).
Build an efficient data structure (an index file) using hashing that will provide information as which files
contain a certain keyword and in which line. The structure should be saved on hard disk and loaded in
memory upon system startup. Then you build a small program in which a user will have a simple
interface asking for a keyword:
Build an efficient data structure (an index file) using hashing that will provide information as which files
contain a certain keyword and in which line. The structure should be saved on hard disk and loaded in
memory upon system startup. Then you build a small program in which a user will have a simple
interface asking for a keyword:
Input Keyword to Search
:
The user then types "water" for example.
The program then searches the index file and should have an output such as:
The user then types "water" for example.
The program then searches the index file and should have an output such as:
The keyword water exists in:
water-resources.html:
Line 212: The water problem in the middle east ....
Line 345: The Nile water is the main source of ...
Line 2003: Water is a blessing from God ...
arab-water-supply.txt:
Line 2: shortage of water in ..
Line 25: Libya has ample supply of water from rain ...
and so on. Can you handle the rule that when a file is changed, the index is re-constructed?water-resources.html:
Line 212: The water problem in the middle east ....
Line 345: The Nile water is the main source of ...
Line 2003: Water is a blessing from God ...
arab-water-supply.txt:
Line 2: shortage of water in ..
Line 25: Libya has ample supply of water from rain ...
please i want some ideas about that and the planner of the project
thanks alot