ask485? - Search Engine
Python, Madoop(MapReduce), HTML, CSS
Project details
Description
- Learned information retrieval concepts like text analysis (tf-idf) and link analysis (PageRank), and parallel data processing with MapReduce.
- Gained experience using a Service-Oriented Architecture to scale dynamic pages and web search.
- Created a segmented inverted index of web pages using a pipeline of MapReduce programs.
- Built an Index server to search each and every single partition output file containing the inverted indexes of every word!
- Built a Search Server, a multi-threaded application (each thread being one Index Server) to aggreate the search results from various Index servers and rank it based on web pages with the best scores.
- Finally, user interface that returns search results just like Google or Bing.
-
Start Date:
June 15th, 2024 -
End Date:
June 21st, 2024 -
Github:
Private -
Course:
EECS485 -
Course Topic:
Web Systems
7+
Years of Coding Experience
25
Completed CS Projects
3+