Based on the little-used HTML5 outline spec, investigate&implement an in-browser tool (currently a chrome extension or browser user script) to easily, interactively scrap a documentation web page into an 'index-content' map for (offline) searching.

Motivated by the fact that most scrappers today are command line tools, too tech-savvy.

Currently only target documentation web pages, which are much better structured and so easier to scrap. Also I think these pages benefit most from indexing&scraping.

Looking for mad skills in:

web scrapper indexer documentation

This project is part of:

Hack Week 13

Activity

  • about 3 years ago: cxiong added keyword "documentation" to Interactive Documentation Web Page Scrapper
  • about 3 years ago: cxiong added keyword "indexer" to Interactive Documentation Web Page Scrapper
  • about 3 years ago: cxiong added keyword "scrapper" to Interactive Documentation Web Page Scrapper
  • about 3 years ago: cxiong added keyword "web" to Interactive Documentation Web Page Scrapper
  • about 3 years ago: cxiong started Interactive Documentation Web Page Scrapper
  • Show History

    Comments

    Be the first to comment!

    Similar Projects

    Convert the openATTIC project web site from Typo3 to Nikola (static content generator) by LenzGr

    Overview

    Currently, the [openATTIC...


    Google Hangouts killer: WebRTC-based video conferencing system by ancorgs

    We have some internal systems for videoconferen...


    Charon: A planet-like feed aggregator by hennevogel

    Charon (ˈʃærən) is intended for communities of ...


    logorator: an offline internal analytics tool by dleidi

    There are customer use cases where sharing info...


    Create a web application for configuring laitos - your "Do Everything" software for serious preppers by guohouzuo

    Laitos is an open source project written in go,...