Screen scrape a SharePoint wiki to Markdown
This is largely intended for importing to Confluence with the markdown-to-confluence-uploader or direct entry
This script requires a manual patch to Mechanize described here
Copy config.yml.dist to config.yml and enter your details
| Option | Meaning |
|---|---|
| username | Your SharePoint username |
| password | Your SharePoint password |
| sharepoint_url | The base domain of your SharePoint instance |
| wiki_base_url | The subdirectory of your particular wiki. This ripper will only process pages under this base_url |
| wiki_index | The index file of the wiki to start with. Assumes a big list of links to other pages under wiki_base_url |
| scrape_recursively | Whether to follow links and continue scraping or just the index page |
| content_div_id | The id of the innermost div that your pages all share |
| confluence_space_key | The key of your Confluence space |
| direct_confluence_entry | True if you'll copy and paste the resultant markdown straight into the wiki entry option of Confluence. False if you're using the markdown-to-confluence-uploader |
| add_legacy_link | Whether to add a link to the bottom of each page linking back to the legacy SharePoint source page |