C:\> HttpClone.exe COMMAND [arguments]
A simple website clone, export, and/or publishing utility.
- Content
- Deduplicate - Search the site for duplicate content and redirect or remove the duplicates.
- Import - Imports a url to a specified page.
- Internalize - Imports a url, and then modified all links to source to point to the target page.
- Remove - Removes a single page from the site.
- Rename - Rename a content url from one location to another.
- Exporting
- Archive - Creates a zip file that can be transfered manually to a server or used as a backup.
- Export - Exports the website copy as html to a local directory.
- Hyperlinks
- Links - Lists all the known links for the specified site.
- LinkSource - Tracks down all pages that reference a specified link.
- List - Lists all the known pages for the specified site.
- Relink - Changes all links from one url to another.
- RelinkEx - Changes all links matching an expression to the evaluated target link.
- Optimizations
- Optimize - Apply the optimization rules associated with each document type configured.
- OptimizePage - Apply the optimization rules to a single page.
- Pingbacks
- Pingback - Sends a pingback notice to a website.
- PingReplay - Reads the pingbacks recorded on a live site and replays them.
- Publishing
- Host - Runs a small HTTP host to server the content of the site on the port provided.
- Publish - Publishes the a new snapshot of the website.
- PublishTo - Publishes the a new snapshot of the website to the specified host.
- Searching
- HtmlSearch - Searches the site and prints the rendered html result.
- Index - Creates a clean copy of the search index for the site.
- Like - Displays content similar to the page provided.
- NewTemplate - Recreates the search template from the original content.
- Search - Searches the site and prints the matching pages.
- Security
- cPassword - Specifies the password used to access the client RSA publishing key.
- CreateKeys - Creates the client/server shared keys used for remote publishing.
- sPassword - Transmits the server password used to access the RSA publishing key.
- Viewing
- Dump - Print the details about the content to the console.
- Edit - Edit the content url with the default registered program.
- Notepad - Open the content url with notepad.exe.
- Open - Open the content url with the default registered program.
- Type - Print the content url to the console.
- Websites
- CopySite - Copy one website to another changing all links to the original to the target.
- CrawlSite - Crawl the website provided and store any changed content.
- DeleteSite - Delete the specified website copy.
- UpdateSite - Crawl, Optimize, Deduplicate, and Index the website provided.
Deduplicate
Search the site for duplicate content and redirect or remove the duplicates.
Usage:
C:\> HttpClone.exe DEDUPLICATE [/site=]String [/remove] [/noprompt]Arguments:
- [/site=]String - The root http address of the website copy.
- [/remove] = (False) - True to remove the page and modify source links, otherwise inserts a redirect.
- [/noprompt] = (False) - True to stop prompt for confirmation before changing content.
Import
Imports a url to a specified page.
Usage:
C:\> HttpClone.exe IMPORT [/page=]String [/source=]String [/recursive] [/noprompt]Arguments:
- [/page=]String - The full http address of the page to save the source content to.
- [/source=]String - The full http address of the page you want to import.
- [/recursive] = (False) - True to recursivly import all links within the same domain.
- [/noprompt] = (False) - True to stop prompt for confirmation before overwriting content.
Internalize
Imports a url, and then modified all links to source to point to the target page.
Usage:
C:\> HttpClone.exe INTERNALIZE [/page=]String [/source=]String [/recursive] [/noprompt]Arguments:
- [/page=]String - The full http address of the page to save the source content to.
- [/source=]String - The full http address of the page you want to import.
- [/recursive] = (False) - True to recursivly import all links within the same domain.
- [/noprompt] = (False) - True to stop prompt for confirmation before overwriting content.
Remove
Removes a single page from the site.
Usage:
C:\> HttpClone.exe REMOVE [/page=]StringArguments:
- [/page=]String - The full http address of the page you want to remove.
Rename
Rename a content url from one location to another.
Usage:
C:\> HttpClone.exe RENAME [/page=]String [/source=]String [/redirect]Arguments:
- [/page=]String - The full http address of the page to move the source content to.
- [/source=]String - The full http address of the page you want to move.
- [/redirect] = (True) - True to insert a redirect after moving the content.
Archive
Creates a zip file that can be transfered manually to a server or used as a backup.
Usage:
C:\> HttpClone.exe ARCHIVE [/site=]StringArguments:
- [/site=]String - The root http address of the website copy.
Export
Exports the website copy as html to a local directory.
Usage:
C:\> HttpClone.exe EXPORT [/site=]String [/path=]String [/rebase]Arguments:
- [/site=]String - The root http address of the website copy.
- [/path=]String - The directory to export to.
- [/rebase] = (True) - Changes the fully-qualified links to use file: references.
Links
Lists all the known links for the specified site.
Usage:
C:\> HttpClone.exe LINKS [/site=]String [/verbose] [/validate] [/internal] [/external]Arguments:
- [/site=]String - The root http address of the website copy.
- [/verbose] = (True) - Display detailed information about each link.
- [/validate] = (False) - Check each link and print the status information.
- [/internal] = (True) - Set to false or 0 to omit internal links.
- [/external] = (True) - Set to false or 0 to omit external links.
LinkSource
Tracks down all pages that reference a specified link.
Usage:
C:\> HttpClone.exe LINKSOURCE [/site=]String [/link=]StringArguments:
- [/site=]String - The root http address of the website copy.
- [/link=]String - The target link to search for.
List
Lists all the known pages for the specified site.
Usage:
C:\> HttpClone.exe LIST [/site=]String [/verbose]Arguments:
- [/site=]String - The root http address of the website copy.
- [/verbose] = (False) - Display detailed information about each page or link.
Relink
Changes all links from one url to another.
Usage:
C:\> HttpClone.exe RELINK [/site=]String [/from=]String [/target=]StringArguments:
- [/site=]String - The root http address of the website copy.
- [/from=]String - The original link you want to change.
- [/target=]String - The new link you want to use instead.
RelinkEx
Changes all links matching an expression to the evaluated target link.
Usage:
C:\> HttpClone.exe RELINKEX [/site=]String [/expression=]String [/target=]StringArguments:
- [/site=]String - The root http address of the website copy.
- [/expression=]String - A regular expression to match against the links.
- [/target=]String - The new link, use {0} to insert matched capture groups by ordinal.
Optimize
Apply the optimization rules associated with each document type configured.
Usage:
C:\> HttpClone.exe OPTIMIZE [/site=]StringArguments:
- [/site=]String - The root http address of the website copy.
OptimizePage
Apply the optimization rules to a single page.
Usage:
C:\> HttpClone.exe OPTIMIZEPAGE [/page=]StringArguments:
- [/page=]String - The full http address of the page you want to run optimizations on.
Pingback
Sends a pingback notice to a website.
Usage:
C:\> HttpClone.exe PINGBACK [/source=]String [/target=]StringArguments:
- [/source=]String - The http address of the web page linking to target.
- [/target=]String - The http address of the target paged linked from source.
PingReplay
Reads the pingbacks recorded on a live site and replays them.
Usage:
C:\> HttpClone.exe PINGREPLAY [/site=]String [/source=]StringArguments:
- [/site=]String - The root http address of the website to replay on.
- [/source=]String - The http address of the live website to read pingbacks from.
Host
Runs a small HTTP host to server the content of the site on the port provided.
Usage:
C:\> HttpClone.exe HOST [/site=]String [[/port=]Int32]Arguments:
- [/site=]String - The root http address of the website copy.
- [[/port=]Int32] = (11080) - The tcp/ip port to use for hosting the content.
Publish
Publishes the a new snapshot of the website.
Usage:
C:\> HttpClone.exe PUBLISH [/site=]StringArguments:
- [/site=]String - The root http address of the website copy.
PublishTo
Publishes the a new snapshot of the website to the specified host.
Usage:
C:\> HttpClone.exe PUBLISHTO [/site=]String [/host=]StringArguments:
- [/site=]String - The root http address of the website copy.
- [/host=]String - The root http address of the target website host.
HtmlSearch
Searches the site and prints the rendered html result.
Usage:
C:\> HttpClone.exe HTMLSEARCH [/site=]String [/term=]String [[/page=]Int32]Arguments:
- [/site=]String - The root http address of the website copy.
- [/term=]String - The expression to search for, see http://lucene.apache.org/java/2_4_0/queryparsersyntax.html.
- [[/page=]Int32] = (1) - The result page to return.
Index
Creates a clean copy of the search index for the site.
Usage:
C:\> HttpClone.exe INDEX [/site=]StringArguments:
- [/site=]String - The root http address of the website copy.
Like
Displays content similar to the page provided.
Usage:
C:\> HttpClone.exe LIKE [/page=]StringArguments:
- [/page=]String - The http address of the web page.
NewTemplate
Recreates the search template from the original content.
Usage:
C:\> HttpClone.exe NEWTEMPLATE [/site=]StringArguments:
- [/site=]String - The root http address of the website copy.
Search
Searches the site and prints the matching pages.
Usage:
C:\> HttpClone.exe SEARCH [/site=]String [/term=]String [/newest]Arguments:
- [/site=]String - The root http address of the website copy.
- [/term=]String - The expression to search for, see http://lucene.apache.org/java/2_4_0/queryparsersyntax.html.
- [/newest] = (False) - Order the results by date rather than by best match.
cPassword
Specifies the password used to access the client RSA publishing key.
Usage:
C:\> HttpClone.exe CPASSWORD [/site=]String [/password=]StringArguments:
- [/site=]String - The root http address of the website copy.
- [/password=]String - The password to be used.
CreateKeys
Creates the client/server shared keys used for remote publishing.
Usage:
C:\> HttpClone.exe CREATEKEYS [/site=]String [[/key-size=]Int32] [[/password=]String] [/NoServerPassword]Arguments:
- [/site=]String - The root http address of the website copy.
- [[/key-size=]Int32] = (4096) - The size, in bits, of the RSA cryptographic keys to produce.
- [[/password=]String] - The client password used for access to the RSA publishing key (Empty for prompt).
- [/NoServerPassword] = (False) - Do not generate or require a password for the server's RSA publishing key.
sPassword
Transmits the server password used to access the RSA publishing key.
Usage:
C:\> HttpClone.exe SPASSWORD [/site=]String [/password=]StringArguments:
- [/site=]String - The root http address of the website copy.
- [/password=]String - The password to be used (obtained from CreateKeys).
Dump
Print the details about the content to the console.
Usage:
C:\> HttpClone.exe DUMP [/page=]StringArguments:
- [/page=]String - The http address of the web page.
Edit
Edit the content url with the default registered program.
Usage:
C:\> HttpClone.exe EDIT [/page=]StringArguments:
- [/page=]String - The http address of the web page.
Notepad
Open the content url with notepad.exe.
Usage:
C:\> HttpClone.exe NOTEPAD [/page=]StringArguments:
- [/page=]String - The http address of the web page.
Open
Open the content url with the default registered program.
Usage:
C:\> HttpClone.exe OPEN [/page=]StringArguments:
- [/page=]String - The http address of the web page.
Type
Print the content url to the console.
Usage:
C:\> HttpClone.exe TYPE [/page=]StringArguments:
- [/page=]String - The http address of the web page.
CopySite
Copy one website to another changing all links to the original to the target.
Usage:
C:\> HttpClone.exe COPYSITE [/site=]String [/target=]String [/overwrite]Arguments:
- [/site=]String - The root http address of the website to copy.
- [/target=]String - The root http address of the destination website.
- [/overwrite] = (False) - True to overwrite any existing content.
CrawlSite
Crawl the website provided and store any changed content.
Usage:
C:\> HttpClone.exe CRAWLSITE [/site=]StringArguments:
- [/site=]String - The root http address of the website copy.
DeleteSite
Delete the specified website copy.
Usage:
C:\> HttpClone.exe DELETESITE [/site=]String [/noprompt]Arguments:
- [/site=]String - The root http address of the website copy.
- [/noprompt] = (False) - True to suppress confirmation message.
UpdateSite
Crawl, Optimize, Deduplicate, and Index the website provided.
Usage:
C:\> HttpClone.exe UPDATESITE [/site=]String [[/source=]String]Arguments:
- [/site=]String - The root http address of the website copy.
- [[/source=]String] - Overrides the source http address.