Get Page Source
Retrieves the complete HTML source code of the current page.
Common Properties
- Name - The custom name of the node.
- Color - The custom color of the node.
- Delay Before (sec) - Waits in seconds before executing the node.
- Delay After (sec) - Waits in seconds after executing the node.
- Continue On Error - Automation will continue regardless of any error. The default value is false.
info
If Continue On Error property is true, no error is caught when the project is executed, even if the Catch node is used.
Input
- Browser ID - The browser session identifier from the Open Browser node.
Output
- Page Source - The complete HTML source code of the current page, including dynamically loaded content.
Examples
Get HTML Source
Browser ID: {{browser_id}}
Output: {{html_source}}
Save HTML to File
1. Get Page Source node:
Browser ID: {{browser_id}}
Output: {{source}}
2. Write File node:
Path: C:\output\page.html
Content: {{source}}
Parse HTML Content
1. Get Page Source node:
Browser ID: {{browser_id}}
Output: {{html}}
2. DOM Parser nodes to extract data from {{html}}
Check for Specific Content
1. Get Page Source node:
Browser ID: {{browser_id}}
Output: {{source}}
2. Condition: if {{source}} contains "<!-- Specific Comment -->"
Tips
- Returns the rendered HTML including JavaScript-generated content
- Different from viewing page source (Ctrl+U) which shows original HTML
- Useful for advanced data extraction when selectors are complex
- Can be parsed with DOM Parser or Regular Expression nodes
- Large pages will return large amounts of text data
- Save to file for offline analysis
Common Errors
- "Browser ID cannot be empty" - Connect to an Open Browser node
- "Browser session not found" - Ensure the Open Browser node has run successfully
Related Nodes
- Get URL - Get current URL
- Get Title - Get page title
- Get Value - Get specific element values
- Run Script - Execute JavaScript on the page