Get Page Source

Retrieves the complete HTML source code of the current page.

Common Properties

  • Name - The custom name of the node.
  • Color - The custom color of the node.
  • Delay Before (sec) - Time to wait, in seconds, before executing the node.
  • Delay After (sec) - Time to wait, in seconds, after executing the node.
  • Continue On Error - The automation continues even if an error occurs. The default value is false.

Info: If the Continue On Error property is true, errors are not caught when the project is executed, even if a Catch node is used.

Input

  • Browser ID - The browser session identifier from the Open Browser node.

Output

  • Page Source - The complete HTML source code of the current page, including dynamically loaded content.

Examples

Get HTML Source

Browser ID: {{browser_id}}
Output: {{html_source}}

Save HTML to File

1. Get Page Source node:
Browser ID: {{browser_id}}
Output: {{source}}

2. Write File node:
Path: C:\output\page.html
Content: {{source}}

Parse HTML Content

1. Get Page Source node:
Browser ID: {{browser_id}}
Output: {{html}}

2. DOM Parser nodes to extract data from {{html}}
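
For readers who want to see what parsing the returned source amounts to, here is a minimal Python sketch. It is not the DOM Parser node itself and does not use any API of this product; it only relies on Python's standard `html.parser` module. The names `LinkCollector`, `extract_links`, and `sample_html` are hypothetical, and `sample_html` stands in for the `{{html}}` variable.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collects the href value of every <a> tag it encounters."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs; tag names arrive lowercased.
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def extract_links(html):
    """Return all link targets found in an HTML string, in document order."""
    collector = LinkCollector()
    collector.feed(html)
    collector.close()
    return collector.links


# Hypothetical stand-in for the {{html}} output of Get Page Source.
sample_html = (
    '<html><body><a href="/a">A</a><p>text</p>'
    '<a href="https://example.com/b">B</a></body></html>'
)
print(extract_links(sample_html))  # ['/a', 'https://example.com/b']
```

A streaming parser like this tolerates imperfect markup reasonably well, which matters because rendered page source is often not strictly valid HTML.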

Check for Specific Content

1. Get Page Source node:
Browser ID: {{browser_id}}
Output: {{source}}

2. Condition: if {{source}} contains "<!-- Specific Comment -->"
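
A plain "contains" check matches the text exactly, so a comment written with different spacing would be missed. The Python sketch below illustrates a whitespace-tolerant variant of the same check. It assumes the source is available as a string; `contains_comment` and `source` are hypothetical names, not part of the product.

```python
import re


def contains_comment(source, text):
    """Return True if source holds an HTML comment whose body is `text`,
    allowing any amount of whitespace around the body."""
    pattern = r"<!--\s*" + re.escape(text) + r"\s*-->"
    return re.search(pattern, source) is not None


# Hypothetical stand-in for the {{source}} output of Get Page Source.
source = "<html><body><!--Specific Comment--><p>Hello</p></body></html>"

# The exact substring check fails because the spacing differs...
print("<!-- Specific Comment -->" in source)           # False
# ...while the tolerant check still finds the comment.
print(contains_comment(source, "Specific Comment"))    # True
```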

Tips

  • Returns the rendered HTML, including JavaScript-generated content
  • Differs from the browser's View Page Source (Ctrl+U), which shows the original HTML as delivered by the server
  • Useful for advanced data extraction when selectors are complex
  • Can be parsed with DOM Parser or Regular Expression nodes
  • Large pages will return large amounts of text data
  • Save to file for offline analysis
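
As a rough illustration of the regular-expression and page-size tips above, the following Python sketch pulls the page title out of a source string and measures how much text was returned. It is an assumption-laden example rather than the behavior of the Regular Expression node: `page_title`, `size_in_bytes`, and `source` are hypothetical names, and the pattern only handles a simple `<title>` element.

```python
import re


def page_title(source):
    """Return the text of the first <title> element, or None if absent."""
    match = re.search(
        r"<title[^>]*>(.*?)</title>", source, re.IGNORECASE | re.DOTALL
    )
    return match.group(1).strip() if match else None


def size_in_bytes(source, encoding="utf-8"):
    """Return the size of the source once encoded, e.g. when saved to a file."""
    return len(source.encode(encoding))


# Hypothetical stand-in for the {{source}} output of Get Page Source.
source = "<html><head><title> Demo Page </title></head><body>Hi</body></html>"
print(page_title(source))      # Demo Page
print(size_in_bytes(source))   # size of the page text in bytes
```

Regular expressions work for narrow, well-known patterns like this one; for nested or irregular markup a real parser is usually the safer choice.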

Common Errors

  • "Browser ID cannot be empty" - Set the Browser ID input to the Browser ID from an Open Browser node
  • "Browser session not found" - Ensure the Open Browser node has run successfully before this node executes