
How to Create a Single DataPancake Data Source (Script Builder)

Build a script that creates a single data source in DataPancake, initiates its scan, and optionally generates SQL code.

Last updated 1 day ago

Please ensure the Admin setup has been completed before creating your first data source.

This script builder is not available for the following scenarios:

  • Data sources with a pre-configured schema

  • Data sources that contain streaming data, such as Kafka

See "How to Create a DataPancake Data Source (UI)" if needed.

1. Navigate to the "Data Source" script builder

2. On the "Single Data Source" tab, enter the custom data source name

3. (Optional) Change the app database name

The default is DATAPANCAKE

4. Enter the role used to install DataPancake

5. Enter the role that has the SELECT privilege on the database object to be scanned

6. Select the Snowflake Database Object Type

7. Select the warehouse to be used by DataPancake to scan data

8. Enter the Semi-structured Data Format Type

9. Enter the Snowflake database name that contains the data

If the name is case-sensitive, enclose it in double quotes, such as "EXAMPLE_NAME".

10. (Optional) Indicate if the database is shared

11. Enter the name of the schema which contains the database object to be scanned

If the name is case-sensitive, enclose it in double quotes, such as "EXAMPLE_NAME".

12. Enter the name of the database object to be scanned

If the name is case-sensitive, enclose it in double quotes, such as "EXAMPLE_NAME".

13. Enter the name of the column to be scanned

If the name is case-sensitive, enclose it in double quotes, such as "EXAMPLE_NAME".
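The quoting rule used in steps 9 through 13 can be sketched in a few lines of Python. This is only an illustration, assuming you already know the exact name as Snowflake stores it. The helper `format_identifier` and the sample names are hypothetical and are not part of DataPancake. The sketch relies on standard Snowflake behavior: unquoted identifiers resolve as uppercase, so any name with lowercase letters, spaces, or other special characters needs double quotes. It does not check for reserved keywords.

```python
import re

# An identifier that Snowflake accepts without quotes and that is already
# uppercase: starts with a letter or underscore, followed by letters,
# digits, underscores, or dollar signs.
_UNQUOTED_UPPER = re.compile(r"[A-Z_][A-Z0-9_$]*")


def format_identifier(name: str) -> str:
    """Return the name as it could be typed into the script builder.

    Hypothetical helper: `name` is the exact stored name of the object.
    """
    if _UNQUOTED_UPPER.fullmatch(name):
        # Plain uppercase name: no quotes needed.
        return name
    # Case-sensitive or special-character name: wrap in double quotes
    # and double any embedded double quote.
    return '"' + name.replace('"', '""') + '"'


if __name__ == "__main__":
    for example in ("SALES_DB", "salesDb", "raw events"):
        print(example, "->", format_identifier(example))
```

A name such as SALES_DB passes through unchanged, while salesDb and raw events come back wrapped in double quotes.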

14. (Optional) Prevent the scan from starting automatically

If you switch off the automatic start, you must trigger the data source scan manually from the "Scan Data" page.

15. (Optional) Enter a record limit for the initial scan

The "Generate Code After Scan" option automatically creates the output SQL statements, based on the discovered schema, when the scan completes.

SQL code generation is a paid feature, available once you purchase the application or as part of an Enterprise license. It is not enabled in the Trial version.

16. Enable the toggle to add SQL code generation features after the scan is complete

17. Enter the user defined name for the root dynamic table, such as 'dt_<object_name>'

18. Enter the name of the warehouse used to refresh dynamic tables

This warehouse name is included in the generated CREATE DYNAMIC TABLE SQL statements.

19. (Optional) Enter the type of semantic layer view to create

20. Enter the user defined name for the root level view, such as 'vw_<object_name>'
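As a rough checklist for steps 16 through 20, the sketch below records the code generation answers and reports which required ones are still empty. Everything in it is hypothetical: `CodeGenInputs`, `suggest_names`, `missing_fields`, and the object name `orders` are illustrative names and are not a DataPancake API. The only details taken from this guide are the `dt_<object_name>` and `vw_<object_name>` naming examples, the optional status of step 19, and the fact that the code generation fields apply only when the toggle in step 16 is enabled.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class CodeGenInputs:
    """Hypothetical record of the answers to steps 16-20."""
    generate_code: bool = False                # step 16 toggle
    root_dynamic_table: Optional[str] = None   # step 17
    refresh_warehouse: Optional[str] = None    # step 18
    semantic_view_type: Optional[str] = None   # step 19 (optional)
    root_view: Optional[str] = None            # step 20


def suggest_names(object_name: str) -> Tuple[str, str]:
    """Apply the 'dt_<object_name>' and 'vw_<object_name>' examples."""
    return f"dt_{object_name}", f"vw_{object_name}"


def missing_fields(inputs: CodeGenInputs) -> List[str]:
    """List the required code generation fields that are still empty."""
    if not inputs.generate_code:
        # Toggle is off: steps 17-20 do not apply.
        return []
    required = {
        "root dynamic table name (step 17)": inputs.root_dynamic_table,
        "refresh warehouse (step 18)": inputs.refresh_warehouse,
        "root view name (step 20)": inputs.root_view,
    }
    return [label for label, value in required.items() if not value]


if __name__ == "__main__":
    dt_name, vw_name = suggest_names("orders")  # "orders" is a made-up object
    inputs = CodeGenInputs(generate_code=True,
                           root_dynamic_table=dt_name,
                           root_view=vw_name)
    print(missing_fields(inputs))
```

Here the refresh warehouse is left empty on purpose, so it is the only entry reported as missing.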

21. Click "Generate Script"

22. Copy the script

23. Open a new Snowflake worksheet and paste the script

24. Review & Execute the script

Click the blue dropdown arrow at the top right, then select "Run All".

25. Ensure successful execution by examining the call result column in the result set

If you mistyped any of the database connection information, the resulting error message will show you what was incorrect.

If the selected warehouse has not already been added to DataPancake, the scan will not be initiated.

If the scan is initiated, the result message will show that the scan has started.

If you choose not to generate code, skip steps 17 through 20 and continue at step 21.