Data Source Configurations

Configure your Data Source to ingest data on your terms

Aisera Platform offers versatile options for data ingestion, allowing you to define the parameters, schedule ingestion times, and transform data on entry and exit.

Not all configurations will be available for all Data Sources.

Auto-Commit

By default, data ingested into the Aisera Platform is not committed automatically; it must be reviewed and committed manually. Uncommitted files are not available to the system. The Auto-Commit option lets you bypass the manual review step, making ingested documents available to the system immediately.

If the auto-commit flag is set to true in your Data Source Configuration, content ingested by the data source will be automatically approved upon ingestion.

Auto Commit Option

Schedule

The Schedule field determines how frequently a Data Source ingests data. Ingestion can run On Demand, or automatically Every 20 Minutes, Every 30 Minutes, Every 45 Minutes, Hourly, Daily, or Weekly. You can also specify a custom schedule using cron expressions.

UNIX Cron Syntax

A cron expression is a string that defines a schedule for running tasks at specific times or intervals. It is commonly used in Unix-based systems for scheduling repetitive tasks. A typical cron expression consists of five or six fields, each representing a different unit of time. The five standard fields, in order, are:

  1. Minute (0 - 59)

  2. Hour (0 - 23)

  3. Day of the month (1 - 31)

  4. Month (1 - 12)

  5. Day of the week (0 - 7, where both 0 and 7 represent Sunday)

For example, the cron expression 0 3 * * 2,4 schedules a task to run at 3:00 AM on Tuesdays and Thursdays.
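To make the field layout concrete, the following Python sketch expands a five-field expression and checks whether a given moment matches it. It is an illustration only, not Aisera's implementation: the names expand_field, parse_cron, and matches are hypothetical, only five-field expressions with *, lists, ranges, and steps are handled, and all five fields must match (the special day-of-month and day-of-week handling found in some cron implementations is not modeled).

```python
from datetime import datetime

# Allowed (low, high) range for each of the five fields, in order:
# minute, hour, day of month, month, day of week.
FIELD_BOUNDS = [(0, 59), (0, 23), (1, 31), (1, 12), (0, 7)]


def expand_field(field: str, low: int, high: int) -> set[int]:
    """Expand one field such as "*", "2,4", "1-5", or "*/15" into a set of ints."""
    values: set[int] = set()
    for part in field.split(","):
        base, _, step_text = part.partition("/")
        step = int(step_text) if step_text else 1
        if base == "*":
            start, end = low, high
        elif "-" in base:
            start_text, end_text = base.split("-", 1)
            start, end = int(start_text), int(end_text)
        else:
            start = end = int(base)
        if step < 1 or not (low <= start <= end <= high):
            raise ValueError(f"field {field!r} is outside {low}-{high}")
        values.update(range(start, end + 1, step))
    return values


def parse_cron(expression: str) -> list[set[int]]:
    """Split a five-field expression and expand each field."""
    fields = expression.split()
    if len(fields) != 5:
        raise ValueError(f"expected 5 fields, got {len(fields)}")
    return [expand_field(f, lo, hi) for f, (lo, hi) in zip(fields, FIELD_BOUNDS)]


def matches(expression: str, moment: datetime) -> bool:
    """Return True if every field of the expression matches the given moment."""
    minute, hour, day, month, weekday = parse_cron(expression)
    # Cron counts Sunday as 0 or 7; isoweekday() counts Monday as 1 and Sunday as 7.
    cron_weekday = moment.isoweekday() % 7
    weekdays = {d % 7 for d in weekday}
    return (
        moment.minute in minute
        and moment.hour in hour
        and moment.day in day
        and moment.month in month
        and cron_weekday in weekdays
    )


# 2 January 2024 was a Tuesday; 3 January 2024 was a Wednesday.
print(matches("0 3 * * 2,4", datetime(2024, 1, 2, 3, 0)))
print(matches("0 3 * * 2,4", datetime(2024, 1, 3, 3, 0)))
```

The first call falls on a Tuesday at 3:00 AM and matches; the second falls on a Wednesday and does not.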

Setting a Custom Recurring Schedule

To set a custom recurring schedule:

  1. Navigate to Settings > Data Source.

  2. Open an existing data source.

  3. Choose the pencil icon in the upper-right corner to edit the data source.

  4. Make sure you're in the General tab of the Edit Data Source window.

  5. Under Schedule, select Custom from the drop-down list.

  6. Select + Add Schedule.

  7. Enter a cron expression in the field that appears.

  8. Choose the + Add Schedule button to add another schedule line. Repeat this step to add more lines, up to a maximum of five.

  9. Click OK.

To quickly check a Data Source's schedule, see the Schedule section on the right side of the Data Source Details page.
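Before entering several schedule lines, it can help to sanity-check them offline. The sketch below is a hypothetical helper (check_schedules is not part of the Aisera Platform) that flags a list exceeding the five-line limit described in the steps above, and any line that does not have five whitespace-separated fields. Treating five fields as the expected form is an assumption based on the field list in this page.

```python
MAX_SCHEDULES = 5      # the Edit Data Source window allows up to five schedule lines
EXPECTED_FIELDS = 5    # assumption: minute, hour, day of month, month, day of week


def check_schedules(expressions: list[str]) -> list[str]:
    """Return a list of problems found; an empty list means no problems were detected."""
    problems: list[str] = []
    if len(expressions) > MAX_SCHEDULES:
        problems.append(
            f"{len(expressions)} schedules given, maximum is {MAX_SCHEDULES}"
        )
    for index, expression in enumerate(expressions, start=1):
        count = len(expression.split())
        if count != EXPECTED_FIELDS:
            problems.append(
                f"line {index}: expected {EXPECTED_FIELDS} fields, got {count}"
            )
    return problems


# Two well-formed lines and one line that is missing a field.
for problem in check_schedules(["0 3 * * 2,4", "30 12 1 * *", "0 3 * *"]):
    print(problem)
```

This only checks the count and shape of the lines; it does not confirm that each field holds a valid value.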
