Class IngestDataSourceJob
- java.lang.Object
-
- com.c12e.cortex.profiles.module.job.IngestDataSourceJob
-
- All Implemented Interfaces:
java.lang.Runnable
public class IngestDataSourceJob extends java.lang.Object implements java.lang.Runnable
Ingest aDataSource
-
-
Field Summary
Fields Modifier and Type Field Description static java.util.function.Function<org.apache.spark.sql.Dataset<org.apache.spark.sql.Row>,org.apache.spark.sql.Dataset<org.apache.spark.sql.Row>>
DEFAULT_DATASOURCE_FORMATTER
java.util.function.Function<CortexDataSourceWriter,CortexDataSourceWriter>
defaultWriterOptions
Default writer options, remove option by returning dswstatic java.util.function.Function<org.apache.spark.sql.Dataset<org.apache.spark.sql.Row>,org.apache.spark.sql.Dataset<org.apache.spark.sql.Row>>
formatDatasetForDataSource
transform onDataSource
DataFrame to alter Timestamp and Date type columns to String Timestamp and Date type columns are not supported in Phoenix at this timejava.util.function.Supplier<java.lang.Boolean>
performFeatureCatalogCalculations
PerformFeatureCatalog
calculations, is an expensive operation and may be set to false depending on the datasetjava.util.function.Function<CortexConnectionReader,CortexConnectionReader>
readerOptions
Expand onCortexConnectionReader
reader optionsjava.util.function.Function<CortexDataSourceWriter,CortexDataSourceWriter>
writerOptions
Expand onCortexDataSourceWriter
writer options
-
Constructor Summary
Constructors Constructor Description IngestDataSourceJob(java.lang.String project, java.lang.String sourceName, CortexContext cortexContext)
Constructor
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description void
run()
-
-
-
Field Detail
-
DEFAULT_DATASOURCE_FORMATTER
public static final java.util.function.Function<org.apache.spark.sql.Dataset<org.apache.spark.sql.Row>,org.apache.spark.sql.Dataset<org.apache.spark.sql.Row>> DEFAULT_DATASOURCE_FORMATTER
-
formatDatasetForDataSource
public static java.util.function.Function<org.apache.spark.sql.Dataset<org.apache.spark.sql.Row>,org.apache.spark.sql.Dataset<org.apache.spark.sql.Row>> formatDatasetForDataSource
transform onDataSource
DataFrame to alter Timestamp and Date type columns to String Timestamp and Date type columns are not supported in Phoenix at this time
-
readerOptions
public java.util.function.Function<CortexConnectionReader,CortexConnectionReader> readerOptions
Expand onCortexConnectionReader
reader options
-
defaultWriterOptions
public java.util.function.Function<CortexDataSourceWriter,CortexDataSourceWriter> defaultWriterOptions
Default writer options, remove option by returning dsw
-
writerOptions
public java.util.function.Function<CortexDataSourceWriter,CortexDataSourceWriter> writerOptions
Expand onCortexDataSourceWriter
writer options
-
performFeatureCatalogCalculations
public java.util.function.Supplier<java.lang.Boolean> performFeatureCatalogCalculations
PerformFeatureCatalog
calculations, is an expensive operation and may be set to false depending on the dataset
-
-
Constructor Detail
-
IngestDataSourceJob
public IngestDataSourceJob(java.lang.String project, java.lang.String sourceName, CortexContext cortexContext)
Constructor- Parameters:
project
- the projectsourceName
- the DataSource namecortexContext
- the context
-
-