char * | null-fallback | Read / Write |
GADatasetSegmentEncoding | segment-encoding | Read / Write |
gpointer | partitioning | Write / Construct Only |
gboolean | infer-dictionary | Read / Write |
GArrowSchema * | schema | Read / Write |
GADatasetSegmentEncoding | segment-encoding | Read / Write |
GEnum
╰── GADatasetSegmentEncoding
GObject
├── GADatasetKeyValuePartitioningOptions
│   ╰── GADatasetHivePartitioningOptions
├── GADatasetPartitioning
│   ╰── GADatasetKeyValuePartitioning
│       ├── GADatasetDirectoryPartitioning
│       ╰── GADatasetHivePartitioning
╰── GADatasetPartitioningFactoryOptions
GADatasetPartitioningFactoryOptions is a class for partitioning factory options.
GADatasetPartitioning is a base class for partitioning classes such as GADatasetDirectoryPartitioning.
GADatasetKeyValuePartitioningOptions is a class for key-value partitioning options.
GADatasetKeyValuePartitioning is a base class for key-value style partitioning classes such as GADatasetDirectoryPartitioning.
GADatasetDirectoryPartitioning is a class for partitioning that uses directory structure.
GADatasetHivePartitioningOptions is a class for Hive-style partitioning options.
GADatasetHivePartitioning is a class for partitioning that uses Hive-style partitioning.
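To make "partitioning that uses directory structure" concrete, the following is a minimal sketch in plain C of the positional idea: the i-th path component is taken as the value of the i-th partition field. It does not use or reproduce the library's implementation; `split_directory_segments` is a hypothetical helper, and the field order (for example year, then month) is assumed to come from the schema.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical helper, not part of Arrow Dataset GLib.
 * Splits `path` in place on '/' and stores a pointer to each leading
 * component in `values`, so that values[i] belongs to the i-th partition
 * field. Returns the number of components stored (at most n_fields). */
size_t
split_directory_segments(char *path, char **values, size_t n_fields)
{
  assert(path != NULL);
  assert(values != NULL);

  size_t n = 0;
  char *cursor = path;
  while (n < n_fields && *cursor != '\0') {
    values[n++] = cursor;
    char *slash = strchr(cursor, '/');
    if (slash == NULL) {
      break; /* last component, nothing left to split */
    }
    *slash = '\0'; /* terminate the component we just stored */
    cursor = slash + 1;
  }
  return n;
}
```

With fields assumed to be year and month, the path `2024/01/data.parquet` would yield `2024` for the first field and `01` for the second. Hive-style paths differ in that each component carries its own field name.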
GADatasetPartitioningFactoryOptions *
gadataset_partitioning_factory_options_new (void);
Since: 11.0.0
gchar *
gadataset_partitioning_get_type_name (GADatasetPartitioning *partitioning);
Since: 6.0.0
GADatasetPartitioning *
gadataset_partitioning_create_default (void);
Since: 12.0.0
GADatasetKeyValuePartitioningOptions *
gadataset_key_value_partitioning_options_new (void);
Since: 11.0.0
GADatasetDirectoryPartitioning *
gadataset_directory_partitioning_new (GArrowSchema *schema,
                                      GList *dictionaries,
                                      GADatasetKeyValuePartitioningOptions *options,
                                      GError **error);
schema | A GArrowSchema that describes all partitioned segments. |
dictionaries | A list of GArrowArray for dictionary data types in schema. | [nullable][element-type GArrowArray]
options | | [nullable]
error | | [nullable]
Since: 6.0.0
GADatasetHivePartitioningOptions *
gadataset_hive_partitioning_options_new (void);
Since: 11.0.0
GADatasetHivePartitioning *
gadataset_hive_partitioning_new (GArrowSchema *schema,
                                 GList *dictionaries,
                                 GADatasetHivePartitioningOptions *options,
                                 GError **error);
schema | A GArrowSchema that describes all partitioned segments. |
dictionaries | A list of GArrowArray for dictionary data types in schema. | [nullable][element-type GArrowArray]
options | | [nullable]
error | | [nullable]
Since: 11.0.0
gchar *
gadataset_hive_partitioning_get_null_fallback (GADatasetHivePartitioning *partitioning);
Since: 11.0.0
The values of GADatasetSegmentEncoding correspond to arrow::dataset::SegmentEncoding values.
Since: 6.0.0
struct GADatasetPartitioningFactoryOptionsClass { GObjectClass parent_class; };
struct GADatasetKeyValuePartitioningOptionsClass { GObjectClass parent_class; };
struct GADatasetKeyValuePartitioningClass { GADatasetPartitioningClass parent_class; };
struct GADatasetDirectoryPartitioningClass { GADatasetKeyValuePartitioningClass parent_class; };
struct GADatasetHivePartitioningOptionsClass { GADatasetKeyValuePartitioningOptionsClass parent_class; };
struct GADatasetHivePartitioningClass { GADatasetKeyValuePartitioningClass parent_class; };
typedef struct _GADatasetDirectoryPartitioning GADatasetDirectoryPartitioning;
typedef struct _GADatasetHivePartitioningOptions GADatasetHivePartitioningOptions;
typedef struct _GADatasetKeyValuePartitioning GADatasetKeyValuePartitioning;
The “null-fallback” property
“null-fallback” char *
The fallback string for null. This is used only by GADatasetHivePartitioning.
Owner: GADatasetHivePartitioningOptions
Flags: Read / Write
Default value: "__HIVE_DEFAULT_PARTITION__"
Since: 11.0.0
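As an illustration of what a null fallback string does, here is a minimal sketch in plain C that parses one Hive-style `key=value` path segment and treats a value equal to the fallback as null. It is not the library's implementation; `parse_hive_segment` and its signature are hypothetical, and the fallback string is passed in by the caller.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical helper, not part of Arrow Dataset GLib.
 * Splits a Hive-style segment "key=value" in place at the first '='.
 * On success, *key points at the key and *value at the value, or *value
 * is NULL when the value equals `null_fallback`.
 * Returns 0 on success, -1 when the segment contains no '='. */
int
parse_hive_segment(char *segment, const char *null_fallback,
                   const char **key, const char **value)
{
  assert(segment != NULL && null_fallback != NULL);
  assert(key != NULL && value != NULL);

  char *equal = strchr(segment, '=');
  if (equal == NULL) {
    return -1; /* not a key=value segment */
  }
  *equal = '\0'; /* terminate the key */
  *key = segment;
  /* A value equal to the fallback string stands for null. */
  *value = (strcmp(equal + 1, null_fallback) == 0) ? NULL : equal + 1;
  return 0;
}
```

Under this sketch, `year=2024` gives the key `year` with the value `2024`, while `country=__HIVE_DEFAULT_PARTITION__` parsed with the default fallback gives the key `country` with a null value.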
The “segment-encoding” property
“segment-encoding” GADatasetSegmentEncoding
After splitting a path into components, decode the path components before parsing according to this scheme.
Owner: GADatasetKeyValuePartitioningOptions
Flags: Read / Write
Default value: GADATASET_SEGMENT_ENCODING_URI
Since: 11.0.0
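The decoding step can be sketched as follows, assuming that the URI scheme means ordinary percent-decoding (`%XX` becomes the byte with hexadecimal value `XX`). This is plain C for illustration only; `uri_decode_segment` and `hex_digit_value` are hypothetical helpers, not library functions.

```c
#include <assert.h>
#include <stddef.h>

/* Returns the value of a hexadecimal digit, or -1 for any other character. */
int
hex_digit_value(char c)
{
  if (c >= '0' && c <= '9') return c - '0';
  if (c >= 'a' && c <= 'f') return c - 'a' + 10;
  if (c >= 'A' && c <= 'F') return c - 'A' + 10;
  return -1;
}

/* Hypothetical helper, not part of Arrow Dataset GLib.
 * Percent-decodes one path segment in place. Returns 0 on success and
 * -1 on a malformed escape, in which case the buffer may be left
 * partially decoded. */
int
uri_decode_segment(char *segment)
{
  assert(segment != NULL);

  char *out = segment; /* write position, never ahead of the read position */
  const char *in = segment;
  while (*in != '\0') {
    if (*in == '%') {
      int high = hex_digit_value(in[1]);
      /* Only look at in[2] when in[1] was a valid digit (not the terminator). */
      int low = (high < 0) ? -1 : hex_digit_value(in[2]);
      if (low < 0) {
        return -1;
      }
      *out++ = (char)(high * 16 + low);
      in += 3;
    } else {
      *out++ = *in++;
    }
  }
  *out = '\0';
  return 0;
}
```

For example, a segment stored on disk as `New%20York` would be parsed as the value `New York` after this step.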
The “partitioning” property
“partitioning” gpointer
The raw std::shared_ptr<arrow::dataset::Partitioning> *.
Owner: GADatasetPartitioning
Flags: Write / Construct Only
The “infer-dictionary” property
“infer-dictionary” gboolean
When inferring a schema for partition fields, yield dictionary-encoded types instead of plain types. This can be more efficient when materializing virtual columns, and Expressions parsed by the finished Partitioning will include dictionaries of all unique inspected values for each field.
Owner: GADatasetPartitioningFactoryOptions
Flags: Read / Write
Default value: FALSE
Since: 11.0.0
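For readers unfamiliar with dictionary encoding, the following plain C sketch shows the general idea: each distinct partition value is stored once, and every occurrence is replaced by an index into that list. It is a generic illustration, not how the library builds its dictionaries; `dictionary_encode` is a hypothetical helper.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical helper, not part of Arrow Dataset GLib.
 * Dictionary-encodes `n` strings. `dictionary` receives each distinct
 * value once, in first-seen order, and indices[i] receives the position
 * of values[i] in `dictionary`. Both output arrays must have room for
 * `n` entries. Returns the number of distinct values. */
size_t
dictionary_encode(const char *const *values, size_t n,
                  const char **dictionary, size_t *indices)
{
  assert(values != NULL && dictionary != NULL && indices != NULL);

  size_t n_unique = 0;
  for (size_t i = 0; i < n; i++) {
    /* Linear search is enough for a sketch with few distinct values. */
    size_t j = 0;
    while (j < n_unique && strcmp(dictionary[j], values[i]) != 0) {
      j++;
    }
    if (j == n_unique) {
      dictionary[n_unique++] = values[i]; /* first time this value is seen */
    }
    indices[i] = j;
  }
  return n_unique;
}
```

Partition columns tend to repeat a small set of values across many rows, which is the case where storing indices instead of full values pays off.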
The “schema” property
“schema” GArrowSchema *
Optionally, an expected schema can be provided, in which case inference will only check discovered fields against the schema and update internal state (such as dictionaries).
Owner: GADatasetPartitioningFactoryOptions
Flags: Read / Write
Since: 11.0.0
The “segment-encoding” property
“segment-encoding” GADatasetSegmentEncoding
After splitting a path into components, decode the path components before parsing according to this scheme.
Owner: GADatasetPartitioningFactoryOptions
Flags: Read / Write
Default value: GADATASET_SEGMENT_ENCODING_URI
Since: 11.0.0