tidyfst package - Read and write fst files - R language

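Description

Wrappers around read_fst and write_fst from the fst package, with different defaults. On import, the result is always returned as a data.table.

On export, the data is always compressed to the smallest possible size.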

Usage

export_fst(x, path, compress = 100, uniform_encoding = TRUE)

import_fst(path, columns = NULL, from = 1, to = NULL, as.data.table = TRUE, old_format = FALSE)

Arguments

x	a data frame to write to disk
path	path to fst file
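compress	A value between 0 and 100 indicating the amount of compression to use. Lower values produce larger files. The default in export_fst is 100 (maximum compression).
uniform_encoding	If TRUE, all elements of a character vector are assumed to share the same encoding. The encoding (latin1, UTF8 or native) of the first non-NA element is used as the encoding for the whole column. This is a correct assumption for most use cases. If uniform_encoding is set to FALSE, no such assumption is made and all elements are converted to the same encoding. The latter is a relatively expensive operation and reduces write performance for character columns.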
columns	Column names to read. The default is to read all columns.
from	Read data starting from this row number.
to	Read data up until this row number. The default is to read to the last row of the stored dataset.
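as.data.table	If TRUE, the result is returned as a data.table object. Any keys set on dataset x before writing are retained, which allows sorted datasets to be stored. This option requires the data.table package to be installed.
old_format	Must be FALSE. The old fst file format is deprecated and can only be read and converted with fst package versions 0.8.0 to 0.8.10.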
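
The columns, from and to arguments can be combined to read only part of a stored dataset. The following is a minimal sketch, assuming the tidyfst package (and its fst and data.table dependencies) is installed; the temporary file path and the chosen row range are illustrative only.

```r
library(tidyfst)

# Write the built-in iris data to a temporary fst file (illustrative path)
path <- tempfile(fileext = ".fst")
export_fst(iris, path)

# Read only two columns, and only rows 51 to 60 of the stored dataset
part <- import_fst(path,
                   columns = c("Sepal.Length", "Species"),
                   from = 51, to = 60)
part

# Clean up the temporary file
unlink(path)
```

Because as.data.table defaults to TRUE, part is a data.table rather than a plain data frame.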

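Examples

## Not run:
library(tidyfst)
# Write iris to an fst file, then read it back as a data.table
export_fst(iris, "iris_fst_test.fst")
iris_dt <- import_fst("iris_fst_test.fst")
iris_dt
unlink("iris_fst_test.fst")  # remove the test file
## End(Not run)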
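
The effect of the compress argument can be checked by writing the same data at both ends of its range and comparing file sizes. This is a rough sketch, assuming tidyfst is installed; the data frame demo and the temporary file names are made up for illustration, and the actual sizes depend on the data.

```r
library(tidyfst)

# A repetitive, highly compressible data frame (hypothetical example data)
demo <- data.frame(
  id    = rep(1:100, times = 1000),
  group = rep(c("a", "b"), each = 50000)
)

f_none <- tempfile(fileext = ".fst")
f_max  <- tempfile(fileext = ".fst")

export_fst(demo, f_none, compress = 0)  # no compression
export_fst(demo, f_max)                 # default: compress = 100

# Compare the resulting file sizes in bytes
size_none <- file.size(f_none)
size_max  <- file.size(f_max)
c(uncompressed = size_none, compressed = size_max)

# Clean up the temporary files
unlink(c(f_none, f_max))
```

Higher compression trades some write time for a smaller file, so a lower compress value may be preferable when write speed matters more than disk space.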