tidytable 0.11.2
Bug fixes
- Empty
dt() works with no warning, #824
tidytable 0.11.1
Functionality improvements
pmap() now preserves names (#809)
- Package developers no longer need to define
.datatable.aware = TRUE when using tidytable
in their package (#269)
Bug fixes
- Attempting to rename columns using
group_by() now leads
to an error (#799)
pmap() family works with data frame inputs (#803)
filter() properly handles when comparing to
NA when .by is used (#812)
paged.print has been removed since it was breaking
console printing (#810)
tidytable 0.11.0
Functionality improvements
distinct() now works on data frames with list columns
(#773)
pivot_wider(): Gains unused_fn argument
(#698)
Deprecations
verb.() functions have been removed
Bug fixes
count() has a more helpful error message when
pick() is used (#778)
unnest(keep_empty = TRUE) preserves vectors of length 0
(#783)
Functions with
notable speed improvements
tidytable 0.10.2
- Patch release to pass CRAN checks for future R release
tidytable 0.10.1
New functions
Bug fixes
case_when(): .default is used when
calculating a common ptype (#751)
pivot_wider(): Works correctly with dates (#759)
Functions with
notable speed improvements
case_match(): When comparing input vector to a length 1
literal
tidytable 0.10.0
Deprecations
verb.() functions are now deprecated
(e.g. mutate.()). Users should now use the
verb() versions of functions.
New functions
Functionality improvements
nest(): Gains .by and .key
args
tidytable(): Auto-names unnamed inputs
slice_*() family: by arg added to match
dplyr semantics. .by can still be called by
the user instead of by.
dt(): Can use let() to add columns even on
older versions of data.table
Bug fixes
expand_grid(): Can use “unique” or “sorted” as column
names (#718)
Breaking changes
slice_head()/_tail(): .by arg
must be explicitly named when slicing by group
Functions with
notable speed improvements
mutate(): When overwriting existing columns on data
frames with a high number of columns (>1000)
tidytable 0.9.2
New functions
cross_join()
group_cols()
map_vec()
pick()
separate_longer_delim()
separate_wider_delim()
separate_wider_regex()
Functionality improvements
separate(): Can now handle when too many or too few new
names are specified in into arg (#666)
unnest_longer(): Gains keep_empty arg
Bug fixes
separate(): Can overwrite separated column without
removal (#680)
na_if(): Properly replaces NAs when
y is a vector (#689)
across(): Anonymous functions are properly translated
(#699)
pivot_wider(): names_sort = FALSE works
(#705)
Deprecations
- Using
by = character() in joins is now deprecated.
Users should instead use cross_join().
tidytable 0.9.1
New functions
nest_join()
- Ranking functions:
min_rank()
dense_rank()
percent_rank()
cume_dist()
Functionality improvements
%in% falls back to base::'%in%' when input
types aren’t compatible with vec_in() (@krterberg, #632)
relocate(): Can rename columns that are moved
- Joins: Can now do cross joins by specifying
by = character()
group_by(): Gains .add argument
ungroup(): Gains ... arguments
- Printing of grouped tidytables now shows grouping variables
Bug fixes
row_number(): Works correctly on 0-row data frame when
overwriting existing column (#639)
slice_head()/slice_tail(): Properly slice
on 0-row data frame (#642)
Functions with
notable speed improvements
tidytable 0.9.0
Dotless functions!
tidytable now exports dotless versions of all functions
(e.g. arrange()/mutate()/etc.).
verb.() syntax is still available to users for
backwards compatibility.
- Users can use both
tidytable and dplyr by
simply loading dplyr after tidytable,
as the verb.() functions won’t be overwritten by
dplyr.
New functions
dplyr-style interface to grouping
group_by()/ungroup()
group_vars()
is_grouped_df()
rowwise()
add_tally()/tally()
case_match()
Functionality improvements
summarize(): Gains .unpack argument
pivot_longer(): Can pass a single ptype or function to
values_ptypes/values_transform/names_ptypes/names_transform
args.
unnest_longer()/unnest_wider(): Can pass a
single ptype or function to ptype/transform
args.
Bug fixes
tidytable::'%in%' dispatches to
base::'%in%' when comparing with a list (#563)
pivot_wider(): Works with column names with spaces
(#569)
pivot_wider():
names_glue="{.value}_{somecolumn}" assigns column names in
correct order (@Darxor, #579)
left_join(): Works when y matching columns in
by is a non matching column of x (#625)
tidytable 0.8.1
New Functions
consecutive_id.()
if_else.()
- Note: Alternate syntax
ifelse.() (introduced in v0.4.0)
will remain in the package
%in%
Functionality improvements
arrange.(): Can use .env inside arrange
expressions
case_when.(): Gains
.default/.ptype/.size args
coalesce.(): Gains
.ptype/.size args
relocate.(): Now properly handles multiple columns
selected in .before or .after
slice_min.()/slice_max.(): Gain
with_ties argument
Bug Fixes
- Nested calls to
across.() are handled properly
(#505)
across.(): Can namespace functions in .fns
arg (#511)
as_tidytable(): Can keep row names when converting a
matrix (#527)
unnest.(): Handles empty data frames (@roboton, #530)
nth.(): Extracts list elements (#534)
arrange.(): Properly sorts NAs (#541)
Deprecations
arrange_across./mutate_across./summarize_across.
are now defunct. They have been deprecated with warnings since v0.6.4
(Jul 2021). Users must now use across.() inside
arrange.()/mutate.()/summarize.().
tidytable 0.8.0
New functions
Functionality improvements
expand_grid.(): Works with data frame inputs
first.()/last.()/nth.(): Gain
na_rm arg
mutate_rowwise.(): Gains .keep,
.before, and .after args
tidytable(): Auto-unpacks unnamed data frame
inputs
Breaking changes
count.(): Default name is now n instead of
N to match dplyr semantics
Bug fixes
bind_cols.(): Correctly handles lists (#446)
Functions with
notable speed improvements
arrange.()
case_when.(): Faster when conditions evaluate to
NA
group_split.()
left_join.(): Faster when keep = FALSE
(the default)
select.()
uncount.()
Other notes
tidytable no longer directly depends on
lifecycle
tidytable 0.7.2
New functions
Functionality improvements
summarize.(): Now sorts by the grouping variables when
.by is used.
dt(): Experimental support for tidy evaluation
as_tidytable(): Now defaults to
.name_repair = "unique" to relax restrictions on creating
new tidytables
Functions with
notable speed improvements
dt(): Faster when adding new columns or updating
existing columns
as_tidytable(): Faster when converting
data.frame or list objects to a tidytable
get_dummies.()
Bug fixes
across.(): .cols arg can find environment
variables in custom functions (#389)
- Joins: Duplicate columns are treated properly when
keep = FALSE (#397)
- Can use anonymous functions inside
map functions inside
mutate.() (#402)
tidytable 0.7.1
New functions
add_count.()
cur_column.()
cur_data.()
Bug fixes
unite.(): New column is always placed before the first
united column in the data frame, regardless of order provided.
- Can use
.data pronoun inside desc() in
arrange.() (#371)
tidytable 0.7.0
Functionality improvements
left/right/inner/full
joins gain suffix and keep args (#354)
rename.(): Can now rename columns by position
(#361)
unite.(): The new column is placed before united
columns to match tidyr behavior
New functions
tidytable 0.6.7
Functionality improvements
.data and .env pronouns now work in
tidytable functions
across.(): Works inside of a named
mutate.(). Useful with
rowSums/rowMeans (#346)
first.()/last.(): Gain a
default arg
mutate.(): Can use stringr::str_glue()
without specifying .envir
replace_na.(): Checks that replace arg
only uses columns that exist in the data frame
New functions
Bug fixes
mutate.(): Can assign to the same column when
.by = character(0) (#332)
tidytable 0.6.6
Functionality improvements
between() is now auto-translated to
between.() when used inside tidytable
functions
%notin% now uses %chin% on character
vectors
Bug fixes
across.(): Can pass extra arguments to a list of
functions (#319)
tidytable 0.6.5
New functions
Bug fixes
- Traceback error messages no longer unnecessarily print the full data
frame (#305)
tidytable 0.6.4
New functions
unnest_longer.()
unnest_wider.()
Functionality improvements
bind_rows.()/bind_cols.()
- Can splice lists of data frames using
!!!
mutate.(): Can use glue::glue() without
specifying .envir
pull.(): Added a name argument
separate.(): Can omit output columns by using
NA in into
tidytable(): Can splice quosures
unnest.(): Added keep_empty arg
Deprecations
- The following functions have all been deprecated due to the addition
of
across.() in v0.6.1
arrange_across.()
mutate_across.()
summarize_across.()
tidytable 0.6.3
New functions
enframe.()
first.()
last.()
Functionality improvements
- Subclasses and attributes are now preserved when using
bind_cols, bind_rows,
group_split, or joins.
complete.(): Added a .by argument
expand.(): Added a .by argument
mutate.(): Auto-names unnamed inputs
transmute.(): Auto-names unnamed inputs
Bug fixes
expand.() with nesting.()
- Additional double vectors can have negative values (#282)
tidytable 0.6.2
New functions
cur_group_id.()/cur_group_rows.()
n_distinct.()
nesting.()
Functionality improvements
mutate.():
- Added
.keep argument
- Added
.before/.after arguments
tidytable(): Added .name_repair
argument
Functions with
notable speed improvements
filter.() when .by is used
tidytable 0.6.1
New functions
Functionality improvements
group_split.(): Added a .named argument.
If .named = TRUE, the resulting list will have names that
indicate the group they are a part of.
pivot_longer.(): Can now use
names_to = c(".value", NA) to drop the id column.
Bug fixes
if_all.()/if_any.()
- No longer try to use
.by columns (#225)
- Can use
n() in filtering (#226)
pivot_longer.()
- No longer converts factor values to character (#202, #234)
- Preserves column order when using
names_to = c(".value", "id") (#235)
tidytable 0.6.0
New functions
Functionality improvements
replace_na.(): Can now replace NULL values
in list-columns
- Splicing lists using
!!! now works in
crossing.() and expand_grid.()
- The following tidyverse functions are now automatically converted to
their tidytable
verb.() equivalents when used inside of
tidytable’s mutate/arrange/filter/summarize/slice.
- Functions that previously failed but now work:
- Functions that previously worked with tidytable but are now
converted to their faster tidytable equivalents:
if_else()/ifelse()
case_when()
desc()
replace_na()
Bug fixes
pivot_longer.(): Correctly pivots unbalanced datasets
when ".value" is used in names_to (@moutikabdessabour, #212)
Functions with
notable speed improvements
mutate_across.()
mutate_rowwise.() when c_across.() is
used
slice.()/slice_head.()/slice_tail.()/slice_min.()/slice_max.()
Other news
tidytable has dropped the tibble package
as a dependency and now uses pillar instead.
tibble was originally used for its trunc_mat()
function that enabled tidytables to print like tibbles do in console.
pillar is the same package that tibble uses in
the background.
tidytable 0.5.9
New functions
Functionality improvements
as_tidytable(): Gains .name_repair and
.keep_rownames args
count.(): Added wt, sort and
name args. (@moutikabdessabour, #196)
mutate.(): Variables are now updated in order when
using “.by” (#166)
pivot_longer.()
- Added
names_prefix arg
- Can now use
".value" in names_to
pivot_wider.(): Can now use ".value" in
names_glue
separate.(): Added convert arg
summarize.(): Variables are now created in order
(#188)
tidytable 0.5.8
Breaking changes
pivot_longer.(): values_drop_na is no
longer the 5th argument in the function
pivot_wider.(): values_fn &
values_fill args are no longer the 6th & 7th args
New functions
c_across.()
extract.()
mutate_rowwise.()
nest.()
slice_sample.()
Functionality improvements
pivot_longer.()
- New args:
names_sep, names_pattern,
names_ptypes, names_transform,
names_repair, values_ptypes,
values_transform, fast_pivot
fast_pivot arg: Faster pivoting. The “names” column
will be a factor instead of character. Default is set to
FALSE to match tidyverse semantics. Note: This option sets
variable.factor = TRUE in data.table::melt(),
which is what leads to it being faster.
pivot_wider.()
- New args:
names_prefix, names_glue,
names_sort, names_repair
tidytable 0.5.7
Deprecations
- The
dt_verb() versions of functions have been removed
from the package. They have been deprecated with warnings since
v0.5.2
Functionality improvements
crossing.(): Now works with data frame inputs (@moutikabdessabour, #143)
distinct.(): Can rename columns (#153)
get_dummies.(): Dummy columns are now sorted in
alphabetical order
pivot_wider.(): Added values_fill
argument
Functions with
notable speed improvements
get_dummies.()
fill.()
slice.()
Bug fixes
mutate.(): Can delete a column using NULL
when .by is provided (#151)
mutate_across.(): Can reference other columns in .fns
call (#145)
slice.(): Works when .by contains all
columns (#158)
unnest.(): Works when the only column is the list
column (#144)
tidytable 0.5.6
Breaking changes
bind_cols.(): Name repair uses
vec_as_names() instead of
vec_as_names_legacy()
Functionality improvements
bind_cols.(): Added .name_repair arg
unnest.(): Added names_sep and
names_repair args
print()
- Added
n, width, and n_extra
args
- Now prints like tibbles in all cases (special thanks to @moutikabdessabour)
slice.(): Can now drop specified rows with negative
numbers
New functions
arrange_across.()
case_when.()
- The old “case when” translation
case.() will remain in
the package, as it is called like data.table::fcase() but
allows for the default to be a vector.
desc.()
Functions with
notable speed improvements
Deprecations
unnest.(): .keep_all arg changed to
.drop
tidytable 0.5.5
Breaking changes
bind_rows.(): Removed .use_names and
.fill args
- These are now assumed to be
TRUE, which matches
dplyr semantics.
complete.(): .fill arg renamed to
fill
pull.(): defaults to var = -1 instead of
var = NULL
- The result is the same when using the default value, but will break
cases where
var = NULL was explicitly called.
slice.(): .by must be named when slicing
by group
Functionality improvements
tidytable now integrates better with tidyverse
functions through vctrs. This means code such as
vec_cbind(tibble::tibble(x = 1:3), tidytable(y = "foo"))
will bind the results into a tidytable.
complete.(): Can now append extra values
pull.(): Numerical selection now works
slice.(): Now uses ... to select which
rows to slice
New functions
Bug fixes
rename.(): Works when there are spaces in the column
names (#109)
Functions with
notable speed improvements
tidytable 0.5.4
Breaking changes
- Column names auto-generated by
summarize_across.() now
use a suffix instead of a prefix
Functionality improvements
mutate_across.(): Added .names arg to help
with naming newly created columns
summarize_across.(): Added .names arg to
help with naming newly created columns
summarize.(): Added .sort arg to
optionally sort the resulting data.table
Bug fixes
transmute.(): Doesn’t drop “by” columns (#98)
slice.()
- Allows gaps in
rows arg (#99)
- Doesn’t reorder columns when using
.by (#101)
full_join.(): No longer returns a keyed data.table
(#102)
Internal
- Import
tibble
- Enables cleaner console printing of tidytables
glimpse() is now reexported
- Will be replaced by
pillar in a later release once
tibble::trunc_mat() & tibble::glimpse()
are moved there
tidytable 0.5.3
Breaking changes
bind_rows.()
use.names arg renamed to .use_names
fill arg renamed to .fill
New functions
complete.()
crossing.()
expand.()
expand_grid.()
uncount.()
Functionality improvements
replace_na.(): Now works on
data.frame/data.table inputs as well as
vectors
Deprecations
desc.() has been deprecated due to incompatibility with
character columns
tidytable 0.5.2
- This version contains general performance improvements
Deprecations
- Group by arg
by has been deprecated for
.by. This will allow “by” to be used as a column name in
mutate.() & summarize.() in future
releases.
- All
dt_verb() functions are soft deprecated
New functions
Functionality improvements
unnest.(): Added .keep_all arg to keep all
list columns that were not unnested
tidytable 0.5.1
New functions
summarize_across.() &
summarise_across.()
Bug fixes:
- Fixed
distinct.() bug where it wasn’t returning unique
rows
tidytable 0.5.0
tidytable now works with quosures
tidyselect v0.1.1 compatibility: Updated functions to
use where()
Functionality improvements
nest_by.(): Added .keep arg
Deprecations
- Old select helpers like
everything.() are now replaced
by normal helpers like everything()
rename_across/_if/_at/_all have been superseded by
rename_with.()
Breaking changes
group_split.(): keep arg renamed to
.keep
New functions
Functions with
notable speed improvements
tidytable 0.4.1
tidytable now utilizes tidyselect &
vctrs
Breaking changes
get_dummies.(): cols arg defaults to
c(is.character, is.factor). The result is the same, but is
more consistent with other enhanced selection functions
mutate_across.(): .funs argument renamed
to .fns to match dplyr 1.0
New functions
Functionality improvements
- Paged printing now works in Rmarkdown
bind_rows.(): Does name checking and fills missing
distinct.(): Added .keep_all argument
fill.(): Preserves column order when using “by”
group_split.(): Added keep arg
nest_by.()
- Can now unnest multiple columns in one call
- data.tables in a list column can now have different ncols and
different column order
select.(): Now allows column renaming
transmute.(): Summary functions
(mean/max/etc.) can now be used
Functions with
notable speed improvements
distinct.()
drop_na.()
relocate.()
Bug fixes
group_split.(): Now returns a list of tidytables
instead of data.tables
left_join.(): Now works with “by” columns of different
names
tidytable 0.4.0
New syntax
- All functions are now written as
verb.()
dt_verb() functions still work, but may be slowly
deprecated in future releases
Functionality improvements
- Enhanced selection now works in “by” calls
filter.() now works with “by”
pivot_wider.() can now be used with an aggregation
function
pull.() defaults to last column in a data.table
New functions
ifelse.()
n.()
nest_by.() replaces dt_group_nest()
row_number.()
transmute.()
unnest.() replaces dt_unnest_legacy()
- Select helper
any_of.()
Functions with
notable speed improvements
fill.()
pivot_longer.()
unnest.()
tidytable 0.3.2
tidytable class
- New
tidytable class is a data.table subclass with
cleaner printing.
- Operates like a data.table in all other cases.
- All tidytable functions automatically convert to
tidytable class in the background.
New functions
- tidytable constructors:
tidytable(): Create a data.table/tidytable
as_tidytable(): Convert to data.table/tidytable
is_tidytable(): Test if an object is a
data.table/tidytable
dt_get_dummies(): Get dummy variables
dt_separate()
dt_pivot_longer() “cols” arg now defaults to
dt_everything() instead of NULL. The result is
the same, but is more consistent with other enhanced selection
functions
.N now works when using dt_slice() &
variants
- Function length limit bug no longer occurs when using
~
in dt_map()
tidytable 0.3.1
- Updated
dt() docs per CRAN instructions
Breaking changes
dt_mutate() & dt_rename() no longer
modify-by-reference
dt_unnest_legacy() drops “keep” argument, but
automatically keeps all non-nested columns.
- New translation is ~5-10x faster than before
Functionality improvements
dt() now automatically converts data.frame input to a
data.table
dt_mutate_across(): Added “by” arg
Bug fixes
dt_left_join() fix so columns are ordered
correctly
tidytable 0.3.0