special-symbols {data.table} | R Documentation |

`.SD`

, `.BY`

, `.N`

, `.I`

, `.GRP`

, and `.NGRP`

are *read-only* symbols for use in `j`

. `.N`

can be used in `i`

as well. `.I`

can be used in `by`

as well. See the vignettes, Details and Examples here and in `data.table`

.
`.EACHI`

is a symbol passed to `by`

; i.e. `by=.EACHI`

.

The bindings of these variables are locked and attempting to assign to them will generate an error. If you wish to manipulate `.SD`

before returning it, take a `copy(.SD)`

first (see FAQ 4.5). Using `:=`

in the `j`

of `.SD`

is reserved for future use as a (tortuously) flexible way to update `DT`

by reference by group (even when groups are not contiguous in an ad hoc by).

These symbols used in `j`

are defined as follows.

`.SD`

is a`data.table`

containing the**S**ubset of`x`

's**D**ata for each group, excluding any columns used in`by`

(or`keyby`

).`.BY`

is a`list`

containing a length 1 vector for each item in`by`

. This can be useful when`by`

is not known in advance. The`by`

variables are also available to`j`

directly by name; useful for example for titles of graphs if`j`

is a plot command, or to branch with`if()`

depending on the value of a group variable.`.N`

is an integer, length 1, containing the number of rows in the group. This may be useful when the column names are not known in advance and for convenience generally. When grouping by`i`

,`.N`

is the number of rows in`x`

matched to, for each row of`i`

, regardless of whether`nomatch`

is`NA`

or`NULL`

. It is renamed to`N`

(no dot) in the result (otherwise a column called`".N"`

could conflict with the`.N`

variable, see FAQ 4.6 for more details and example), unless it is explicitly named; e.g.,`DT[,list(total=.N),by=a]`

.`.I`

is an integer vector equal to`seq_len(nrow(x))`

. While grouping, it holds for each item in the group, its row location in`x`

. This is useful to subset in`j`

; e.g.`DT[, .I[which.max(somecol)], by=grp]`

. If used in`by`

it corresponds to applying a function rowwise.`.GRP`

is an integer, length 1, containing a simple group counter. 1 for the 1st group, 2 for the 2nd, etc.`.NGRP`

is an integer, length 1, containing the number of groups.

`.EACHI`

is defined as `NULL`

but its value is not used. Its usage is `by=.EACHI`

(or `keyby=.EACHI`

) which invokes grouping-by-each-row-of-i; see `data.table`

's `by`

argument for more details.

Note that `.N`

in `i`

is computed up-front, while that in `j`

applies *after filtering in i*. That means that even absent grouping,

`.N`

in `i`

can be different from `.N`

in `j`

. See Examples.
`data.table`

, `:=`

, `set`

, `datatable-optimize`

```
DT = data.table(x=rep(c("b","a","c"),each=3), v=c(1,1,1,2,2,1,1,2,2), y=c(1,3,6), a=1:9, b=9:1)
DT
X = data.table(x=c("c","b"), v=8:7, foo=c(4,2))
X
DT[.N] # last row, only special symbol allowed in 'i'
DT[, .N] # total number of rows in DT
DT[, .N, by=x] # number of rows in each group
DT[, .SD, .SDcols=x:y] # select columns 'x' through 'y'
DT[, .SD[1]] # first row of all columns
DT[, .SD[1], by=x] # first row of all columns for each group in 'x'
DT[, c(.N, lapply(.SD, sum)), by=x] # get rows *and* sum all columns by group
DT[, .I[1], by=x] # row number in DT corresponding to each group
DT[, .N, by=rleid(v)] # get count of consecutive runs of 'v'
DT[, c(.(y=max(y)), lapply(.SD, min)),
by=rleid(v), .SDcols=v:b] # compute 'j' for each consecutive runs of 'v'
DT[, grp := .GRP, by=x] # add a group counter
DT[, grp_pct := .GRP/.NGRP, by=x] # add a group "progress" counter
X[, DT[.BY, y, on="x"], by=x] # join within each group
# .N can be different in i and j
DT[{cat(sprintf('in i, .N is %d\n', .N)); a < .N/2},
{cat(sprintf('in j, .N is %d\n', .N)); mean(a)}]
# .I can be different in j and by, enabling rowwise operations in by
DT[, .(.I, min(.SD[,-1]))]
DT[, .(min(.SD[,-1])), by=.I]
```

[Package *data.table* version 1.14.3 Index]