Bind values to a parameterized/prepared statement
Source:R/15-dbBind.R
, R/25-dbBindArrow.R
dbBind.Rd
For parametrized or prepared statements,
the dbSendQuery()
, dbSendQueryArrow()
, and dbSendStatement()
functions
can be called with statements that contain placeholders for values.
The dbBind()
and dbBindArrow()
functions bind these placeholders
to actual values,
and are intended to be called on the result set
before calling dbFetch()
or dbFetchArrow()
.
The values are passed to dbBind()
as lists or data frames,
and to dbBindArrow()
as a stream
created by nanoarrow::as_nanoarrow_array_stream()
.
dbBindArrow()
is experimental, as are the other *Arrow
functions.
dbSendQuery()
is compatible with dbBindArrow()
, and dbSendQueryArrow()
is compatible with dbBind()
.
Arguments
- res
An object inheriting from DBIResult.
- params
For
dbBind()
, a list of values, named or unnamed, or a data frame, with one element/column per query parameter. FordbBindArrow()
, values as a nanoarrow stream, with one column per query parameter.- ...
Other arguments passed on to methods.
Value
dbBind()
returns the result set,
invisibly,
for queries issued by dbSendQuery()
or dbSendQueryArrow()
and
also for data manipulation statements issued by
dbSendStatement()
.
Details
DBI supports parametrized (or prepared) queries and statements
via the dbBind()
and dbBindArrow()
generics.
Parametrized queries are different from normal queries
in that they allow an arbitrary number of placeholders,
which are later substituted by actual values.
Parametrized queries (and statements) serve two purposes:
The same query can be executed more than once with different values. The DBMS may cache intermediate information for the query, such as the execution plan, and execute it faster.
Separation of query syntax and parameters protects against SQL injection.
The placeholder format is currently not specified by DBI;
in the future, a uniform placeholder syntax may be supported.
Consult the backend documentation for the supported formats.
For automated testing, backend authors specify the placeholder syntax with
the placeholder_pattern
tweak.
Known examples are:
?
(positional matching in order of appearance) in RMariaDB and RSQLite$1
(positional matching by index) in RPostgres and RSQLite:name
and$name
(named matching) in RSQLite
The data retrieval flow
This section gives a complete overview over the flow for the execution of queries that return tabular data as data frames.
Most of this flow, except repeated calling of dbBind()
or dbBindArrow()
,
is implemented by dbGetQuery()
, which should be sufficient
unless you want to access the results in a paged way
or you have a parameterized query that you want to reuse.
This flow requires an active connection established by dbConnect()
.
See also vignette("dbi-advanced")
for a walkthrough.
Use
dbSendQuery()
to create a result set object of class DBIResult.Optionally, bind query parameters with
dbBind()
ordbBindArrow()
. This is required only if the query contains placeholders such as?
or$1
, depending on the database backend.Optionally, use
dbColumnInfo()
to retrieve the structure of the result set without retrieving actual data.Use
dbFetch()
to get the entire result set, a page of results, or the remaining rows. Fetching zero rows is also possible to retrieve the structure of the result set as a data frame. This step can be called multiple times. Only forward paging is supported, you need to cache previous pages if you need to navigate backwards.Use
dbHasCompleted()
to tell when you're done. This method returnsTRUE
if no more rows are available for fetching.Repeat the last four steps as necessary.
Use
dbClearResult()
to clean up the result set object. This step is mandatory even if no rows have been fetched or if an error has occurred during the processing. It is good practice to useon.exit()
orwithr::defer()
to ensure that this step is always executed.
The data retrieval flow for Arrow streams
This section gives a complete overview over the flow for the execution of queries that return tabular data as an Arrow stream.
Most of this flow, except repeated calling of dbBindArrow()
or dbBind()
,
is implemented by dbGetQueryArrow()
,
which should be sufficient
unless you have a parameterized query that you want to reuse.
This flow requires an active connection established by dbConnect()
.
See also vignette("dbi-advanced")
for a walkthrough.
Use
dbSendQueryArrow()
to create a result set object of class DBIResultArrow.Optionally, bind query parameters with
dbBindArrow()
ordbBind()
. This is required only if the query contains placeholders such as?
or$1
, depending on the database backend.Use
dbFetchArrow()
to get a data stream.Repeat the last two steps as necessary.
Use
dbClearResult()
to clean up the result set object. This step is mandatory even if no rows have been fetched or if an error has occurred during the processing. It is good practice to useon.exit()
orwithr::defer()
to ensure that this step is always executed.
The command execution flow
This section gives a complete overview over the flow
for the execution of SQL statements that have side effects
such as stored procedures, inserting or deleting data,
or setting database or connection options.
Most of this flow, except repeated calling of dbBindArrow()
,
is implemented by dbExecute()
, which should be sufficient
for non-parameterized queries.
This flow requires an active connection established by dbConnect()
.
See also vignette("dbi-advanced")
for a walkthrough.
Use
dbSendStatement()
to create a result set object of class DBIResult. For some queries you need to passimmediate = TRUE
.Optionally, bind query parameters with
dbBind()
ordbBindArrow()
. This is required only if the query contains placeholders such as?
or$1
, depending on the database backend.Optionally, use
dbGetRowsAffected()
to retrieve the number of rows affected by the query.Repeat the last two steps as necessary.
Use
dbClearResult()
to clean up the result set object. This step is mandatory even if no rows have been fetched or if an error has occurred during the processing. It is good practice to useon.exit()
orwithr::defer()
to ensure that this step is always executed.
Failure modes
Calling dbBind()
for a query without parameters
raises an error.
Binding too many
or not enough values,
or parameters with wrong names
or unequal length,
also raises an error.
If the placeholders in the query are named,
all parameter values must have names
(which must not be empty
or NA
),
and vice versa,
otherwise an error is raised.
The behavior for mixing placeholders of different types
(in particular mixing positional and named placeholders)
is not specified.
Calling dbBind()
on a result set already cleared by dbClearResult()
also raises an error.
Specification
DBI clients execute parametrized statements as follows:
Call
dbSendQuery()
,dbSendQueryArrow()
ordbSendStatement()
with a query or statement that contains placeholders, store the returned DBIResult object in a variable. Mixing placeholders (in particular, named and unnamed ones) is not recommended. It is good practice to register a call todbClearResult()
viaon.exit()
right after callingdbSendQuery()
ordbSendStatement()
(see the last enumeration item). UntildbBind()
ordbBindArrow()
have been called, the returned result set object has the following behavior:dbFetch()
raises an error (fordbSendQuery()
anddbSendQueryArrow()
)dbGetRowCount()
returns zero (fordbSendQuery()
anddbSendQueryArrow()
)dbGetRowsAffected()
returns an integerNA
(fordbSendStatement()
)dbIsValid()
returnsTRUE
dbHasCompleted()
returnsFALSE
Call
dbBind()
ordbBindArrow()
:For
dbBind()
, theparams
argument must be a list where all elements have the same lengths and contain values supported by the backend. A data.frame is internally stored as such a list.For
dbBindArrow()
, theparams
argument must be a nanoarrow array stream, with one column per query parameter.
Retrieve the data or the number of affected rows from the
DBIResult
object.For queries issued by
dbSendQuery()
ordbSendQueryArrow()
, calldbFetch()
.For statements issued by
dbSendStatements()
, calldbGetRowsAffected()
. (Execution begins immediately after thedbBind()
call, the statement is processed entirely before the function returns.)
Repeat 2. and 3. as necessary.
Close the result set via
dbClearResult()
.
The elements of the params
argument do not need to be scalars,
vectors of arbitrary length
(including length 0)
are supported.
For queries, calling dbFetch()
binding such parameters returns
concatenated results, equivalent to binding and fetching for each set
of values and connecting via rbind()
.
For data manipulation statements, dbGetRowsAffected()
returns the
total number of rows affected if binding non-scalar parameters.
dbBind()
also accepts repeated calls on the same result set
for both queries
and data manipulation statements,
even if no results are fetched between calls to dbBind()
,
for both queries
and data manipulation statements.
If the placeholders in the query are named,
their order in the params
argument is not important.
At least the following data types are accepted on input (including NA):
logical for Boolean values
character (also with special characters such as spaces, newlines, quotes, and backslashes)
factor (bound as character, with warning)
Date (also when stored internally as integer)
POSIXct timestamps
POSIXlt timestamps
difftime values (also with units other than seconds and with the value stored as integer)
lists of raw for blobs (with
NULL
entries for SQL NULL values)objects of type blob::blob
See also
Other DBIResult generics:
DBIResult-class
,
dbClearResult()
,
dbColumnInfo()
,
dbFetch()
,
dbGetInfo()
,
dbGetRowCount()
,
dbGetRowsAffected()
,
dbGetStatement()
,
dbHasCompleted()
,
dbIsReadOnly()
,
dbIsValid()
,
dbQuoteLiteral()
,
dbQuoteString()
Other DBIResultArrow generics:
DBIResultArrow-class
,
dbClearResult()
,
dbFetchArrow()
,
dbFetchArrowChunk()
,
dbHasCompleted()
,
dbIsValid()
Other data retrieval generics:
dbClearResult()
,
dbFetch()
,
dbFetchArrow()
,
dbFetchArrowChunk()
,
dbGetQuery()
,
dbGetQueryArrow()
,
dbHasCompleted()
,
dbSendQuery()
,
dbSendQueryArrow()
Other command execution generics:
dbClearResult()
,
dbExecute()
,
dbGetRowsAffected()
,
dbSendStatement()
Examples
# Data frame flow:
con <- dbConnect(RSQLite::SQLite(), ":memory:")
dbWriteTable(con, "iris", iris)
# Using the same query for different values
iris_result <- dbSendQuery(con, "SELECT * FROM iris WHERE [Petal.Width] > ?")
dbBind(iris_result, list(2.3))
dbFetch(iris_result)
#> Sepal.Length Sepal.Width Petal.Length Petal.Width Species
#> 1 6.3 3.3 6.0 2.5 virginica
#> 2 7.2 3.6 6.1 2.5 virginica
#> 3 5.8 2.8 5.1 2.4 virginica
#> 4 6.3 3.4 5.6 2.4 virginica
#> 5 6.7 3.1 5.6 2.4 virginica
#> 6 6.7 3.3 5.7 2.5 virginica
dbBind(iris_result, list(3))
dbFetch(iris_result)
#> [1] Sepal.Length Sepal.Width Petal.Length Petal.Width Species
#> <0 rows> (or 0-length row.names)
dbClearResult(iris_result)
# Executing the same statement with different values at once
iris_result <- dbSendStatement(con, "DELETE FROM iris WHERE [Species] = $species")
dbBind(iris_result, list(species = c("setosa", "versicolor", "unknown")))
dbGetRowsAffected(iris_result)
#> [1] 100
dbClearResult(iris_result)
nrow(dbReadTable(con, "iris"))
#> [1] 50
dbDisconnect(con)
# Arrow flow:
con <- dbConnect(RSQLite::SQLite(), ":memory:")
dbWriteTable(con, "iris", iris)
# Using the same query for different values
iris_result <- dbSendQueryArrow(con, "SELECT * FROM iris WHERE [Petal.Width] > ?")
dbBindArrow(
iris_result,
nanoarrow::as_nanoarrow_array_stream(data.frame(2.3, fix.empty.names = FALSE))
)
as.data.frame(dbFetchArrow(iris_result))
#> Sepal.Length Sepal.Width Petal.Length Petal.Width Species
#> 1 6.3 3.3 6.0 2.5 virginica
#> 2 7.2 3.6 6.1 2.5 virginica
#> 3 5.8 2.8 5.1 2.4 virginica
#> 4 6.3 3.4 5.6 2.4 virginica
#> 5 6.7 3.1 5.6 2.4 virginica
#> 6 6.7 3.3 5.7 2.5 virginica
dbBindArrow(
iris_result,
nanoarrow::as_nanoarrow_array_stream(data.frame(3, fix.empty.names = FALSE))
)
as.data.frame(dbFetchArrow(iris_result))
#> [1] Sepal.Length Sepal.Width Petal.Length Petal.Width Species
#> <0 rows> (or 0-length row.names)
dbClearResult(iris_result)
# Executing the same statement with different values at once
iris_result <- dbSendStatement(con, "DELETE FROM iris WHERE [Species] = $species")
dbBindArrow(iris_result, nanoarrow::as_nanoarrow_array_stream(data.frame(
species = c("setosa", "versicolor", "unknown")
)))
dbGetRowsAffected(iris_result)
#> [1] 100
dbClearResult(iris_result)
nrow(dbReadTable(con, "iris"))
#> [1] 50
dbDisconnect(con)