Return type notation (RTN) in bounds and where-clauses
- Feature Name:
return_type_notation
- Start Date: 2024-06-04
- RFC PR: rust-lang/rfcs#3654
- Tracking Issue: rust-lang/rust#109417
Summary
Return type notation (RTN) gives a way to reference or bound the type returned by a trait method. The new bounds look like T: Trait<method(..): Send>
or T::method(..): Send
. The primary use case is to add bounds such as Send
to the futures returned by async fn
s in traits and -> impl Future
functions, but they work for any trait function defined with return-position impl trait (e.g., where T: Factory<widgets(..): DoubleEndedIterator>
would also be valid).
This RFC proposes a new kind of type written <T as Trait>::method(..)
(or T::method(..)
for short). RTN refers to “the type returned by invoking method
on T
”.
To keep this RFC focused, it only covers usage of RTN as the Self
type of a bound or where-clause. The expectation is that, after accepting this RFC, we will gradually expand RTN usage to other places as covered under Future Possibilities. As a notable example, supporting RTN in struct field types would allow constructing types that store the results of a call to a trait -> impl Trait
method, making them more suitable for use in public APIs.
Examples of RTN usage allowed by this RFC include:
where <T as Trait>::method(..): Send
- (the base syntax)
where T: Trait<method(..): Send>
- (sugar for the base syntax with the (recently stabilized) associated type bounds)
where T::method(..): Send
- (sugar where
Trait
is inferred from the compiler)
- (sugar where
dyn Trait<method(..): Send>
- (
dyn
types take lists of bounds)
- (
impl Trait<method(..): Send>
- (…as do
impl
types)
- (…as do
Motivation
Rust now supports async fns and -> impl Trait
in traits (acronymized as AFIT and RPITIT, respectively), but we currently lack the ability for users to declare additional bounds on the values returned by such functions. This is often referred to as the Send bound problem, because the most acute manifestation is the inability to require that an async fn
returns a Send
future, but it is actually more general than both async fns and the Send
trait (as discussed below).
The send bound problem blocks an interoperable async ecosystem
To create an interoperable async ecosystem, we need the ability to write a single trait definition that can be used across all styles of async executors (workstealing, thread-per-core, single-threaded, embedded, etc). One example of such a trait is the Service
trait found in the tower
crate, which defines a generic “service” that can process a Request
and yield some Response
. The current Service
trait is defined with a custom poll
method and explicit usage of Pin
, but the goal is to be able to define Service
like so:
trait Service<Request> {
type Response;
// Invoke the service.
async fn call(&self, req: Request) -> Self::Response;
}
This Service
trait can then be used to define generic middleware that operate over any service. For example, we could write a LogService
that wraps any service and emit logs to stderr:
pub struct LogService<S>(S);
impl<S, R> Service<R> for LogService<S>
where
S: Service<R>,
R: Debug,
{
type Response = S::Response;
async fn call(&self, request: R) -> S::Response {
eprintln!("{request:?}");
self.0.call(request).await
}
}
This definition today works only in some executors
Defining Service
as shown above works fine in a thread-per-core or single-threaded executor, where spawned tasks do not move between threads. But it can encounter compilation errors with a work-stealing executor, such as the default Tokio executor, where all spawned futures must be Send
. Consider this example:
async fn spawn_call<S>(service: S) -> S::Response
where
S: Service<(), Response: Send> + Send + 'static,
{
tokio::spawn(async move {
service.call(()).await // <--- Error
}).await
}
This code will not compile because the future returned by invoking S::call(..)
is not known to be Send
:
error: future cannot be sent between threads safely
--> src/lib.rs:6:5
|
6 | / tokio::spawn(async move {
7 | | service.call(()).await // <--- Error
8 | | }).await.unwrap()
| |______^ future created by async block is not `Send`
|
= help: within `{async block@src/lib.rs:6:18: 8:6}`, the trait `Send` is not implemented for `impl Future<Output = <S as Service<()>>::Response>`, which is required by `{async block@src/lib.rs:6:18: 8:6}: Send`
note: future is not `Send` as it awaits another future which is not `Send`
--> src/lib.rs:7:9
|
7 | service.call(()).await // <--- Error
| ^^^^^^^^^^^^^^^^ await occurs here on type `impl Future<Output = <S as Service<()>>::Response>`, which is not `Send`
The only way today to make this code compile is to modify the Service
trait definition to always return a Send
future, like so (and in fact if you try the above example on the playground, you will see the compiler suggests a change like this):
trait SendService<Request>: Send {
type Response;
// Invoke the service.
fn call(
&self,
req: Request,
) -> impl Future<Output = Self::Response> + Send;
}
But this SendService
trait is too strong for use outside a work-stealing setup. This leaves generic middleware like the LogService
struct we saw earlier in a bind: should they use Service
or SendService
? Really, we want a single single Service
trait that can be used in both contexts.
Comparison to an analogous problem with IntoIterator
It is useful to compare this situation with analogous scenarios that arise elsewhere in Rust, such as with associated types. Imagine a function that takes an I: IntoIterator
and which wishes to make use of the returned iterator in a separate thread:
fn into_iter_example<I: IntoIterator>(i: I) {
let iter = i.into_iter();
std::thread::spawn(move || {
iter.next(); // <-- Error!
});
}
This code will also not compile:
error[E0277]: `<I as IntoIterator>::IntoIter` cannot be sent between threads safely
--> src/lib.rs:3:24
|
3 | std::thread::spawn(move || {
| ------------------ ^------
| | |
| _____|__________________within this `{closure@src/lib.rs:3:24: 3:31}`
| | |
| | required by a bound introduced by this call
4 | | iter.next();
5 | | });
| |_____^ `<I as IntoIterator>::IntoIter` cannot be sent between threads safely
...
help: consider further restricting the associated type
|
1 | fn into_iter_example<I: IntoIterator>(i: I)
| where <I as IntoIterator>::IntoIter: Send {
|
There are two ways the function into_iter_example
could be made to compile:
- Modify the
IntoIterator
trait to require that the target iterator type is alwaysSend
- Modify the function to have a where-clause
I::IntoIter: Send
.
The 1st option is less flexible but more convenient; it is inappropriate in a highly generic trait like IntoIterator
which is used in a number of scenarios. It would be fine for an application- or library-specific crate that is only used in narrow circumstances. Referring back to the compiler’s error message, you can see that an additional where-clause is exactly what it suggested.
This is the challenge: Rust does not currently have a way to write the equivalent of where I::IntoIter: Send
for the futures returned by async fn
(or the results of -> impl Trait
methods in traits). This creates a gap between the first Service
example, which can only be resolved by modifying the trait, and IntoIterator
, which can be resolved either by modifying the trait or by adding a where-clause to the function, whichever is more appropriate.
Return type notation (RTN) permits the return type of AFIT and RPITIT to be bounded, closing the gap
The core feature proposed in this RFC is the ability to write a bound that bounds the return type of an AFIT/RPITIT trait method. This allows the spawn_call
definition to be amended to require that call()
returns a Send
future:
async fn spawn_call<S>(service: S) -> S::Response
where
S: Service<
(),
Response: Send,
// "The method `call` returns a `Send` future."
call(..): Send,
> + Send + 'static,
{
tokio::spawn(async move {
service.call(()).await // <--- OK!
}).await
}
A variant of the proposal in this RFC is already implemented, so you can try this example on the playground and see that it works.
RTN is useful for more than Send
bounds
RTN is useful for more than Send
bounds. For example, consider the trait Factory
, which contains a method that returns an impl Iterator
:
trait Factory {
fn widgets(&self) -> impl Iterator<Item = Widget>;
}
Now imagine that there are many Factory
implementations, but only some of them return iterators that support DoubleEndedIterator
.
Making use of RTN, we can write a “reverse factory” that can be used on precisely those instances (playground):
struct ReverseWidgets<F: Factory<widgets(..): DoubleEndedIterator>> {
factory: F,
}
impl<F> Factory for ReverseWidgets<F>
where
F: Factory<widgets(..): DoubleEndedIterator>,
{
fn widgets(&self) -> impl Iterator<Item = Widget> {
self.factory.widgets().rev()
// ^^^ requires that the iterator be double-ended
}
}
RTN supports convenient trait aliases
The async WG conducted several case studies to test the usefulness of RTN. We found that RTN is very important for using async fn in practice, but we also found that RTN alone can be repetitive in traits that have many methods.
We expect most users in the wild to define “trait aliases” to indicate cases where all methods in a trait are Send
(and perhaps other traits). The (rust-lang supported) trait-variant crate can automate this process. For example, the following code creates a SendService
alias, which is automatically implemented by any type T: Service
where T: Send
and T::call(..): Send
:
#[trait_variant::make(SendService: Send)]
// ----------- ----
// | |
// name of the trait alias |
// |
// additional bound that must be met
// by async or `-> impl Trait` methods
trait Service<Request> {
type Response;
// Invoke the service.
async fn call(&self, req: Request) -> Self::Response;
}
The expansion of this macro use RTN to create a trait that both (1) implies a Service
whose methods return Send
futures and (2) which is automatically implemented for all Service
types whose methods are Send
(this expansion could be altered to make use of true trait aliases once those are stabilized):
trait SendService<R>: // a `SendService` is...
Service< // ...a `Service`...
R,
call(..): Send, // ...where `call` returns
// a `Send` future...
> +
Send // ...and which is itself `Send`.
{}
impl<S, R> SendService<R> for S
where
S: Send + Service<R, call(..): Send>,
{}
The function spawn_call
can then be written as follows:
async fn spawn_call<S>(service: S) -> S::Response
where
S: SendService<(), Response: Send> + 'static,
// ^^^^^^^^^^^ use the alias
{
tokio::spawn(async move {
service.call(()).await // <--- OK!
}).await
}
This trait alias setup means that users (and middleware like LogService
) always write impls for Service
. Functions that consume a service can choose to use SendService
if they require Send
bounds. Without RTN, the best that can be done is to have two distinct traits, which forces middleware like LogService
to choose which they will implement (as previously discussed).
(This RFC is not advocating for a particular naming convention. We use Service
and SendService
to make clear that there is a base trait to which additional bounds are being added. For Tower specifically, based on discussion with Tokio team, the most likely final setup is to call the base trait LocalService
and the Send
-variant simply Service
; this would mean that users would implement LocalService
always. The future directions includes some ways to make the LocalService
/Service
convention more transparent for users.)
Expected usage pattern: “Trait aliases” for the common cases, explicit RTN for the exceptions
Our expectation is that most traits will make use of trait_variant
to define trait aliases like SendService
. This provides the best experience for trait consumers, since they can conveniently bound all methods in the trait at once.
However, even when such an alias exists, there are times when trait consumers may not want to use them. Consider a trait like Backend
:
#[trait_variant::make(SendBackend: Send)]
trait Backend {
async fn get(&self, key: Key) -> Value;
async fn put(&self, key: Key, value: Value);
}
While SendBackend
may be convenient most of the time, it is also stricter than necessary for functions that only invoke one of get
or put
. Now consider two backend types, B1
and B2
, where B1
always returns Send
futures, but only B2::put(..)
operation on B2
is Send
, because B2::get(..)
makes use of Rc
for caching purposes. In that case, a generic function with a bound like Backend<put(..): Send>
could be used on both B1
and B2
.
Design axioms
- Minimal bounds in trait defintion, consumers apply the bounds they need. Rust’s typical pattern is to have traits with minimal bounds (e.g.,
IntoIterator
declares only that itsIntoIter
type will be anIterator
) and then to have consumers apply additional bounds when they need them (e.g., thatIntoIter: DoubleEndedIterator
). This makes for widely reusable traits. - Just say “async fn”. We want simply writing
async fn foo(&self)
to result in a maximally reusable trait (just as it results in a maximally reusable free function today); “best practice” trait definitions should still be simple to read and should not limit the trait’s consumers or future uses. - Support both async fn and
-> impl Trait
. The most pressing user need is for send bounds on async fns, but we want to add a primitive that will also address the limitations of-> impl Trait
methods (both in traits and, eventually, outside of them).
Guide-level explanation
Async functions can be used in many ways. The most common configuration is to use a work stealing setup, in which spawned tasks may migrate between threads. In this case, all futures have to be Send
to ensure that this migration is safe. But many applications prefer to use a thread-per-core setup, in which tasks, once spawned, never move to another thread (one important special case is where the entire application runs on a single thread to begin with, common in embedded environments but also in e.g. Google’s Fuchsia operating system).
For the most part, async functions today do not declare whether they are Send
explicitly. Instead, when a future F
is spawned on a multithreaded executor, the compiler determines whether it implements Send
. So long as F
results from an async fn
that only calls other async fn
s, the compiler can analyze the full range of possible executions. But there are limitations, especially around calls to async trait methods like f.method()
. If the type of f
is either a generic type or a dyn
trait, the compiler cannot determine which impl will be used and hence cannot analyze the function body to see if it is Send
. This can result in compilation errors.
Example: HealthCheck
and SendHealthCheck
For traits whose futures may or may not be Send
, the recommend pattern is to leverage the (rust-lang provided) trait_variant
crate, which can automatically declare two versions of the trait. The default trait, HealthCheck
, returns a future from each method; the alias SendHealthCheck
is used to indicate those cases where all futures are known to be Send
:
#[trait_variant::make(SendHealthCheck: Send)]
trait HealthCheck {
async fn check(&mut self, server: &Server) -> bool;
async fn shutdown(&mut self, server: &Server);
}
Most code can reference HealthCheck
directly
The HealthCheck
trait can now be implemented normally.
This includes cases, like DummyCheck
, where the returned future will always be Send
:
struct DummyCheck;
impl HealthCheck for DummyCheck {
async fn check(&mut self, server: &Server) -> bool {
true
}
async fn shutdown(&mut self, server: &Server) {}
}
But also cases like LogCheck
, which return a Send
future if and only if their generic type argument returns a Send
future:
struct LogCheck<HC: HealthCheck> {
hc: HC,
}
impl<HC: HealthCheck> HealthCheck for LogCheck<HC> {
async fn check(&mut self, server: &Server) -> bool {
self.hc.check(server).await
}
async fn shutdown(&mut self, server: &Server) {
self.hc.shutdown(server).await
}
}
Generic code that needs Send
can use SendHealthCheck
When writing generic functions that spawn tasks, invoking async functions can lead to compilation failures:
fn start_health_check<HC>(health_check: H, server: Server)
where
HC: HealthCheck + Send + 'static,
{
tokio::spawn(async move {
while health_check.check(&server).await {
// ----- Error: Returned future must
// be Send because this code runs.
tokio::time::sleep(Duration::from_secs(1)).await;
}
emit_failure_log(&server).await;
server.shutdown().await;
// ----- Error: Returned future must be Send
// because this code runs.
});
}
The problem is that tokio::spawn
requires a Send
future,
but the future returned by health_check.check
is not guaranteed to be Send
.
To address this, refall that the HealthCheck
trait also used the trait_variant::make
macro to create an alias, SendHealthCheck
, that required all futures to be Send
:
#[trait_variant::make(SendHealthCheck: Send)]
trait HealthCheck {...}
Therefore you can change the HC: HealthCheck
bound to HC: SendHealthCheck
,
the alias that requires all of its futures to be Send
:
fn start_health_check<HC>(health_check: H, server: Server)
where
HC: SendHealthCheck + 'static,
{
...
}
Bounding specific methods
Trait aliases like SendHealthCheck
require all the async methods in the trait to return a Send
future.
Sometimes that is too strict.
For example, the following function spawns a task to shutdown the server:
fn spawn_shutdown<HC>(health_check: H, server: Server)
where
HC: SendHealthCheck + 'static,
// --------------- stricter than necessary
{
tokio::spawn(async move {
server.shutdown().await;
});
}
Because spawn_shutdown
only invokes shutdown
, using SendHealthCheck
is stricter than necessary.
It may be that there are types where the check
method does not return a Send
future
but shutdown
does.
In this case, you can write a bound that specifically applies to the future returned by the shutdown()
method, like so:
fn spawn_shutdown<HC>(health_check: H, server: Server)
where
HC: HealthCheck<shutdown(..): Send> + Send + 'static,
// ------------------ "just right"
{
tokio::spawn(async move {
server.shutdown().await;
});
}
The shutdown(..)
notation acts like an associated type referring to the return type of the method.
The bound HC: HealthCheck<shutdown(..): Send>
indicates that the shutdown
method,
regardless of what arguments it is given,
will return a Send
future.
These bounds do not have to be written in the HealthCheck
trait, it could also be written as follows:
fn spawn_shutdown<HC>(health_check: H, server: Server)
where
HC: HealthCheck + Send + 'static,
HC::shutdown(..): Send,
Guidelines and best practices
Authoring async traits
When defining an async trait (a trait with async functions), best practice is to define a “send variant” with the trait_variant
crate:
#[trait_variant::make(SendMyTrait: Send)]
trait MyTrait {
async fn method1(&self);
async fn method2(&self);
}
Defining a “send alias” in this way has advantages for users of your trait:
- Referencing
T: SendMyTrait
is shorter than using RTN if there are multiple functions- (compare to
T: Send + Mytrait<method1(..): Send, method2(..): Send>
)
- (compare to
- Referencing
T: SendMyTrait
is more forwards compatible:- If you add a new method to your trait (with a default impl), all users of the send alias will be able to call this new method. Users that have named individual methods will not (on the flip side)
But defining a “send alias” in this way comes with obligations for you:
- If you add a new default method to your trait, it must be “Send-preserving” (meaning that it will be
Send
if other functions returnSend
futures).- Why? If there is an existing function that requires
T: SendMyTrait
for some typeT
, then this must remain true even whenMyTrait
grows a new (defaulted) method, or else you will have broken your downstream clients. - On the flip side, if you don’t define an alias, you can add new defaulted methods that are not Send. This won’t break downstream crates but neither will they be able to use them.
- Why? If there is an existing function that requires
Using async traits
When using a trait MyTrait
that defines a sendable alias SendMyTrait
…
- Implement
MyTrait
directly. Your type will implementSendMyTrait
automatically if appropriate. - Prefer
T: SendMyTrait
over a more explicit, method-by-method bound likeT: MyTrait<method1(..): Send, method2(..): Send>
unless you specifically want to “opt-out” from requiring a particular method isSend
.- Using the alias is shorter, but it also means that if the trait grows new default methods, they will be included in the alias by default, allowing you to call them.
Reference-level explanation
Background and running examples
The Widgets
trait
Throughout this section we will make use of the Widgets
trait as a simple running example.
trait Widgets {
fn widgets(&self) -> impl Iterator<Item = Widget>;
}
Background: desugaring to associated types
Per RFC 3425, the return-position impl Trait
types that appear in Widgets
and Log
are desugared by the compiler into generic associated types, roughly as follows:
trait Widgets { // desugared
type $Widgets<'a>: Iterator<Item = u32>;
fn widgets(&self) -> Self::$Widgets<'_>;
}
These desugarings are not exposed to users, so the associated types $Widgets
and $Log
are not directly nameable,
but we will use it to define the semantics of Return Type Notation.
Grammar
Return type notation
Return Type Notation extends the type grammar roughly as follows,
where ?
indicates an optional nonterminal and ,*
indicates a comma
separated list. These changes permit where T::method(..): Send
.
Type = i32
| u32
| ...
| Type "::" AssociatedTypeName
| "<" Type as TraitName Generics? ">" "::" AssociatedTypeName
| ...
| Type "::" MethodName "(" ".." ")" // <--- new
| "<" Type as TraitName Generics? ">" "::" MethodName "(" ".." ")" // <--- new
Generics = "<" Generic,* ">"
Generic = Type | Lifetime | ...
Examples: given the Widgets
trait defined earlier in this section…
T::widgets(..)
is a valid RTN that refers to “widgets
invoked with any arguments”<T as Widgets>::widgets(..)
is a valid RTN that refers to “widgets
invoked with any arguments”
To support the ()
notation for Fn
trait bounds (e.g., T: Fn(u8)
), the Rust grammar already permits T::method_name(T0, T1)
to be parsed as a type (example), but those examples will result in a compiler error in later phases. This RFC requires them to be interpreted as RTN types instead.
Associated type bounds
Associated type bounds are a recently stabilized feature that permits T: Trait<Type: Foo>
to be used to bound an associated type T::Type
. The grammar for these trait references is extended to support RTN notation in this position:
TraitRef = TraitName "<" Generic,* AssociatedBound ">"
AssociatedBound = Identifier "=" Generic
| Identifier ":" TraitRef // (from RFC #2289)
| Identifier "(" ".." ")" ":" TraitRef // <--- new
Examples: given the Widgets
trait defined earlier in this section…
T: Widgets<widgets(..): Send>
is a valid associated type bound
RTN bounds are internally desugared to an RTN in a standalone where-clause,
so e.g. where T: Widgets<widgets(..): Send>
becomes where <T as Widgets>::widgets(..): Send
.
We will not consider them further in this section.
Where RTN can be used (for now)
Although RTN types extend the type grammar, the compiler will not allow them to appear in all positions. Positions where RTN is currently supported include:
- As a standalone type, RTN can only be used as the
Self
type of a where-clause, e.g.,where W::widgets(..): Send
. - As an associated type bound, RTN can be used where associated type bounds appear, e.g.,
trait SendWidgets: Widgets<widgets(..): Send>
fn foo<W: Widgets<widgets(..): Send>>()
dyn Widgets<widgets(..): Send>
impl Widgets<widgets(..): Send>
Nonnormative: The current set of allowed locations correspond to places where generics on the method (e.g.,
widgets(..)
) can be converted into higher-ranked trait bounds, as described in the next section. We expect future RFCs to extend the places where RTN can appear. These RFCs will detail how to manage generic parameters in those functions. The expectation is that the behavior will generally match “whatever'_
would do”. For example,let w: W::widgets(..) = ...
would be equivalent tolet w: W::$Widgets<'_> = ...
.
Converting to higher-ranked trait bounds
The method named in an RTN type may have generic parameters (e.g., fn widgets<'a>(&'a self)
has a lifetime parameter 'a
). Because RTN locations are limited to where-clauses and trait bounds in this RFC, these parameters can always be captured in a for
to form a higher-ranked trait bound.
The semantics are illustrated by the following examples which desugar references to widgets(..)
into the (generic) associated type $Widgets<'_>
described earlier:
<T as Widgets>::widgets(..): Send
where for<'a> <T as Widgets>::$Widgets<'a>: Send
T: Widgets<widgets(..): Send
- Equivalent to
where T: Widgets<for<'a> $Widgets<'a>: Send>
- Equivalent to
impl Widgets<widgets(..): Send>
impl for<'a> Widgets<$Widgets<'a>: Send>
dyn Widgets<widgets(..): Send
dyn for<'a> Widgets<$Widgets<'a>: Send>
- But note that async fn and RPITIT are not yet dyn-safe; this is forward looking.
While all of these examples are using lifetimes, there is ongoing work to support higher-ranked trait bounds that are generic over types, and the expectation is that RTN will be extended to work over generic types and constants when possible.
How this is implemented
The examples above illustrate the semantics but do not make clear how RTN can be implemented in the compiler. A RTN bound like widgets(..)
is implemented internally via unification. To keep the RFC focused on how RTN feels to users, we defer a detailed description to reference material and a future stabilization report.
RTN only applies to AFIT and RPITIT methods
Although conceptually RTN could be used for any trait method, we choose to limits its use to async fn
and other methods that directly return an -> impl Trait
. This limitation can be lifted in the future as we gain more experience.
- RTN may refer to the following examples:
async fn method(&self)
fn method(&self) -> impl Iterator<Item = u32>
- RTN may not presently refer to the following examples:
fn method(&self) -> u32
fn method(&self) -> Option<impl Iterator<Item = u32>>
Drawbacks
Confusion about future type vs awaited type
When writing an async function, the future is implicit:
trait HealthCheck {
async fn check(&mut self, server: Server);
}
It could be confusing that HC::check(..)
refers to a future and not the ()
type that results from await. This is however consistent with expressions (i.e., let c = hc.check(..)
will yield a future, not the result).
Automatic impl of Send
based on current method definition
Implementations of async functions automatically expose whether they are Send
or not, limiting their future (semver-compatible) evolution. E.g., the following impl…
impl HealthCheck for MyType {
async fn check(&mut self, server: Server) {
return;
}
}
…could not in the future be modified to reference an Rc
internally. This is different from ordinary functions which can add references to Rc
transiently without an issue.
The fact that the Send
requirement limits what values async functions can internally reference is not new, however, nor specific to trait functions.
It is a consequence of existing precedent:
- Async functions desugar to returning an
impl Future
value. - Values are automatically
Send
based on their contents.
Rationale and alternatives
What is the impact of not doing this?
The Async Working Group has performed five case studies around the use of async functions in trait, covering usage in the following scenarios:
- configuration and parameterization in the AWS SDK, such as providing a generic credentials provider (link);
- redefining the
Service
trait defined bytower
(link); - usage in the Fuchsia Netstack3 socket handler developed at Google (link);
- usage in an internal Microsoft application (link);
- usage in the embedded runtime
embassy
, which targets simple processors without an operating system (link).
We found that all of these key use cases required a way to handle send bounds, with only two exceptions:
embassy
, where the entire process is single-threaded (and henceSend
is not important),- Fuchsia, where the developers at first thought they needed
Send
bounds, but ultimately found they were able to refactor so that spawns did not occur in generic code (link to the relevant section).
From this we conclude that offering async functions in traits without some solution to the “send bound problem” means it will not be usable for most Rust developers. The Fuchsia case also provides evidence that, even when workarounds exist, they are not obvious to Rust developers.
For most of the cases above, return-type notation as described in this RFC worked well. The major exception was the Microsoft application, which included a trait with many methods. Since doing this study we have developed the trait-variant crate and thus the ability to define “send aliases”, as described in this RFC, which addresses this ergonomic gap.
How did you settle on this particular design?
The goal of this RFC is offer a
- flexible primitive that can support many use cases (including constructing aliases)
- and which is ergonomic enough to be useful directly when needed.
The primitive alone doesn’t fill all needs as it doesn’t address the need to create aliases,
but it provides the means for the #[trait_variant::make]
procedural macro to be written as a stable crate;
in the future providing a more ergonomic syntax – such as trait transformers – for “all async functions return send futures” may be worthwhile.
What are cases where that flexibility is useful?
Versus aliases that always bound every method, RTN can be used to
- bound individual methods
- introduce bounds for traits other than
Send
.
As described in the motivation, bounding individual methods allows for greater reuse. For functions that only make use of a subset of the methods in a trait, RTN can be used to create a “maximally reusable” signature.
What other syntax options were considered?
The lang team held a design meeting reviewing RTN syntax options and covering the pros/cons for each of them in detail. The document also includes a detailed evaluation and recommendations.
The document reviewed the following designs overall:
Option | Bound |
---|---|
StatusQuo | D: Database<items(): DoubleEndedIterator> |
DotDot | D: Database<items(..): DoubleEndedIterator> |
Return | D: Database<items::return: DoubleEndedIterator> |
Output | D: Database<items::Output: DoubleEndedIterator> |
Fn | D: Database<fn items(): DoubleEndedIterator> |
FnDotDot | D: Database<fn items(..): DoubleEndedIterator> |
FnReturn | D: Database<fn items::return: DoubleEndedIterator> |
FnOutput | D: Database<fn items::Output: DoubleEndedIterator> |
We briefly review the key arguments here:
- “StatusQuo”:
D: Database<items(): DoubleEndedIterator>
- This notation is more concise and feels less heavy-weight. However, we expect users to primarily use aliases; also, the syntax “feels” surprising to many users, since Rust tends to use
..
to indicate elided items. The biggest concern here is a potential future conflict. If we (a) extend the notation to allow argument types to be specified (as described in the future possibilities section) AND (b) support some kind of variadic arguments, thenD::items()
would most naturally indicate “no arguments”.
- This notation is more concise and feels less heavy-weight. However, we expect users to primarily use aliases; also, the syntax “feels” surprising to many users, since Rust tends to use
- “Return”:
D: Database<items::return: DoubleEndedIterator>
- This notation avoids looking like a function call. Many team members found it dense and difficult to read. While intended to look more like an associated type, the use of a lower-case keyword still makes it feel like a new thing. The syntax does not support future extensions (e.g., specifying the value of argument types).
- “Output”:
D: Database<items::Output: DoubleEndedIterator>
(see this blog post for details)- This reuses associated types but, as both the function and future traits define an
Output
associated type, raises the potential for confusion about whether this notation means “the future that gets returned” or “the result of the future”.
- This reuses associated types but, as both the function and future traits define an
- “FnDotDot” and friends:
D: Database<fn items(..): DoubleEndedIterator>
- This notation was deemed too close to
fn
pointer types, particularly in stand-alone where-clauses.
- This notation was deemed too close to
Why not use typeof
, isn’t that more general?
The compiler currently supports a typeof
operation as an experimental feature (never RFC’d). The idea is that typeof <expr>
type-checks expr
and evaluates to the result of that expression. Therefore typeof 22_i32
would be equivalent to i32
, and typeof x
would be equivalent to whatever the type of x
is in that context (or an error if there is no identifier x
in scope).
It might appear that typeof
can be used in a similar way to RTN, but in fact it is significantly more complex. Consider our first example, the HealthCheck
trait:
trait HealthCheck {
async fn check(&mut self, server: Server);
}
and a function bounding it
fn start_health_check<H>(health_check: H, server: Server)
where
H: HealthCheck + Send + 'static,
H::check(..): Send, // <--- How would we write this with `typeof`?
To write the above with typeof
, you would do something like this
fn dummy<T>() -> T { panic!() }
fn start_health_check<H>(health_check: H, server: Server)
where
H: HealthCheck + Send + 'static,
for<'a> typeof H::check(
dummy::<&'a mut H>(),
dummy::<Server>(),
): Send,
Alternatively, one could write something like this
fn start_health_check<H>(health_check: H, server: Server)
where
H: HealthCheck + Send + 'static,
typeof {
let hc: &'a mut H;
let s: Server;
H::check(hc, s)
}: Send,
Note that we had to supply a callable expression (even if it will never execute), so we can’t directly talk about the types of the arguments provided to H::check
, instead we have to use the dummy
function to produce a fake value of the type we want or introduce dummy let-bound variables.
Clearly, typeof
on its own fails the “ergonomic enough to use for simple cases” threshold we were shooting for. But it’s also a significantly more powerful feature that introduces a lot of complications. We were able to implement a minimal version of RTN in a few days, demonstrating that it fits relatively naturally into the compiler’s architecture and existing trait system. In contrast, integrating typeof
would be rather more complicated. To start, we would need to be running the type checker in new contexts (e.g., in a where clause) at large scale in order to normalize a type like typeof H::check(x, y)
into the final type it represents.
With typeof
, one would also expect to be able to reference local variables and parameters freely. This would bring Rust full on into dependent types, since one could have a variable whose type is something like typeof x.method_call()
, which is clearly dependent on the type of x
. This isn’t an impossible thing to consider – and indeed the same could be true of some extensions of RTN, if we chose to permit naming closures or other local variables – but it’s a significant bundle of work to sort it out.
Finally, while typeof
clearly is a more general feature, it’s not clear how well motivated that generality is. The main use cases we have in mind are more naturally and directly handled by RTN. To justify typeof
, we’d want to have a solid rationale of use cases.
Why not make all futures Send
?
The #[async_trait]
macro solves the send bounds problem by forcing the trait to declare up front whether it will require send or not. This is required by the desugaring that async-trait uses. For many users, this is a fine solution, since they always work with sendable futures. But there are a significant set of users that do not want send bounds, either because they are in an embedded context or because they are using a thread-per-core architecture. The widely used tokio runtime, for example, can be configured to either use work-stealing (which requires Send
futures) or to be a single-threaded executor (which does not). The glommio
executor does not require Send
bounds on futures because it never moves tasks between threads. The Fuchsia project makes extensive use of single-threaded executors in their runtime, and hence they do not require Send
bounds. The embassy
runtime targets embedded environments that only have a uniprocessor and which have no need for Send
bounds. All of these environments are disadvantaged by defaults that require send bounds.
One of our design goals with async-trait is to support core interoperability traits for things like reading, writing, HTTP, etc. The whole point of these traits is to be usable across many runtimes. If those traits forced Send
bounds, that would be unnecessarily limiting, which would lead to users of non-Send-requiring runtimes to avoid them. If the traits did NOT force Send
bounds, they would not be compatible with work stealing runtimes (the most popular choice) unless there was some additional feature to “opt-in” to needing send bounds – which is exactly the gap RTN is looking to close.
Why not create an associated type that represents the return type?
Early on in our design work, we expected to simply create an associated type within the trait to represent the return type. For example this trait:
trait Factory {
fn widgets(&self) -> impl Iterator<Item = Widget>;
}
might have been desugared as follows:
trait Factory {
type widgets<'a>: Iterator<Item = Widget>; // <--- implicitly introduced
fn widgets(&self) -> Self::widgets<'_>;
}
This would mean that users could write a bound on F::widgets
to bound the return type of widgets
fn use_factory<F>()
where
F: Factory,
for<'a> F::widgets<'a>: Send,
{}
We encountered a number of problems with this design.
If the name is implicit, what name should we use?
The most impmediate problem with this proposal was trying to decide what name to use.
Using Widgets
(capitalized) feels arbitrary and there is no precedent within Rust for automatically creating names with different case conventions in this way.
Using the same name as the method (widgets
) results in an associated type that does not follow Rust’s naming conventions.
It also introduces the potential for a shadowing conflict as today it is allowed to have methods and associated types with the same name:
trait Example {
type method;
fn method();
}
Why not use an explicit name?
To address the challenge of an implicit name, we could allow people to explicitly annotate a name:
trait Factory {
#[associated_return_type(Widgets)]
fn widgets(&self) -> impl Iterator<Item = Widget>;
}
However, this has some downsides:
- It goes against our design axiom that people should be able to “just write
async fn
”. Now for maximum reuse the trait body requires extra annotations. - It means that trait authors must remember to add such an annotation or else their consumers will be limited in their ability to use the trait.
Trait authors should expect a stream of PRs adding this annotation to most every
async fn
in their trait.
What generic parameters should this associated type have?
Regardless of how it is named, it’s not obvious what set of generic type parameters the function should have. In our example, there was only a single lifetime, but in other cases, functions can have a large number of implicit parameters. This occurs with anonymous lifetimes but also with argument-position impl Trait. We have so far avoided committing to a particular order or way of specifying those implicit parameters explicitly, but desugaring to a (user-visible) generic associated type would force us to make a commitment. Example:
trait Consumer {
fn consume_elements(&mut self, context: &Context, widgets: &mut impl Iterator<Item = Widget>);
}
How many generic type parameters should consume_elements
have, and in what order? There are at least three anonymous lifetimes mentioned, and one anonymous type parameter (the impl Iterator
), but that’s not enough to answer the question. First off, without seeing the definitions of Context
and Widget
, we do not know if they have lifetime parameters (although it’s discouraged, Rust permits you to elide lifetime parameters from structs in function declarations). Second, all of the lifetime parameters we see appear in “variant” positions, and so we could get away with a single GAT parameter (simpler). But if (for example) Context
were defined like so:
struct Context<'a> {
x: &'a mut Vec<&'a u32>
}
then the function would require a separate lifetime parameter for Context
. Committing to specific rules here limits us as language designers, but it’s also a demands a deep understanding of the compiler and its desugaring to be successfully used and explained.
Why not use a named associated type that represents the zero-sized method type?
In the previous question, we mentioned that every function in Rust has a unique zero-sized type associated with it, including methods. One natural desugaring then might be to introduce an associated type that represents the method type itself. One could then use the Output
associated type to talk about the return type. Given the Factory
trait we saw before:
trait Factory {
fn widgets(&self) -> impl Iterator<Item = Widget>;
}
one might then take “any factory whose widgets iterator is sendable” like this:
fn use_factory<F>()
where
F: Factory,
for<'a> F::widgets<'a>::Output: Send,
// -------------- ------
// type of the return type
// method
{}
This approach has an appealing generality to it, and it opens up some interesting possibilities. For example, one might consider a trait Const
that is implemented by all function types which are const fn
(discussed in withoutboats’s const as an auto trait blog post). Users could then write for<'a> F::widgets<'a>: Const
to declare that the method is a const method. However, it’s rather unergonomic for the common case. It also doesn’t compose well with the associated type bounds notation – i.e., would we write something like F: Factory<widgets::Output: Send>
?
To resolve the ergonomic problems, our exporations of this future wound up proposing some form of sugar to reference the Output
type – for example, being able to write F::widgets(..): Send
. But that is precisely what this RFC proposes! Indeed, in the future possibilities section of the RFC, we discuss the possibility of giving users some way to name the type of the widgets
method itself, and not just its return type.
So why not just start with this more general approach, if we think it might be a useful extension? First, it’s not clear if it would be useful. We don’t have to solve the question of “const as an auto trait” in order to address the send bounds problem. Second, this approach suffers from some of the complications mentioned in the previous question, such as needing to specify the order of arguments for anonymous lifetime or impl trait parameters, and having to deal with existing traits that may shadow the desired name. Lacking a strong motivation to have this much generality, it’s hard to tell how to resolve those questions, since we don’t really know where/when this more general form would be used.
Why not make trait_variant
crate magic?
With RTN, the #[trait_variant::make]
macro can be defined in “user space”.
It would also be possible to build it into the stdlib and have it defined “magically” through compiler intrinsics.
This would still allow async traits to be defined that can be used across all executors
(in roughly the same way as we recommend),
but it has several downsides.
First, it makes the stdlib more special, which works against the goals of Rust.
Second, it covers far fewer use cases than RTN: it cannot be used to express specifically which methods must be Send
, nor can it be used for traits that were not “pre-imagined” by the trait author.
Why not Send trait transformers?
Trait transformers are a proposal to have “modifiers” on trait bounds that produce a derived version of the trait. For example, T: async Iterator
might mean “T implements a version of Iterator
where the next
function is async
”. Following this idea, one can imagine T: Send HealthCheck
to mean “implement a version of HealthCheck
where every async fn returns a Send
future”. This idea is an ergonomic way to manage traits that have a lot of async functions, as came up in the Microsoft case study.
It seems likely that trait transformers would be more ergonomic than RTN in practice, since they easily accommodate traits with many async functions. However, they are less flexible, as the current idea can only encode the case where you want to add the same auto trait to the return type of all async functions, whereas RTN can be used to encode all manner of patterns, as described in the guide-level explanation. Furthermore, trait transformers are a more fundamental extension to Rust than RTN, and their design is tied up in questions of whether we should have other kinds of transformers, such as async
or const
. It is preferable to give time for exploration until we have a better handle on the motivation and use cases so that we can avoid constraining ourselves today in a way that we might not want in the future. In contrast, it’s hard to imagine a future where we don’t want some way to constrain or refer to the return types of individual methods within a trait.
Prior art
C++
C++ has decltype
expressions which give the type of an expression and the type of a declaration, respectively. Some compilers (e.g., GCC) also support typeof
. The drawbacks section listed reasons why we believe typeof
is not a suitable primitive for us to build upon.
Unresolved questions
Does stabilizing T::foo(..)
notation as a standalone type create a confusing inconsistency with -> ()
shorthand?
Unlike a regular associated type, this RFC does not allow a trait bound that specifies the return type of a method, only the ability to put bounds on that return type.
rpjohnst suggested that we may wish to support a syntax like T: Trait<method(..) -> T>
, perhaps in conjunction with specified argument types.
They further pointed out that permitting T::method(..)
as a standalone type could be seen as inconsistent, given that fn foo()
is normally shorthand for -> ()
.
However, not supporting T::method(..)
as a standalone type could also be seen as inconsistent, since normally T: Trait<Bar: Send>
and T::Bar: Send
are equivalent.
Prior to stabilizing the “associated type position” syntax, we should be sure we are comfortable with this.
Future possibilities
Implementing trait aliases
Referring to the Service
trait specifically,
the Tokio developers expressed a preference to name the “base trait” LocalService
and to call the “sendable alias” Service
.
This reflects the way that Tokio uses work-stealing executors by default.
This formulation can be done with trait_variant
like so
#[trait_variant::make(Service: Send)]
trait LocalService<R> {
type Response;
async fn call(&self, request: R) -> Self::Response;
}
However, it carries the downside that users must implement LocalService
and hence must be aware of the desugaring.
It would be nicer if users could choose to implement Service
and then (in so doing) effectively assert that all their async functions are always Send
.
This is not possible today due to the fact that trait-variant
is emulating trait alias functioanlity with a blanket impl and supertraits; this is because true trait alias functionality is not yet stable.
RFC 3437 has proposed an extension to trait aliases that makes them implementable.
The combination of accepting RFC 3437 and stabilizing trait aliases would make these aliases nicer for users as a result.
Permit RTN for more functions
RTN is currently limited to async fn
and -> impl Trait
methods in traits.
But the same syntax could be used for any methods as well as for free functions (e.g., foo(..)
might refer to the return type of fn foo()
).
One area that would be challenging to support is RTN for the return types of closures,
as that would introduce an element of dependent types that would complicate the type checker
(e.g., if let y: x(..)
meant that y
is the type returned from invoking the closure x
, another local variable).
Specifying the values for argument types
The T::method(..): Send
notation we’ve been using so far means
“the return type of method(..)
is Send
, no matter what arguments you provide”.
We could extend this notation to permit specifying the argument types explicitly.
For example, consider the capture
method below, which takes a parameter of type input
:
trait Capture {
async fn capture<T>(&mut self, input: T) -> Option<T>;
}
and now consider a function that invokes capture
with an i32
value:
async fn capture_i32<C: Capture>(mut c: C) {
c.capture(22_i32);
}
Now imagine we wanted to invoke capture
on another thread,
and hence we need a where-clause indicating that the future
returned by capture
will be Send
:
async fn capture_i32<C: Capture + Send + 'static>(mut c: C)
where
/* where-clause for C::check() needed here! */
{
workstealing_runtime::spawn(async move {
c.capture(22_i32);
})
}
There are multiple ways we could write this where-clause, varying in their specificity…
where C::capture(..): Send
– this indicates thatC::capture()
will return aSend
value for any possible set of parameterswhere C::capture(&mut C, i32): Send
– this indicates thatC::capture()
will return aSend
value when invoked specifically on a&mut C
(for theself
parameter) and ani32
where for<'a> C::capture(&'a mut C, i32): Send
– same as the previous rule, but with the higher-ranked'a
written explicitlywhere C::capture::<i32>(..): Send
– this indicates thatC::capture()
will return aSend
value for any possible set of parameters, but with itsT
parameter set explicitly toi32
where C::capture::<i32>(&mut C, i32): Send
– this indicates thatC::capture()
will return aSend
value when itsT
parameter isi32
where for<'a> C::capture::<i32>(&'a mut C, i32): Send
– same as the previous rule, but with the higher-ranked'a
written explicitly
Possible rules for an RTN are as follows:
- Parameter types:
- If parameter types are specified as
..
(e.g.,C::check(..)
orC::check::<i32>(..)
), then the where-clause applies to any possible argument types - If parameter types are given, then the where-clause applies to those specific argument types
- the
self
type must be given explicitly when usingC::check(..)
notation, just as it would in a function call (e.g.,let x = C::check(a, b)
) - elided lifetimes like (e.g.,
C::check(&mut Self, i32)
) are translated to a higher-ranked lifetime (e.g.,for<'a> C::check(&'a mut Self, i32)
) covering the where-clause
- the
- If parameter types are specified as
- Turbofish:
- If turbofish is not used, then the where-clause applies to any possible values for the type parameters
- If turbofish is used, then the values for the type parameters are explicitly specified
Supporting RTN in more locations
To contain the scope, this RFC only describes how RTN types work as the self type of a where-clause.
However, one advantage of RTNs is that they can be extended to work in more places.
This would address the gap that has existed in -> impl Trait
(and hence in async fn
) since it was introduced in RFC 1522,
namely that there is no way to name the return type of such a function explicitly.
This in turn means that given a function like fn odd_integers() -> impl Iterator<Item = u32>
, one cannot name
the iterator type that is returned.
For free functions, best practice today is to use a named return type; once type alias impl trait is stabilized, that will also be an option.
But neither of these are practical for async functions that appear in traits.
RTN as specified in this RFC could be extended with relative ease to appear in any location where '_
is accepted. For example:
trait DataFactory {
async fn load(&self) -> Data;
}
fn load_data<D: DataFactory>(data_factory: D) {
let load_future: D::load(..) = data_factory.load();
// -------
// Expands to `D::load(&'_ D)` -- in this context,
// `'_` means that the compiler will infer a suitable
// value.
await_future(load_future);
}
fn await_future<D: DataFactory>(load_future: D::load(..)) -> Data {
// -------
// As above, expands to `D::load(&'_ D)`, which
// means "for some `_`".
argument.await
}
The most useful place to use RTN, however, is likely struct fields, and in that location we do not accept '_
.
We would therefore have to support specifying the types of arguments in RTN.
That would enable writing structs that wrap the future returned via some trait method:
struct Wrap<'a, D: DataFactory> {
load_future: D::load(&'a D), // the future returned by `D::load`.
}
Dyn support
We expect to make traits with async functions and RPITIT dyn safe in the future. One benefit of the RTN design is that it continues to hide the presence and precise value of the associated types that define the return value of an async function. This means that given HealthCheck
, we can later define the type of the future <dyn HealthCheck>::check(..)
to be anything.
Naming the zero-sized types for a method
Every function and method f
in Rust has a corresponding zero-sized type that uniquely identifies f
. The RTN notation T::check(..)
refers to the return value of check
; conceivably T::check
(without the parens) could be used to refer the type of check
itself. In this case, T::check(..)
can be thought of as shorthand for <T::check as Fn<_>>::Output
.