Further trait features - Rust for C-Programmers

11.4 Further Trait Features

Beyond the basics, Rust’s trait system includes several features that enhance its power and flexibility, such as dynamic dispatch via trait objects and associated types.

11.4.1 Trait Objects for Dynamic Dispatch

So far, we’ve used traits with generics (<T: Trait>), which results in static dispatch. The compiler knows the concrete type at compile time and generates specialized code (monomorphization).

Rust also supports dynamic dispatch using trait objects, specified with the dyn Trait syntax. A trait object is typically a reference (like &dyn Trait or Box<dyn Trait>) that points to some instance of a type implementing Trait. The concrete type is unknown at compile time.

trait Drawable {
    fn draw(&self);
}

struct Button { id: u32 }
impl Drawable for Button {
    fn draw(&self) { println!("Drawing button {}", self.id); }
}

struct Label { text: String }
impl Drawable for Label {
    fn draw(&self) { println!("Drawing label: {}", self.text); }
}

fn main() {
    // Create a vector of trait objects (Box<dyn Drawable>).
    // Box is used for heap allocation because the size of different
    // Drawable types (Button, Label) may vary, and Vec needs elements
    // of a known, uniform size. Box<dyn Drawable> is a 'fat pointer'
    // containing a pointer to the data and a pointer to a vtable.
    let components: Vec<Box<dyn Drawable>> = vec![
        Box::new(Button { id: 1 }),
        Box::new(Label { text: String::from("Submit") }),
        Box::new(Button { id: 2 }),
    ];

    // Iterate and call draw() on each component.
    // The actual method called (Button::draw or Label::draw) is determined
    // at runtime based on the vtable associated with each trait object.
    for component in components {
        component.draw(); // Dynamic dispatch occurs here via vtable lookup.
    }
}

Trade-offs:

Static Dispatch (Generics):
- Performance: Generally faster due to direct function calls (or inlining) after monomorphization.
- Compile-time Knowledge: Requires the concrete type to be known at compile time.
- Code Size: Can lead to larger binaries if the generic code is instantiated for many different types (code bloat).
Dynamic Dispatch (Trait Objects):
- Flexibility: Allows mixing different concrete types that implement the same trait in collections (heterogeneous collections). Concrete type doesn’t need to be known at compile time.
- Performance: Involves runtime overhead due to pointer indirection and vtable lookup to find the correct method address. Usually a minor cost, but potentially significant in performance-critical loops.
- Code Size: Avoids code duplication from monomorphization, potentially leading to smaller binaries if used extensively with many types.

Trait objects are crucial for patterns where you need heterogeneous collections or runtime polymorphism, similar to using interfaces or base class pointers in object-oriented languages. We will explore this further in Chapter 20.

11.4.2 Object Safety

Not all traits can be made into trait objects. A trait must be object-safe. The main rules for object safety are:

The return type of methods cannot be Self. If a method returned Self, the compiler wouldn’t know the concrete size of the type to allocate space for the return value at the call site, as the actual type is hidden behind the dyn Trait.
Methods cannot use generic type parameters. If a method took a generic parameter <T>, the compiler wouldn’t know which concrete type T to use when the method is called through a trait object.

(There are other technical rules, related to where Self: Sized bounds, but these are the most common constraints.)

Most common traits are object-safe. The Clone trait, for example, is not object-safe because its clone method signature is fn clone(&self) -> Self.

11.4.3 Associated Types

Traits can define associated types, which are placeholder types used within the trait’s definition. Implementing types specify the concrete type for these placeholders. This is often preferred over using generic type parameters on the trait itself when there’s a natural, single type associated with the implementor for that trait role.

The classic example is the Iterator trait:

#![allow(unused)]
fn main() {
// Simplified Iterator trait definition from the standard library
trait Iterator {
    // 'Item' is an associated type. Each iterator implementation specifies
    // what type of items it produces.
    type Item;

    // 'next' returns an Option containing an item of the associated type.
    // Note: Self::Item refers to the concrete type specified by the implementor.
    fn next(&mut self) -> Option<Self::Item>;
}
}

Implementing Iterator requires specifying the concrete type for Item:

struct Counter {
    current: u32,
    max: u32,
}

// Implement Iterator for Counter
impl Iterator for Counter {
    // Specify the associated type 'Item' as u32 for this implementation
    type Item = u32;

    // Implement the 'next' method, returning Option<u32>
    fn next(&mut self) -> Option<Self::Item> { // Self::Item resolves to u32 here
        if self.current < self.max {
            self.current += 1;
            Some(self.current - 1) // Return the value *before* incrementing
        } else {
            None // Signal the end of iteration
        }
    }
}

fn main() {
    let mut counter = Counter { current: 0, max: 3 }; // Will produce 0, 1, 2
    println!("{:?}", counter.next()); // Some(0)
    println!("{:?}", counter.next()); // Some(1)
    println!("{:?}", counter.next()); // Some(2)
    println!("{:?}", counter.next()); // None
}

Benefits of Associated Types vs. Generic Parameters on the Trait:

Clarity: When a trait implementation logically yields or works with only one specific type for a given role (like the Item produced by an iterator), associated types make the relationship clearer. impl Iterator for Counter is arguably simpler than impl Iterator<u32> for Counter.
Type Inference: Can sometimes improve type inference compared to generic parameters on the trait itself.
Ergonomics: Method signatures within the trait use Self::Item rather than requiring a generic parameter like Item to be passed down, making the trait definition less cluttered.

11.4.4 The Orphan Rule

Rust’s orphan rule dictates where trait implementations can be written, ensuring coherence and preventing conflicts. It states that you can implement a trait T for a type U only if at least one of the following is true:

The trait T is defined in the current crate (your local package).
The type U is defined in the current crate.

// --- In current crate ---
// Define our local trait
trait MyTrait { fn do_something(&self); }

// Define our local type
struct MyType;

// Assume ForeignTrait and ForeignType are defined in external crates (e.g., `std`)
use std::fmt::Display; // ForeignTrait
use std::collections::HashMap; // ForeignType (example)

// Allowed: Implement local trait for local type
impl MyTrait for MyType { /* ... */ }

// Allowed: Implement local trait for foreign type
impl MyTrait for HashMap<String, i32> { /* ... */ }

// Allowed: Implement foreign trait for local type
impl Display for MyType { /* ... */ }

// Not Allowed (Orphan Rule violation):
// Cannot implement a foreign trait (Display) for a foreign type (HashMap)
// impl Display for HashMap<String, i32> { /* ... */ }
// Error! Both Display and HashMap are external.

This rule prevents multiple crates from providing conflicting implementations of the same trait for the same external type. If you need to implement an external trait for an external type, the standard practice is to define a newtype wrapper around the external type in your crate and implement the trait for your wrapper.

use std::fmt;

// Foreign type we want to Display differently
struct ExternalType { value: i32 }

// Define a newtype wrapper in our crate
struct MyWrapper(ExternalType);

// Implement the foreign trait (Display) for our local wrapper type
impl fmt::Display for MyWrapper {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        write!(f, "MyWrapper({})", self.0.value) // Access inner value via self.0
    }
}

fn main() {
    let external_val = ExternalType { value: 42 };
    let wrapped_val = MyWrapper(external_val);
    println!("{}", wrapped_val); // Uses our Display impl for MyWrapper
}

11.4.5 Common Standard Library Traits

Many fundamental operations in Rust are defined via traits in the standard library. Implementing these traits allows your types to integrate seamlessly with language features and standard library functions. The #[derive] attribute can automatically generate implementations for several common ones, provided the types contained within your struct or enum also implement them.

Debug: Enables formatting with {:?} (for developer-focused output).
Clone: Allows creating a deep copy of a value via the .clone() method. The type must explicitly implement how to duplicate itself.
Copy: A marker trait indicating that a type’s value can be duplicated simply by copying its bits (like C memcpy). Requires Clone. Only applicable to types whose values reside entirely on the stack and have no ownership semantics needing special handling on copy (e.g., integers, floats, bools, function pointers, or structs/enums composed solely of Copy types). Copy types are implicitly duplicated when moved or passed by value.
PartialEq, Eq: Enable equality comparisons (==, !=). PartialEq allows for types where equality might not be defined for all pairs (e.g., floating-point NaN). Eq requires that equality is reflexive, symmetric, and transitive (a true equivalence relation). Deriving Eq requires PartialEq.
PartialOrd, Ord: Enable ordering comparisons (<, >, <=, >=). PartialOrd allows for types where ordering might not be defined for all pairs (e.g., NaN). Ord requires a total ordering. Deriving Ord requires PartialOrd and Eq.
Default: Provides a way to create a sensible default value for a type via Type::default(). Often used for initialization.
Hash: Enables computing a hash value for an instance, required for types used as keys in HashMap or elements in HashSet. Deriving Hash requires Eq.

use std::collections::HashMap;

// Automatically derive implementations for several common traits
#[derive(Debug, Clone, Copy, PartialEq, Eq, PartialOrd, Ord, Default, Hash)]
struct Point {
    x: i32,
    y: i32,
}

fn main() {
    let p1 = Point { x: 1, y: 2 };
    let p2 = p1; // Allowed because Point is Copy; p1 is bitwise copied to p2.
    let p3 = Point::default(); // Uses derived Default impl (x=0, y=0)
    let p4 = p1.clone(); // Uses derived Clone impl (same as Copy here)

    println!("p1: {:?}", p1);         // Uses Debug
    println!("p3: {:?}", p3);         // Uses Debug
    println!("p1 == p2: {}", p1 == p2); // Uses PartialEq
    println!("p1 < p4: {}", p1 < p4);    // Uses PartialOrd (false, as p1==p4)
    println!("p1 == p3: {}", p1 == p3); // Uses PartialEq (false)

    // Use Point as a HashMap key because it derives Hash and Eq
    let mut map = HashMap::new();
    map.insert(p1, "Origin Point");
    println!("Map value for p1: {:?}", map.get(&p1));
}