Primitive Type char

1.0.0

A character type.

The char type represents a single character. More specifically, since ‘character’ isn’t a well-defined concept in Unicode, char is a ‘Unicode scalar value’, which is similar to, but not the same as, a ‘Unicode code point’.
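
As a minimal sketch of that distinction: every scalar value is a code point, but the surrogate code points U+D800 through U+DFFF are not scalar values, so they can never be stored in a char. The helper name is_scalar_value is hypothetical and exists only for illustration; char::from_u32 and char::MAX are the items listed further down this page.

```rust
/// Hypothetical helper: true when `n` is a Unicode scalar value,
/// that is, when it can be represented as a `char`.
fn is_scalar_value(n: u32) -> bool {
    char::from_u32(n).is_some()
}

fn main() {
    // An ordinary code point is also a scalar value.
    assert!(is_scalar_value(0x41)); // 'A'
    // Surrogates are code points, but not scalar values.
    assert!(!is_scalar_value(0xD800));
    assert!(!is_scalar_value(0xDFFF));
    // The largest scalar value is char::MAX, U+10FFFF.
    assert!(is_scalar_value(char::MAX as u32));
    assert!(!is_scalar_value(0x11_0000));
}
```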

This documentation describes a number of methods and trait implementations on the char type. For technical reasons, there is additional, separate documentation in the std::char module as well.

Representation

char is always four bytes in size. This is a different representation than a given character would have as part of a String. For example:

let v = vec!['h', 'e', 'l', 'l', 'o'];

// five elements times four bytes for each element
assert_eq!(20, v.len() * std::mem::size_of::<char>());

let s = String::from("hello");

// five elements times one byte per element
assert_eq!(5, s.len() * std::mem::size_of::<u8>());

As always, remember that a human intuition for ‘character’ might not map to Unicode’s definitions. For example, despite looking similar, the ‘é’ character is one Unicode code point while ‘é’ is two Unicode code points:

let mut chars = "é".chars();
// U+00e9: 'latin small letter e with acute'
assert_eq!(Some('\u{00e9}'), chars.next());
assert_eq!(None, chars.next());

let mut chars = "é".chars();
// U+0065: 'latin small letter e'
assert_eq!(Some('\u{0065}'), chars.next());
// U+0301: 'combining acute accent'
assert_eq!(Some('\u{0301}'), chars.next());
assert_eq!(None, chars.next());

This means that the contents of the first string above will fit into a char while the contents of the second string will not. Trying to create a char literal with the contents of the second string gives an error:

error: character literal may only contain one codepoint: 'é'
let c = 'é';
        ^^^

Another implication of the 4-byte fixed size of a char is that per-char processing can end up using a lot more memory:

let s = String::from("love: ❤️");
let v: Vec<char> = s.chars().collect();

assert_eq!(12, std::mem::size_of_val(&s[..]));
assert_eq!(32, std::mem::size_of_val(&v[..]));
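
When that extra memory matters, one option is to iterate over chars() directly rather than collecting into a Vec<char>. The sketch below processes each char as it is decoded, so no four-bytes-per-element buffer is allocated. The function name count_non_ascii is hypothetical, and the string is written with escapes so the two trailing code points are visible.

```rust
/// Hypothetical helper: counts non-ASCII chars without building a Vec<char>.
fn count_non_ascii(s: &str) -> usize {
    s.chars().filter(|c| !c.is_ascii()).count()
}

fn main() {
    // U+2764 (heavy black heart) followed by U+FE0F (variation selector-16)
    let s = "love: \u{2764}\u{fe0f}";
    assert_eq!(2, count_non_ascii(s));

    // The UTF-8 length of a string is the sum of each char's encoded length.
    let total: usize = s.chars().map(char::len_utf8).sum();
    assert_eq!(s.len(), total);
    assert_eq!(12, total);
}
```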

Implementations

impl char

pub const MAX: char

pub const REPLACEMENT_CHARACTER: char

pub const UNICODE_VERSION: (u8, u8, u8)

pub fn decode_utf16<I>(iter: I) -> DecodeUtf16<<I as IntoIterator>::IntoIter> where I: IntoIterator<Item = u16>,
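
A short sketch of how decode_utf16 might be used: the iterator yields a Result for each decoded value, so unpaired surrogates can be handled explicitly. The helper decode_lossy is a hypothetical name; it substitutes char::REPLACEMENT_CHARACTER for every error, which is one possible policy among several.

```rust
/// Hypothetical helper: decodes UTF-16 code units, replacing each
/// unpaired surrogate with U+FFFD.
fn decode_lossy(units: &[u16]) -> String {
    char::decode_utf16(units.iter().copied())
        .map(|r| r.unwrap_or(char::REPLACEMENT_CHARACTER))
        .collect()
}

fn main() {
    // U+1D11E as a surrogate pair, "mu", a lone surrogate, then "ic"
    let v = [0xD834, 0xDD1E, 0x006d, 0x0075, 0xD800, 0x0069, 0x0063];
    assert_eq!("\u{1D11E}mu\u{FFFD}ic", decode_lossy(&v));
}
```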

pub fn from_u32(i: u32) -> Option<char>

pub unsafe fn from_u32_unchecked(i: u32) -> char

pub fn from_digit(num: u32, radix: u32) -> Option<char>

pub fn is_digit(self, radix: u32) -> bool

pub fn to_digit(self, radix: u32) -> Option<u32>
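
The three digit functions above fit together as in the following sketch. The parse_hex helper is hypothetical and only illustrates to_digit; for real parsing, u32::from_str_radix is usually the better tool.

```rust
/// Hypothetical helper: parses hexadecimal digits into a u32, returning
/// None on a non-hex character or on overflow. An empty string yields Some(0).
fn parse_hex(s: &str) -> Option<u32> {
    let mut value: u32 = 0;
    for c in s.chars() {
        // to_digit returns None when `c` is not a digit in the given radix.
        let d = c.to_digit(16)?;
        value = value.checked_mul(16)?.checked_add(d)?;
    }
    Some(value)
}

fn main() {
    assert_eq!(Some(255), parse_hex("fF"));
    assert_eq!(None, parse_hex("xyz"));

    assert!('7'.is_digit(10));
    assert!(!'a'.is_digit(10));
    assert!('a'.is_digit(16));

    // from_digit is the inverse direction and produces lowercase letters.
    assert_eq!(Some('b'), char::from_digit(11, 16));
    assert_eq!(None, char::from_digit(16, 16));
}
```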

pub fn escape_unicode(self) -> EscapeUnicode

pub fn escape_debug(self) -> EscapeDebug

pub fn escape_default(self) -> EscapeDefault
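
Each escape function returns an iterator of chars, which can be collected or converted with to_string(). The sketch below applies escape_default to every char of a string; the helper name escaped is hypothetical.

```rust
/// Hypothetical helper: escapes every char of `s` with escape_default.
fn escaped(s: &str) -> String {
    s.chars().flat_map(char::escape_default).collect()
}

fn main() {
    // Printable ASCII is left alone, control and non-ASCII chars are escaped.
    assert_eq!("a\\n\\u{2764}", escaped("a\n\u{2764}"));

    // escape_unicode always uses the \u{NNNN} form, even for ASCII.
    assert_eq!("\\u{61}", 'a'.escape_unicode().to_string());

    // escape_debug uses the short form for common control characters.
    assert_eq!("\\n", '\n'.escape_debug().to_string());
}
```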

pub const fn len_utf8(self) -> usize

pub const fn len_utf16(self) -> usize

pub fn encode_utf8(self, dst: &mut [u8]) -> &mut str

pub fn encode_utf16(self, dst: &mut [u16]) -> &mut [u16]
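
A sketch of the encoding functions, assuming caller-provided stack buffers: four bytes is enough for any char in UTF-8, and two u16 units is enough in UTF-16. The helper names utf8_bytes and utf16_units are hypothetical.

```rust
/// Hypothetical helper: the UTF-8 bytes of a single char.
fn utf8_bytes(c: char) -> Vec<u8> {
    let mut buf = [0u8; 4]; // large enough for any char
    c.encode_utf8(&mut buf).as_bytes().to_vec()
}

/// Hypothetical helper: the UTF-16 code units of a single char.
fn utf16_units(c: char) -> Vec<u16> {
    let mut buf = [0u16; 2]; // large enough for any char
    c.encode_utf16(&mut buf).to_vec()
}

fn main() {
    let sharp_s = '\u{00DF}'; // LATIN SMALL LETTER SHARP S
    assert_eq!(2, sharp_s.len_utf8());
    assert_eq!(vec![0xC3, 0x9F], utf8_bytes(sharp_s));
    assert_eq!(1, sharp_s.len_utf16());

    let clef = '\u{1D11E}'; // MUSICAL SYMBOL G CLEF, outside the BMP
    assert_eq!(4, clef.len_utf8());
    assert_eq!(vec![0xF0, 0x9D, 0x84, 0x9E], utf8_bytes(clef));
    assert_eq!(vec![0xD834, 0xDD1E], utf16_units(clef));
}
```

Both encode functions panic if the buffer is too small, so sizing it with len_utf8 or len_utf16 (or using the maximum, as here) avoids that case.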

pub fn is_alphabetic(self) -> bool

pub fn is_lowercase(self) -> bool

pub fn is_uppercase(self) -> bool

pub fn is_whitespace(self) -> bool

pub fn is_alphanumeric(self) -> bool

pub fn is_control(self) -> bool

pub fn is_numeric(self) -> bool

pub fn to_lowercase(self) -> ToLowercase

pub fn to_uppercase(self) -> ToUppercase
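
to_lowercase and to_uppercase return iterators rather than a single char because a case mapping can expand to more than one char. A minimal sketch, with a hypothetical helper named upper (str::to_uppercase already provides this for whole strings):

```rust
/// Hypothetical helper: uppercases a string one char at a time.
fn upper(s: &str) -> String {
    s.chars().flat_map(char::to_uppercase).collect()
}

fn main() {
    // Most chars map to exactly one char.
    assert_eq!(1, 'a'.to_uppercase().count());
    assert_eq!("A", 'a'.to_uppercase().to_string());

    // U+00DF (sharp s) uppercases to the two chars "SS".
    assert_eq!(2, '\u{df}'.to_uppercase().count());
    assert_eq!("STRASSE", upper("stra\u{df}e"));
}
```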

pub const fn is_ascii(&self) -> bool

pub const fn to_ascii_uppercase(&self) -> char

pub const fn to_ascii_lowercase(&self) -> char

pub const fn eq_ignore_ascii_case(&self, other: &char) -> bool

pub fn make_ascii_uppercase(&mut self)

pub fn make_ascii_lowercase(&mut self)

pub const fn is_ascii_alphabetic(&self) -> bool

pub const fn is_ascii_uppercase(&self) -> bool

pub const fn is_ascii_lowercase(&self) -> bool

pub const fn is_ascii_alphanumeric(&self) -> bool

pub const fn is_ascii_digit(&self) -> bool

pub const fn is_ascii_hexdigit(&self) -> bool

pub const fn is_ascii_punctuation(&self) -> bool

pub const fn is_ascii_graphic(&self) -> bool

pub const fn is_ascii_whitespace(&self) -> bool

pub const fn is_ascii_control(&self) -> bool
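
The ASCII predicates and conversions above only ever consider the ASCII range; any other char makes a predicate return false and is left unchanged by a conversion. The sketch below uses them for a deliberately simplified identifier check. The rule and the name is_identifier are assumptions made for illustration.

```rust
/// Hypothetical helper: true if `s` starts with an ASCII letter or '_'
/// and continues with ASCII letters, digits, or '_'.
fn is_identifier(s: &str) -> bool {
    let mut chars = s.chars();
    match chars.next() {
        Some(c) if c.is_ascii_alphabetic() || c == '_' => {}
        _ => return false,
    }
    chars.all(|c| c.is_ascii_alphanumeric() || c == '_')
}

fn main() {
    assert!(is_identifier("_foo1"));
    assert!(!is_identifier("1foo"));
    assert!(!is_identifier("caf\u{e9}")); // U+00E9 is not ASCII

    assert!('G'.eq_ignore_ascii_case(&'g'));
    assert_eq!('G', 'g'.to_ascii_uppercase());
    // Non-ASCII chars pass through unchanged.
    assert_eq!('\u{e9}', '\u{e9}'.to_ascii_uppercase());

    let mut c = 'Q';
    c.make_ascii_lowercase();
    assert_eq!('q', c);
}
```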

Trait Implementations

impl AsciiExt for char

type Owned = char

fn is_ascii(&self) -> bool

fn to_ascii_uppercase(&self) -> Self::Owned

fn to_ascii_lowercase(&self) -> Self::Owned

fn eq_ignore_ascii_case(&self, o: &Self) -> bool

fn make_ascii_uppercase(&mut self)

fn make_ascii_lowercase(&mut self)

impl Clone for char

pub fn clone(&self) -> char

fn clone_from(&mut self, source: &Self)

impl Debug for char

pub fn fmt(&self, f: &mut Formatter<'_>) -> Result<(), Error>

impl Default for char

pub fn default() -> char

impl Display for char

pub fn fmt(&self, f: &mut Formatter<'_>) -> Result<(), Error>

impl<'a> Extend<&'a char> for String

pub fn extend<I>(&mut self, iter: I) where I: IntoIterator<Item = &'a char>,

pub fn extend_one(&mut self, &c: &'a char)

pub fn extend_reserve(&mut self, additional: usize)

impl Extend<char> for String

pub fn extend<I>(&mut self, iter: I) where I: IntoIterator<Item = char>,

pub fn extend_one(&mut self, c: char)

pub fn extend_reserve(&mut self, additional: usize)
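
Because String implements Extend for both char and &char, it can be grown from an iterator of owned or borrowed chars. A brief sketch; the helper name append_all is hypothetical.

```rust
/// Hypothetical helper: appends owned chars, then borrowed chars, to `base`.
fn append_all(base: &str, owned: Vec<char>, borrowed: &[char]) -> String {
    let mut s = String::from(base);
    s.extend(owned); // uses Extend<char>
    s.extend(borrowed.iter()); // uses Extend<&char>
    s
}

fn main() {
    let tail = ['e', 'f'];
    let s = append_all("ab", vec!['c', 'd'], &tail);
    assert_eq!("abcdef", s);
}
```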

impl From<char> for u128

pub fn from(c: char) -> u128

impl From<char> for u32

pub fn from(c: char) -> u32

impl From<char> for u64

pub fn from(c: char) -> u64

impl From<char> for String

pub fn from(c: char) -> String

impl From<u8> for char

pub fn from(i: u8) -> char
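
The From implementations above are all infallible. A char always fits in a u32, u64, or u128, and every u8 value maps to the code point with the same number (U+0000 through U+00FF). A short sketch, with a hypothetical helper named code_point:

```rust
/// Hypothetical helper: the numeric value of a char's scalar value.
fn code_point(c: char) -> u32 {
    u32::from(c)
}

fn main() {
    assert_eq!(0x2764, code_point('\u{2764}'));
    assert_eq!(97u64, u64::from('a'));
    assert_eq!(97u128, u128::from('a'));

    // A single char converts directly into a String.
    assert_eq!("a", String::from('a'));

    // u8 to char never fails; 0xE9 becomes U+00E9.
    assert_eq!('A', char::from(0x41u8));
    assert_eq!('\u{e9}', char::from(0xE9u8));
}
```

The reverse direction, from u32 to char, can fail and goes through char::from_u32 instead.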

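impl<'a> FromIterator<&'a char> for String

pub fn from_iter<I>(iter: I) -> String where I: IntoIterator<Item = &'a char>,

impl FromIterator<char> for String

pub fn from_iter<I>(iter: I) -> String where I: IntoIterator<Item = char>,

impl<'a> FromIterator<char> for Cow<'a, str>

pub fn from_iter<I>(it: I) -> Cow<'a, str> where I: IntoIterator<Item = char>,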
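
These from_iter signatures are what allow collect() to build a String or a Cow<str> from an iterator of chars. A minimal sketch; the helper name reversed is hypothetical, and it reverses scalar values, not user-perceived characters.

```rust
use std::borrow::Cow;

/// Hypothetical helper: reverses a string char by char.
fn reversed(s: &str) -> String {
    s.chars().rev().collect()
}

fn main() {
    assert_eq!("olleh", reversed("hello"));

    // Collecting from borrowed chars
    let from_refs: String = ['h', 'i'].iter().collect();
    assert_eq!("hi", from_refs);

    // Collecting into a Cow<str>, which ends up holding an owned String
    let cow: Cow<'_, str> = vec!['o', 'k'].into_iter().collect();
    assert_eq!("ok", &*cow);
}
```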