C++ swscanf()

The swscanf() function in C++ reads wide character from wide string buffer.

The swscanf() function is defined in <cwchar> header file.

swscanf() prototype

int swscanf( const wchar_t* buffer, const wchar_t* format, ... );

The swscanf() function reads the data from buffer and stores the values into the respective variables.


swscanf() Parameters

  • buffer: Pointer to the wide string buffer to read the data from.
  • format: Pointer to a null-terminated character string that specifies how to read the input. It consists of format specifiers starting with %.
    The format string has the following parts:
    • Non whitespace characters except % each of which consumes one identical character from the input stream. It can cause the function to fail if the next character on the stream does not compare equal.
    • Whitespace character: All the consecutive whitespace characters are treated as single whitespace character. Further, '\n', '\t' and ' ' are considered same.
    • Conversion specification: It follows the following format:
      • Initial % character that specifies the beginning
      • An optional * called assignment-suppressing character. If this character is present, swscanf() does not assign the result to any receiving argument.
      • An optional positive integer number that specifies maximum field width. It specifies the maximum number of characters that swscanf() is allowed to consume when doing the conversion specified by the current conversion specification.
      • An optional length modifier specifying the size of the receiving argument.
      • A conversion format specifier.
    Format specifiers
    Format Specifier Description
    % Matches the literal %
    c Matches a single character or multiple characters. If width is defined, matches exactly width characters.
    s Matches consecutive non whitespace characters. If width is defined, matches exactly width characters or until first whitespace is found.
    [set] Matches a non empty sequence of character from the given set of characters. If ^ is present at the beginning of set, then all the characters not in set are matched.
    d Matches a decimal integer.
    i Matches an integer.
    o Matches an unsigned octal integer.
    X or x Matches an unsigned hexadecimal integer.
    u Matches an unsigned decimal integer.
    A or a, E or e, F or f, G or g Matches a floating-point number.
    n Returns the number of characters read so far.
    p Matches an implementation defined character sequence defining a pointer.

    So the general format of format specifier is:
    %[*][width][length]specifier
  • : Other additional arguments for receiving data. They occur in a sequence according to the format specifier.

swscanf() Return value

  • The swscanf() function returns the number of receiving arguments successfully assigned.
  • If failure occurs before the first receiving argument was assigned, EOF is returned.

Example: How swscanf() function works?

#include <cwchar>
#include <clocale>

int main()
{
	wchar_t buffer[100] = L"Theta \u03b8";
	wchar_t symbol, desc[10];

	setlocale(LC_ALL, "en_US.UTF-8");
	swscanf(buffer, L"%ls %lc", desc, &symbol);
	
	wprintf(L"%lc is %ls\n", symbol, desc);
	return 0;
}

When you run the program, a possible output will be:

θ is Theta
Did you find this article helpful?