Introduction

Python supports several built-in data types that are integral part of programming language. Everything in Python is object. Built-in data types are object type of data. They are the easiest way to start using Python because of its simplicity. If you need complex types you will use instances of you own class.

Python is dynamically and strongly typed language at the same time. What does that mean? Dynamically corresponds to fact that you don’t need to explicitly defined type of the variable that is assigned to value. Just populate the value with the desired data type. Strong means once the variable is assigned you can only do operations related with data type. For example integer variable can do only numeric tasks and string variable can do only string operations.

Let’s see what kind of data types Python has:

  • Numeric data types (int, float, long, complex)
  • Sequence types (strings, lists, tuples, ranges, bytes, bytearray)
  • Mappings (dictionaries)
  • Sets
  • Function, Modules, Classes

Data types

Numeric data types

Numeric data types in Python 3.x are divided into three main types: integer, float and complex.

Integer are positive and negative whole numbers including zero. You can store numbers between –9223372036854775808 and 9223372036854775807.

Floats are positive and negative decimal numbers and complex numbers. You can store numbers, ranging from 2.2250738585072014 x 10308 to 1.7976931348623157 x 10308.

Complex numbers include real component and imaginary part j.

Integer numbers can be also represented by hexadecimal (base16), octal (base 8) or binary (base2) format. Hexadecimal numbers are represented by hexadecimal digits (A-F, 0-9). Octal start with 0o or 0O followed by digits (0-7). Binary have 0 and 1.

Sequence types (Strings, Lists, Tuples, Ranges, Bytes, ByteArray)

Sequence types include all data types that are ordered in position in the way they can be indexed and sliced. Indexed means that each element can be accessed by its index. Slicing the sequence takes only part of the sequence.

Text sequence types (Strings)

Strings represent text sequence type and one of the most popular data types to operate. Strings in Python 3.x are array bytes representing Unicode characters. Array of bytes? Yes, they are array of bytes with Unicode representation. It is your choice which type of encoding you will use. Encoding and decoding is done automatically while reading and writing data. Default encoding is UTF-8.

Strings are immutable which means that original string objects cannot be modified in any way. Check later in text details about this feature. Strings can be indexed and sliced which is one of a great way to manipulate strings. Because strings are objects it use their specific built-in method like upper, lowercase, split, encode and many more. Those methods are bound to string type of data only.

Binary sequence types

Bytes and bytearray are part of binary sequence types in Python. Both types of binary representation of string type of data.

Bytes

Python 3 has a clear difference between the sequence of characters and the sequence of raw data machine format – bytes. Bytes are immutable string representations of data. Bytes can be stored in directly in memory but strings could not, because byte are machine representation of data. String must be encoded first to store its values into memory. The range bytes can display data is ASCII code format which means you can display integers from 0 to 256. If Unicode string is represented by ‘abc’ then bytes are represented by b’abc’. Let’s check the connection in the following figure:

C:\docs\stranice\binarymaps\docker\python\python strings bytes.jpg

Bytearrays

Very similar to string but with one major difference. Bytearrays are mutable but also raw type of data. It can be very handy to use if you need to change a lot of little chunks in memory without a need to copy the data but to replace data in place like database engines and image libraries. The range bytes can display data is ASCII code format which means you can display integers from 0 to 256.

Lists

Lists are collections of sequential (ordered) data that do not have a limited number. They are one of the most commonly used data types and, and unlike strings, are mutable. Which means that every element of the list can be changed. List can contain any type of data that is dynamically typed which means that you can create list with different types (strings, integers, bytes, etc). Lists can be nested and can be used to create matrices with duplicated data.

Tuples

Tuples are similar to lists expect they are immutable and cannot be dynamically changed.

Ranges

Ranges are unordered sequence of integer numbers. Range has three parameters: start, end and step. For example range (1,10, 2) means even numbers from 1 to 10. Usually used as shortcut in for loops.

Mappings (Dictionaries)

Dictionaries are something completely different than lists. While lists are indexed by position, dictionaries are accessed by its key. They are unordered which means that cannot be indexed. Dictionaries always stores two values which are regularly called key-value pair, where all keys have their values. This is the reason why dictionaries reside in mapping category. Dictionaries share some characteristics with lists: they are mutable, dynamic and can be nested. Very useful when you want to describe properties of some objects.

Sets

Sets are a collection of unordered collection of data which can grow and shrink on demand. They differ from lists and dictionaries just because of its features: unordered and not mapping type. They hold unique collection of objects and usually used with mathematical operations like union and difference. Because sets are not ordered their elements cannot be accessed with index but major advantage of sets are their easy way to confirm if object is part of set because it maintains hash table of objects. Another type of immutable set is frozen set.

Frozen Set

Frozen Sets are immutable types of Sets.

Other types

Because everything in Python is object, there are several types included like objects, methods, functions, files and types.

Immutable vs Mutable

All these Python built-in data types are sharing or not sharing some common properties like: mutable, immutable, dynamic, ordered or unordered. Mutability is the reason why we have many more Python types. Built-in types (int, float, bool, str, tuple, unicode) are immutable. Built-in types (list, set, dict) are mutable. Mutable objects can be changed while immutable objects can’t. Best way to identify object in memory is to use id() functions which return integer number of object located in memory.

Use cases:

  • Access to Immutable objects is quicker than mutable
  • Mutable objects are used when you need to change size of the object
  • Immutable objects are used when you want to ensure that object is always the same
  • Changing the immutable objects involves much expensive operation because you have to create copy of existing object
Object Type Domain Mutable Ordered Dynamic
Int Number Z No N/A N/A
Float Number Z No N/A N/A
Complex Number Z No N/A N/A
String Text Text Yes N/A N/A
Byte Binary Unicode No N/A N/A
ByteArray Binary Unicode Yes N/A N/A
List Sequence Collection of things Yes Yes Yes
Tuple Sequence Collection of things No Yes No
Range Sequence Collection of numbers No Yes No
Set Sequence Collection of things Yes No Yes
Frozen Set Sequence Collection of things No No Yes
Dictionary Mapping Mapping between things Yes No Yes

Python operators

Python operators are divided into several major categories:

  • Arithmetic operators
  • Comparison operators
  • Logical operators
  • Bitwise operators
  • Assignment operators
  • Identity operators
  • Membership operators

Arithmetic operators

OPERATOR DESCRIPTION SYNTAX
+ Addition x + y
Subtraction x – y
* Multiplication x * y
/ Division x / y
% Modulus x % y
** Exponentiation x ** y
// Floor division (integer division) x // y

Comparison operators

OPERATOR DESCRIPTION OPERATION
== If values of two operands are equal, result is true a == b is TRUE
!= If values of two operands are not equal, result is true a != b IS TRUE
<> If values of two operands are not equal, result is true a <> b IS TRUE
> If the value of left operand is greater than the value of right operand, then condition becomes true. a>b IS TRUE
< If the value of left operand is less than the value of right operand, then condition becomes true. a<b IS TRUE
>= If the value of left operand is greater than or equal to the value of right operand, then condition becomes true. a>=b IS TRUE
<= If the value of left operand is less than or equal to the value of right operand, then condition becomes true. a<=b IS TRUE

Logical operators

OPERATOR DESCRIPTION OPERATION
AND If both operands are TRUE, result is TRUE a AND b
OR If any of operands are TRUE, results is TRUE a OR b
NOT Used to reverse the logical state of its operands NOT (a AND b)

NOT (a OR b)

Bitwise operators

OPERATOR DESCRIPTION SYNTAX
& Bitwise AND

Returns 1 if bits on same position are 1

x & y
| Bitwise OR

Returns 0 if bits on same position are 0

x | y
~ Bitwise NOT

Inverts all bits

~x
^ Bitwise XOR

Returns 1 if any of bits on same position is 1

x ^ y
>> Bitwise right shift

Shifts 1 bit from left to right

x>>
<< Bitwise left shift

Shifts 1 bit from right to left

x<<

Assignment operators

Operator Syntax Eqivalent
= x = 1 x = 1
+= x += 2 x = x + 2
-= x -= 1 x = x – 1
*= x *= 2 x = x * 2
/= x /= 1 x = x / 1
%= x %= 2 x = x % 2
//= x //= 1 x = x // 1
**= x **= 2 x = x ** 2
&= x &= 1 x = x & 1
|= x |= 2 x = x | 2
^= x ^= 1 x = x ^ 1
>>= x >>= 2 x = x >> 2
//= x //= 1 x = x // 1

Identity operators

OPERATOR DESCRIPTION OPERATION
IS TRUE if operands are same X is TRUE
IS NOT TRUE if operands are not same X is NOT TRUE

Membership operators

OPERATOR DESCRIPTION OPERATION
IN Returns TRUE if specified value is present in object x in y
NOT IN Returns TRUE if specified value is not present in object x not in y