The following document collects a list of tips and tricks for writing high-performance Swift code. The documentation is intended for compilers and library developers.

The documentation includes tips to help improve the quality of your Swift program and make your code less error-prone and more readable. Explicitly marking the final class and the class protocol are two obvious examples. However, there are also some tricks in the documentation that are unorthodox and distorted, addressing specific AD hoc needs than a compiler or language. Many of the recommendations in the document come from trade-offs such as runtime, byte size, code readability, and so on.

To enable optimized

The first thing you should do is enable optimizations. Swift offers three different optimization levels:

  • -Onone: This means normal development. It performs minimal optimizations and saves all debugging information.
  • -o: This means for most production code. The compiler performs aggressive optimizations that can significantly change the type and amount of submitted code. Debugging information will be omitted but still damaging.
  • -ounchecked: This is a special optimization mode, which means that a particular library or application is traded for security. The compiler removes all overflow checks as well as some implicit type checks. This is not usually used, as it can cause memory safety issues and integer overflows. Integer overflows and type conversions are safe if you scrutinize your code carefully.

The current optimization levels that can be modified in Xcode UI are as follows:…

Overall component optimization

By default, Swift compiles each file individually. This allows Xcode to compile multiple files in parallel very quickly. However, compiling each file separately can prevent some compiler optimizations. Swift can also compile the entire program as if it were a file, optimizing the program as if it were a single compilation unit. This pattern can be activated using the command line flag-whole-module-optimization. Programs compiled in this mode will most likely take longer to compile and will run faster.

This pattern can be activated by “Whole Module Optimization” in the XCode build Settings.

Reduce dynamic scheduling

Swift is by default a very dynamic language like Objective-C. Unlike Objective-C, Swift gives programmers the ability to provide runtime performance by eliminating and reducing this feature. This section provides several examples of language constructs that can be used for such operations.

Dynamic scheduling

Class uses dynamically scheduled methods with default property access. So in the following snippet, a.property, a.dosomething (), and a.doSomethingelse () will all be called by dynamic scheduling:

class A {
  var aProperty: [Int]
  func doSomething() { ... }
  dynamic doSomethingElse() { ... }
}

class B : A {
  override var aProperty {
    get { ... }
    set { ... }
  }

  override func doSomething() { ... }
}

func usingAnA(a: A) {
  a.doSomething()
  a.aProperty = ...
}Copy the code

In Swift, dynamic scheduling is invoked indirectly by default through a Vtable [1] (virtual function table). If declared with a dynamic keyword, Swift will send the call instead by calling an Objective-C notification. In both cases, this is slower than a direct function call because it prevents many compiler optimizations for program overhead beyond the indirect call itself [2]. In performance-critical code, people often want to limit this dynamic behavior.

Tip: Use “final” when you know the declaration doesn’t need to be overwritten.

The final keyword is a restriction ina class, method, or property declaration that prevents such declarations from being overridden. This means that the compiler can call a direct function call instead of an indirect one. For example, c. array1 and D.array1 below will be accessed directly [3]. Instead, d. array2 will be accessed through a virtual function table:

final class C {
  // No declarations in class 'C' can be overridden.
  var array1: [Int]
  func doSomething() { ... }
}

class D {
  final var array1 [Int] // 'array1' cannot be overridden by a computed property.
  var array2: [Int]      // 'array2' *can* be overridden by a computed property.
}

func usingC(c: C) {
   c.array1[i] = ... // Can directly access C.array without going through dynamic dispatch.
   c.doSomething() = ... // Can directly call C.doSomething without going through virtual dispatch.
}

func usingD(d: D) {
   d.array1[i] = ... // Can directly access D.array1 without going through dynamic dispatch.
   d.array2[i] = ... // Will access D.array2 through dynamic dispatch.
}Copy the code

Suggestion: Use “private” when declaring something that does not need to be accessed outside the file.

Using the private keyword on a declaration limits the visibility of the file to which it is declared. This gives the editor the ability to identify all other potential coverage claims. Thus, the absence of any such declaration allows the compiler to automatically infer final keywords and accordingly strip out indirect calls to the other side and access to properties. For example, in e.dosomething () and f.myprivatevar, it would be directly accessible, assuming that E and F do not have any overrides declared in the same file:

private class E { func doSomething() { ... } } class F { private var myPrivateVar : Int } func usingE(e: E) { e.doSomething() // There is no sub class in the file that declares this class. // The compiler can remove virtual Call to doSomething() // and directly call A doSomething method.} func usingF(f: f) -> Int {return f.myprivatevar}Copy the code

Use container types efficiently

The generic containers Array and Dictionary are an important feature provided by the Swift standard library. This section shows you how to use these types in a high-performance way.

Suggestion: Use value types in arrays

In Swift, types can be divided into two distinct categories: value types (structs, enumerations, tuples) and reference types (classes). One key distinction is that NSArray cannot contain value types. So when using value types, the optimizer doesn’t have to deal with NSArray support, saving most of the overhead on arrays.

In addition, if a value type recursively contains a reference type compared to a reference type, the value type only needs a reference counter. If you use a value type that has no reference type, you can avoid the extra overhead and thus free up traffic in the array.

// Don't use a class here.
struct PhonebookEntry {
  var name : String
  var number : [Int]
}

var a : [PhonebookEntry]Copy the code

Remember to make a trade-off between using large-value types and using reference types. In some cases, the cost of copying and moving large-value type data is greater than the cost of removing the bridge and holding/releasing.

Suggestion: If the NSArray bridge is unnecessary, use a ContiguousArray to store the reference type. If you need a reference Array, and the Array doesn’t need to bridge to an NSArray, use a ContiguousArray instead of Array:

class C { ... }
var a: ContiguousArray<C> = [C(...), C(...), ..., C(...)]Copy the code

Suggestion: Use appropriate changes instead of object assignments.

All standard library containers in Swift use copy-on-write (COW) to perform copy instead of instant copy. In many cases, this allows the compiler to eliminate unnecessary copies by holding containers instead of deep copies. If the container’s reference count is greater than 1 and the container’s time is changed, the underlying container is copied. For example: in the following case: when D is assigned to C, it is not copied, but when D undergoes a structural change to append 2, then D will be copied, and 2 will append to B:

var c: [Int] = [ ... ]  var d = c // No copy will occur here. d.append(2) // A copy *does* occur here.Copy the code

Sometimes COW causes extra copies if the user is not careful. For example, in a function, an attempt is made to perform a modification through object allocation. In Swift, all arguments are copied as they are passed; for example, arguments are held at the point of call and then released at the end of the called function. That is, functions like the following:

func append_one(a: [Int]) -> [Int] {
  a.append(1)
  return a
}

var a = [1, 2, 3]
a = append_one(a)Copy the code

Although the version of a has not changed due to assignment and is not used after append_one, a may be copied. This can be avoided by using the inout parameter:

func append_one_in_place(inout a: [Int]) {
  a.append(1)
}

var a = [1, 2, 3]
append_one_in_place(&a)Copy the code

Unchecked operation

Swift solves the integer overflow bug by checking for overflow when performing ordinary calculations. These checks are not appropriate in efficient code where it has been determined that no memory safety issues will occur.

Suggestion: Use unchecked integer computations when you know for sure that overflow will not occur.

In performance-critical code, you can ignore overflow checking if you know your code is safe.

a : [Int]
b : [Int]
c : [Int]

// Precondition: for all a[i], b[i]: a[i] + b[i] does not overflow!
for i in 0 ... n {
  c[i] = a[i] &+ b[i]
}Copy the code

The generic

Swift provides a very powerful abstraction mechanism through the use of generic types. The Swift compiler issues a concrete code block that executes MySwiftFunc<T> on any T. The generated code requires a table of function Pointers and a box containing T as additional arguments. The different behavior between MySwiftFunc<Int> and MySwiftFunc<String> is illustrated by passing different function pointer tables and abstract sizes provided through boxes. An example of generics:

class MySwiftFunc<T> { ... }

MySwiftFunc<Int> X    // Will emit code that works with Int...
MySwiftFunc<String> Y // ... as well as String.Copy the code

When the optimizer is enabled, the Swift compiler looks for a call to this code and tries to identify the specific type used in the call (for example, a non-generic type). If the definition of a generic function is visible to the optimizer and the specific type is known, the Swift compiler will generate a special generic function with a special type. The process of calling this particular function avoids the expense of associated generics. Some examples of generics:

class MyStack<T> {
  func push(element: T) { ... }
  func pop() -> T { ... }
}

func myAlgorithm(a: [T], length: Int) { ... }

// The compiler can specialize code of MyStack[Int]
var stackOfInts: MyStack[Int]
// Use stack of ints.
for i in ... {
  stack.push(...)
  stack.pop(...)
}

var arrayOfInts: [Int]
// The compiler can emit a specialized version of 'myAlgorithm' targeted for
// [Int]' types.
myAlgorithm(arrayOfInts, arrayOfInts.length)Copy the code

Suggestion: Put the declaration of a generic in the file that uses it

The optimizer can perform specialization only if the generic declaration is visible to the current module. This can only happen if the code that uses generics and the code that declares them are in the same file. Note that the standard library is an exception. Generics declared in the standard library are visible to all modules and can be specialized.

Suggestion: Allow the compiler to specialize

The compiler can specialize generic code only if the calling location is in the same compilation unit as the called function. A trick we can use to get the compiler to optimize the called function is to perform type checking in the compilation unit in which the called function resides. The code that performs the type checking redistributes the call to the generic function — but this time it carries the type information. In the following code, we have inserted type checking into the function play_a_game, making the code hundreds of times faster.

//Framework.swift: protocol Pingable { func ping() -> Self } protocol Playable { func play() } extension Int : Pingable { func ping() -> Int { return self + 1 } } class Game<T : Pingable> : Playable { var t : T init (_ v : T) {t = v} func play() { for _ in 0... 100_000_000 { t = t.ping() } } } func play_a_game(game : Playable ) { // This check allows the optimizer to specialize the // generic call 'play' if let z = game as? Game<Int> { z.play() } else { game.play() } } /// -------------- >8 // Application.swift: play_a_game(Game(10))Copy the code

Overhead of large value objects

In swift, value types hold a unique copy of their data. There are many advantages to using value types, such as the independent state of value types. When we copy a value type (i.e. copy, pass initialization parameters, etc.), the program creates a copy of the value type. For large value types, this copying can be time consuming and may affect program performance.

Let’s take a look at the following code. This code defines a tree using nodes of value types that contain other nodes of protocol types. Computer graphics scenarios often consist of entities and morphologies that can be represented using value types, so this example makes practical sense

protocol P {} struct Node : P { var left, right : P? } struct Tree { var node : P? init() { ... }}Copy the code

When a tree is copied (parameter passing, initialization, or assignment) the entire tree needs to be copied. This is an expensive operation, requiring a lot of malloc/free calls and a lot of reference-counting

However, it does not matter whether the values are copied or not, as long as they are in memory.

Use COW for large value types (copy-on-write, copy-on-write is similar to arrays)

One way to reduce the overhead of copying data of large value types is to use the copy-on-write behavior (the actual copying takes place only when the object changes). The simplest copy-on-write scheme uses pre-existing copy-on-write data structures, such as arrays. Swift’s data is of value type, but the array is not copied every time it is passed as a parameter, because it is copy-on-write.

In our Tree example we reduce the cost of copying by wrapping the contents of the Tree as an array. This simple change has a huge impact on the performance of our tree data structure, changing the cost of passing the array as a parameter from O(n) to O(1).

struct tree : P { var node : [P?]  init() { node = [ thing ] } }Copy the code

But there are two obvious drawbacks to implementing COW with arrays. The first problem is that array exposed methods such as Append and count have no use in the context of value wrapping, and these methods make wrapping reference types tricky. Perhaps we could solve this problem by creating a wrapped structure and hiding the unused apis, but it would not solve the second problem. The second problem is that there is code inside the array to secure the program and to interact with the OC. Swift checks to see if the table is enclosed within the bounds of the array, and to see if the storage space needs to be expanded when saving values. These runtime checks can slow things down.

An alternative is to implement a dedicated data structure that uses COW instead of wrapping values with arrays. An example of building such a data structure is shown below:

final class Ref<T> { var val : T init(_ v : T) {val = v} } struct Box<T> { var ref : Ref<T> init(_ x : T) { ref = Ref(x) } var value: T { get { return ref.val } set { if (! isUniquelyReferencedNonObjC(&ref)) { ref = Ref(newValue) return } ref.val = newValue } } }Copy the code

The type Box can replace the array in the previous example

Unsafe code

Swift language classes use reference counting for memory management. The Swift compiler inserts code that increases the reference count each time an object is accessed. For example, consider an example of traversing a linked list implemented using a class. Traversing the list is accomplished by moving a reference to the next node in the list: elem = elem.next. Each time moving this reference, Swift increases the reference count of the next object and decreases the reference count of the previous object, which is expensive but unavoidable with the Swift class

final class Node {
 var next: Node?
 var data: Int
 ...
}Copy the code

Suggestion: Use unmanaged references to avoid the overhead of reference counting

You can choose to use unmanaged references in efficient code. The Unmanaged<T> structure allows developers to turn off reference counting for specific references

var Ref : Unmanaged<Node> = Unmanaged.passUnretained(Head)

while let Next = Ref.takeUnretainedValue().next {
  ...
  Ref = Unmanaged.passUnretained(Next)
}Copy the code

agreement

Suggestion: Mark protocols that have only class implementations as class protocols

Swift can specify that protocols are implemented only by classes. One advantage of the tag protocol being implemented only by classes is that the compiler can optimize the program based on this. For example, the ARC memory management system can easily hold (increase the reference count for that object) if it knows it is dealing with a class object. If the compiler does not know this, it must assume that the structure can also implement the protocol, and it must be prepared to hold or release different data structures, which can be expensive.

If a protocol is restricted to being implemented by a class, mark the protocol as a class for better performance

protocol Pingable : class { func ping() -> Int }Copy the code

footnotes

[1] A virtual method table, or Vtable, is a type constraint table containing the addresses of type methods referenced by an instance. For dynamic distribution, the table is first looked up from the object and then the methods in the table. [2] This is because the compiler does not know which specific method is to be called. [3] For example, Load a class field directly or call a method directly. [4] Explain what COW is. [5] In certain cases can the optimizer use inline and ARC optimization techniques to remove retain, release because there is no replication caused