1.2 Understanding Cocoa’s Role in Mac OS X
Cocoa is a large and important part of development on Mac OS X, but it is not the only part. It is important to understand how it fits in to the larger picture. Figure 1.1 gives a simple overview of how the major components of OS X fit together. As a developer, you can choose to ignore most of the details of the layers underneath Cocoa, but understanding them will help you to write better programs.
Darwin
XNU IOKit
Quartz Core Foundation
Foundation
AppKit
Figure 1.1: An overview of the major components of OS X.
1.2.1 Cocoa
Cocoa is the top level of the application developer’s stack. It provides two, layered, frameworks: Foundation and AppKit. The first provides standard data types, such as strings, and collection classes. It also includes a lot of lower level functionality, such as interfaces to the filesystem and the network.
One of the most powerful parts of Foundation is theDistributed Objectsframe- work. This uses the proxy facilities of Objective-C and the serialization capabilities of the Foundation objects to allow different objects running in different processes, or on different machines, to interact as if they were local.
The most important part of the Foundation library is the memory management code. Prior to OpenStep, NeXT code used a C-like allocate and free mechanism, where objects all had an owner and needed to be explicitly freed by this owner.
Simpo PDF Merge and Split Unregistered Version - http://www.simpopdf.com
ptg
10 Chapter 1. Cocoa and Mac OS X
Tracking ownership was a major source of bugs. With OpenStep, objects main- tained an internal reference count, making this a much simpler problem. Reference counting isn’t perfect, and recent versions of Cocoa incorporate a tracing garbage collector. This doesn’t completely eliminate memory management problems—
even some of Apple’s own applications still manage to leak memory—but it can make life a lot easier.
On top of this is the Application Kit, or AppKit for short. This contains all of the GUI-related code and a lot of other things. It is closely integrated with the Foundation framework, and there are lots of examples of classes declared in Foundation and either extended or wrapped in AppKit.
One such example is the run loop. NSRunLoopis a class defined in Foundation that handles a simple loop calling methods on objects in each iteration. In AppKit, this is extended by having the application object handle event delivery for each run-loop iteration.
Cocoa and the iPhone
In 2007, Apple introduced the iPhone and iPod Touch, handheld devices run- ning a cut-down version of OS X. These run the Foundation part of Cocoa, but not AppKit. Instead, they use UIKit, a framework designed for small form- factor devices. UIKit is based on a lot of the same concepts as AppKit, and a lot of AppKit classes have direct analogues in UIKit.
Unlike desktop OS X, iPhone OS X does not need to support legacy appli- cations. It is a completely new platform, and Apple took this opportunity to break a lot of legacy features. You can think of UIKit as a cleaned-up version of AppKit. New additions to AppKit share a lot more in common with UIKit than they do with older parts of AppKit. Because UIKit and AppKit have a large overlapping subset, it is easy for developers familiar with one to move to the other. It is also easy to port code between the two.
A number of the more advanced desktop frameworks are not available on the iPhone version of OS X, and neither are some newer features like garbage collection, which would not run well on low-powered devices like the iPhone.
You can expect future versions of desktop and iPhone OS X to converge. Al- though it is unlikely that they will ever share exactly the same APIs, there is an increasingly large common subset that works on both.
Another case is the NSAttributedString class. This is defined in Foundation as a simple class that stores mappings from ranges in a string to dictionaries of attributes. In Foundation, the attributes are arbitrary key-value pairs. AppKit
Simpo PDF Merge and Split Unregistered Version - http://www.simpopdf.com
ptg
1.2. Understanding Cocoa’s Role in Mac OS X 11
extends this by defining meanings for certain keys, such as the “font” key for the typeface of the text range.
A large number of other frameworks are now included under the slightly amor- phous Cocoa brand. Some of these relate to system integration, such as the Address Book framework. This provides support for storing arbitrary data about people, allowing instant messaging, email, and other communication tools to share a single store. Others, such as Core Data, make managing data in your application easier.
The term Cocoa is used to mean two things. The Cocoa framework is a simple wrapper that includes just the Foundation and AppKit frameworks. Often, however, people who say “Cocoa” mean “Objective-C frameworks included with OS X.” There are a huge number of these and an even bigger collection of third-party frameworks available to developers.
1.2.2 Quartz
NeXTSTEP usedDisplay PostScript (DPS) for the display. This made the plat- form very attractive for desktop publishing, since the output to the display and printer were the same.
PostScript is a very complex language. It is Turing-complete, meaning that it can implement any algorithm. This was convenient for printing, since it allowed complex programs to be sent to the printer and generate complex documents. It is less convenient for on-screen drawing. If you send a PostScript program to the printer that infinite loops, you just turn the printer off and on again and resubmit your jobs. If you do the same with a display server, you have a problem. A well-designed Display PostScript implementation can kill widgets that run for too long, but defining “too long” is nontrivial.
Most users didn’t use much of the power of DPS. They used bezier curves and simple drawing commands, but wrote all of the control structures in C or Objective-C. This removed one of the main advantages of a DPS-like environment.
Sun produced a similar PostScript-based windowing system calledNeWS, which competed with the X Windowing System in the early days of graphical UNIX.
The main advantage of NeWS over X was that graphical objects were written in PostScript and sent over the network to the display. When you clicked on a button on an X GUI, it sent a “mouse clicked” event to the remote machine, which then sent X commands back to handle updating the UI. With NeWS, the PostScript UI processed the event, drew the pressed button image, and sent a “button pressed”
event to the remote machine. This enabled much lower-latency remote display.
NeXT, however, never pushed remote display for DPS and so this capability was not exploited.
Simpo PDF Merge and Split Unregistered Version - http://www.simpopdf.com
ptg
12 Chapter 1. Cocoa and Mac OS X
DPS, since it was standard on other UNIX-like platforms at the time (and still is).
They found that it lacked a number of features they needed, such as good support for antialiased fonts, color calibration, and compositing. DPS didn’t support all of these things either, so Apple wrote a new system.
Since no one was using the DPS flow control structures, these were the first to go. Adobe had already created a PostScript-like language without flow con- trol: Portable Document Format (PDF). The new windowing system adopted the PDF drawing model and integrated compositing support from the start and is sometimes referred to as Display PDF. Earlier windowing systems—X11, DPS, NeWS, and the Windows GDI—were all created in an era when RAM was very expensive. Buffering every single window on a display could easily take tens of megabytes of memory. The framebuffer alone for the original NeXT workstations was almost 2MB (1120×832 in 16-bit color), and buffers for overlapping windows could easily have filled the 12MB of RAM the color version included. When OS X was introduced, the target machine had at least 64MB of RAM, 16MB of video RAM, and a similar sized display. This made buffering everything a lot more attractive, since the cost was relatively small and the benefit was reducing the CPU used for redrawing and eliminating tearing.
The first versions of OS X were relatively slow for large graphics updates, since all of the compositing was performed on the CPU. Later versions offloaded this to the GPU that could perform composting operations much faster. The updated version was calledQuartz Extreme.
With 10.4, Apple attempted to offload even more to the GPU. Somewhat ironically, the most expensive graphical operation on OS X is rendering text.
Quartz GL, formerlyQuartz 2D Extreme, accelerated this by rendering each font glyph as an antialiased outline to a texture and then drawing them by compositing these together. This was a lot faster than drawing the bezier curves of each glyph for every character. When it was introduced, Quartz 2D Extreme was less reliable than its predecessor and so was not enabled by default. It can still be enabled by the end user with theQuartz Debug utility, which is included with the developer tools bundle.
It is often said that Quartz uses OpenGL. This is not true. Quartz may use the 3D hardware, but it does so via its own interface. Both OpenGL and Quartz are built on top of the 3D hardware drivers. Quartz windows may contain OpenGL contexts, allowing OpenGL to be used on OS X. The deformations on Windows and compositing operations are all implemented directly by Quartz and the window server.
The programmatic interface to the Quartz is calledCore Graphics. This pro- vides low-level drawing functions following the PDF model. You can use Core Graphics to draw antialiased bezier curves, filled shapes, and composited bitmaps.
These functions are also wrapped by AppKit APIs. The iPhone, which does not
Simpo PDF Merge and Split Unregistered Version - http://www.simpopdf.com
ptg
1.2. Understanding Cocoa’s Role in Mac OS X 13
use AppKit, does have an implementation of Core Graphics, so drawing code using these APIs will work on both systems.
1.2.3 Core Foundation
While Quartz provides the low-level display primitives,Core Foundation provides low-level data manipulation primitives. Both Cocoa and Carbon use Core Foun- dation for a number of things, including representing strings and collections.
When you inspect a Cocoa object in the debugger, you may find that the real object has a CF prefix, instead of (or in addition to) the NS prefix. This is because Core Foundation implements a simple version of the Objective-C object model for C, and some of the common objects are implemented in this way. This allows them to be used easily as C opaque types in Carbon and as Objective-C objects in Cocoa. You can see this when you create an Objective-C string object. This appears in your code as an instance of theNSStringclass, but in the debugger as anNSCFString. A C programmer will create these using macros and will see them as instances of theCFStringopaque type.
The phrasetoll-free bridge is used to describe this arrangement. This means that you can pass the Cocoa and Core Foundation objects to functions or methods that expect the other form, and it will all work without any additional overhead.
There is some deep voodoo used to implement this in practice. Core Foundation objects all have their class pointer set to a value less than 0xFFFF, and the message dispatch functions treat these values as special.
Every Objective-C object is a structure whose first element is a pointer to the class. This is used by the message sending functions to look up the correct method to call. Core Foundation types set this pointer to a value that is invalid as a pointer, allowing their methods to be called by message sending functions or directly. Unlike Objective-C methods, Core Foundation functions take the object, but not the selector (method name), as arguments. This means that some of the more advanced features of Objective-C, such as message forwarding, are unavailable to Core Foundation types.
The Core Foundation functions are used directly from Carbon applications, and Cocoa developers can use equivalent methods. A number of parts of the userland, such as Launchd, make use of Core Foundation. All of the data types that can be serialized in property lists are implemented by Core Foundation, allowing pure C applications to be use property lists created by Cocoa applications.
This is relatively common in OS X, where system dổmons use Core Foundation but the GUIs responsible for their configuration use Cocoa.
In the Core Foundation documentation, you will often see the term “class” used in relation to Core Foundation types. This means roughly the same thing that
Simpo PDF Merge and Split Unregistered Version - http://www.simpopdf.com
ptg
14 Chapter 1. Cocoa and Mac OS X
purely an abstract concept; it has no run-time existence. You can get a pointer to an Objective-C class and inspect it, but a Core Foundation class is identified by an integer and all of its behavior is hard-coded.
CFLite
In addition to the full version of Core Foundation, Apple have released an open source subset of it, called CFLite. This does not implement the toll-free bridging support, since this is a feature of the Apple Objective-C runtime;
however, it does implement all of the C APIs. This means that code that only uses the Core Foundation libraries can be ported to other platforms relatively easily. Note, however, that CFLite is not used by GNUstep, and so it does not aid in porting code that mixes Core Foundation and Cocoa calls, unless the objects used for each are entirely separate.
1.2.4 Darwin
The core of OS X is the Darwin operating system. The original NeXTSTEP system was based on CMU’s Mach operating system, with a single BSD server providing a lot of the services usually associated with a kernel. Mach is a microkernel, a very simple kernel that tries to do as little as possible. There are two ways of using a microkernel. The intended design is to decompose the kernel into a set of different components, calledservers, each providing some aspects of the operating system’s functionality. The other alternative is to run a single server that does everything. The latter approach is used by OS X. For efficiency reasons, the single BSD server runs inside the kernel’s address space, eliminating a lot of the performance problems that plagued early Mach-based systems at the cost of most of the stability advantages that Mach brought over traditional UNIX-like systems.
When Apple acquired NeXT, it began updating the core operating system with code from NetBSD, and later from FreeBSD. The early userland contained a mixture of utilities from NeXTSTEP, FreeBSD, and GNU. A lot of the older NeXTSTEP code has now been removed. The init system and a number of related systems such as cron and inetd were rolled into Launchd, and the NetInfo directory service from NeXTSTEP was replaced with LDAP and a simpler local directory system based on property lists.
The current userland is very similar to a FreeBSD system, with some Apple embellishments. A few changes were made to make Linux users feel more at home, for example, including the GNU shell,bash, rather than a C shell, as the default.
Several other GNU utilities are also included, replacing the FreeBSD equivalents.
Simpo PDF Merge and Split Unregistered Version - http://www.simpopdf.com
ptg
1.2. Understanding Cocoa’s Role in Mac OS X 15
The biggest difference between Darwin and most other open-source UNIX-like systems is that it does not use the GNU binary utilities. While other systems use theExecutable and Linking Format (ELF) for binaries, Darwin uses the Mach- O format. These are roughly equivalent in terms of capabilities. To support this format, Darwin uses its own linker and loader, as well as its own tools for inspecting binaries.
If you are familiar with other UNIX systems, you may be used to usinglddto inspect shared libraries. This does not exist on Darwin; its functionality is sub- sumed into theotoolprogram, which provides a number of options for inspecting binary objects in Mach-O format.
As well as the binary utilities, OS X provides its own C++ standard library and a C library based on the FreeBSD version.
As of 10.5, OS X was certified by The Open Group as compliant with the Single Unix Specification 2003. This means that it has the right to be described as UNIXTM, something Apple was doing already for marketing purposes. Note that the certification is per platform, as well as per version. Technically speaking, OS X on Intel is UNIX, while OS X on PowerPC is not.
Apple has released the core Darwin code as open source under a variety of licenses. Most Apple-original parts are under the Apple Public Source License (APSL), while others are under whatever license their original authors picked.
It is possible to run Darwin as a stand-alone operating system, although this is not a popular choice. Until 10.5 (Darwin 9), kernel performance was markedly inferior to other open source UNIX-like systems. There are also some limitations.
On most other UNIX systems, sound is generated by writing to /dev/dsp or a similar location. On OS X, there is no low-level access to the sound hardware from userspace, all sound goes through Core Audio, which is not open source.
This means sound is not easily supported by Darwin unless you want to go to the trouble of writing a replacement that interfaces to the top layers of the drivers directly. A similar situation exists for 3D graphics.
Note, by the way, that Darwin version numbers correspond directly to version numbers from NeXTSTEP and OPENSTEP. The first release of OS X was Darwin 5, following on from OPENSTEP 4.
1.2.5 XNU
The core of Darwin is the XNU kernel. This is a single-server Mach microkernel with a BSD server derived largely from FreeBSD. The distinction between Mach and BSD in the XNU kernel is largely academic, since they both run in the same process. Mach in XNU fills a similar rˆole to the HAL in Windows. It is a small layer of platform-specific code that is easy to port to new architectures. It handles
Simpo PDF Merge and Split Unregistered Version - http://www.simpopdf.com
ptg
16 Chapter 1. Cocoa and Mac OS X
by the BSD layer. In addition, the BSD layer is responsible for providing a POSIX interface to the Mach functionality.
The Mach microkernel is responsible for handling threads, processes, and inter- process communication. Everything else is implemented by the BSD layer. Some things are shared between the two. Although it is possible to interact with Mach threads directly, most of the time developers choose to use the POSIX functions.
The BSD subsystem is responsible for maintaining UNIX process structures that are implemented in terms of Mach tasks and allowing UNIX-like system calls to control the kernel.
Device drivers for OPENSTEP were written using DriverKit, an Objective- C framework. It is worth noting that, on a platform with a 25MHz Motorola 68040 CPU, Objective-C was regarded as being fast enough for writing device drivers. On OS X, DriverKit was replaced with IOKit. This framework has a similar design, but is implemented in Embedded C++, a very small subset of C++ omitting a large amount of the language to make it suitable for embedded work.