CC MODE Version 5 Documentation


Introduction

@macro ccmode CC Mode

Welcome to @ccmode{}. This is a GNU Emacs mode for editing files containing C, C++, Objective-C, Java, and CORBA IDL code. This incarnation of the mode is descendant from `c-mode.el' (also called "Boring Old C Mode" or BOCM :-), and `c++-mode.el' version 2, which I have been maintaining since 1992. @ccmode{} represents a significant milestone in the mode's life. It has been fully merged back with Emacs 19's `c-mode.el'. Also a new, more intuitive and flexible mechanism for controlling indentation has been developed.

@ccmode{} supports the editing of K&R and ANSI C, ARM (1) C++, Objective-C, Java and CORBA's Interface Definition Language files. In this way, you can easily set up consistent coding styles for use in editing all C, C++, Objective-C, Java and IDL programs. @ccmode{} does not handle font-locking (a.k.a. syntax coloring, keyword highlighting) or anything of that nature, for any of these modes. Font-locking is handled by other Emacs packages.

This manual will describe the following:

Note that the name of this package is "@ccmode{}", but there is no top level cc-mode entry point. All of the variables, commands, and functions in @ccmode{} are prefixed with c-<thing>, and c-mode, c++-mode, objc-mode, java-mode, and idl-mode entry points are provided. This file is intended to be a replacement for `c-mode.el' and `c++-mode.el'.

This distribution also contains a file called `cc-compat.el' which should ease your transition from BOCM to @ccmode{}. If you have a BOCM configuration you are really happy with, and want to postpone learning how to configure @ccmode{}, take a look at that file. It maps BOCM configuration variables to @ccmode{}'s new indentation model. It is not actively supported so for the long run, you should learn how to customize @ccmode{} to support your coding style.

A special word of thanks goes to Krishna Padmasola for his work in converting the original `README' file to Texinfo format. I'd also like to thank all the @ccmode{} victims who help enormously during the early beta stages of @ccmode{}'s development.

Getting Connected

If you got this version of @ccmode{} with Emacs or XEmacs, it should work just fine right out of the box. Note however that you may not have the latest @ccmode{} release and may want to upgrade your copy.

If you are upgrading an existing @ccmode{} installation, please see the `README' file for installation details. @ccmode{} may not work with older versions of Emacs or XEmacs. See the @ccmode{} release notes Web pages for the latest information on Emacs version and package compatibility (see section Getting the latest CC Mode release).

Note that @ccmode{ no longer works with Emacs 18!} The `cc-mode-18.el' file is no longer distributed with @ccmode{}. If you haven't upgraded from Emacs 18 by now, you are out of luck.

You can find out what version of @ccmode{} you are using by visiting a C file and entering M-x c-version RET. You should see this message in the echo area:


Using CC Mode version 5.XX

where `XX' is the minor release number.

New Indentation Engine

@ccmode{} has a new indentation engine, providing a simplified, yet flexible and general mechanism for customizing indentation. It separates indentation calculation into two steps: first, @ccmode{} analyzes the line of code being indented to determine the kind of language construct it's looking at, then it applies user defined offsets to the current line based on this analysis.

This section will briefly cover how indentation is calculated in @ccmode{}. It is important to understand the indentation model being used so that you will know how to customize @ccmode{} for your personal coding style.

Syntactic Analysis

The first thing @ccmode{} does when indenting a line of code, is to analyze the line, determining the syntactic component list of the construct on that line. A syntactic component consists of a pair of information (in lisp parlance, a cons cell), where the first part is a syntactic symbol, and the second part is a relative buffer position. Syntactic symbols describe elements of C code (2), e.g. statement, substatement, class-open, class-close, etc. See section Syntactic Symbols, for a complete list of currently recognized syntactic symbols and their semantics. The variable c-offsets-alist also contains the list of currently supported syntactic symbols.

Conceptually, a line of C code is always indented relative to the indentation of some line higher up in the buffer. This is represented by the relative buffer position in the syntactic component.

Here is an example. Suppose we had the following code as the only thing in a c++-mode buffer (3):


  1: void swap( int& a, int& b )
  2: {
  3:     int tmp = a;
  4:     a = b;
  5:     b = tmp;
  6: }

We can use the command C-c C-s (c-show-syntactic-information) to simply report what the syntactic analysis is for the current line. Running this command on line 4 of this example, we'd see in the echo area(4):


((statement . 35))

This tells us that the line is a statement and it is indented relative to buffer position 35, which happens to be the `i' in int on line 3. If you were to move point to line 3 and hit C-c C-s, you would see:


((defun-block-intro . 29))

This indicates that the `int' line is the first statement in a top level function block, and is indented relative to buffer position 29, which is the brace just after the function header.

Here's another example:


  1: int add( int val, int incr, int doit )
  2: {
  3:     if( doit )
  4:         {
  5:             return( val + incr );
  6:         }
  7:     return( val );
  8: }

Hitting C-c C-s on line 4 gives us:


((substatement-open . 46))

which tells us that this is a brace that opens a substatement block. (5)

Syntactic component lists can contain more than one component, and individual syntactic components need not have relative buffer positions. The most common example of this is a line that contains a comment only line.


  1: void draw_list( List<Drawables>& drawables )
  2: {
  3:         // call the virtual draw() method on each element in list
  4:     for( int i=0; i < drawables.count(), ++i )
  5:     {
  6:         drawables[i].draw();
  7:     }
  8: }

Hitting C-c C-s on line 3 of this example gives:


((comment-intro) (defun-block-intro . 46))

and you can see that the syntactic component list contains two syntactic components. Also notice that the first component, `(comment-intro)' has no relative buffer position.

Indentation Calculation

Indentation for a line is calculated using the syntactic component list derived in step 1 above (see section Syntactic Analysis). Each component contributes to the final total indentation of the line in two ways.

First, the syntactic symbols are looked up in the c-offsets-alist variable, which is an association list of syntactic symbols and the offsets to apply for those symbols. These offsets are added to a running total.

Second, if the component has a relative buffer position, @ccmode{} adds the column number of that position to the running total. By adding up the offsets and columns for every syntactic component on the list, the final total indentation for the current line is computed.

Let's use our two code examples above to see how this works. Here is our first example again:


    1: void swap( int& a, int& b )
    2: {
    3:     int tmp = a;
    4:     a = b;
    5:     b = tmp;
    6: }

Let's say point is on line 3 and we hit the TAB key to re-indent the line. Remember that the syntactic component list for that line is:


((defun-block-intro . 29))

@ccmode{} looks up defun-block-intro in the c-offsets-alist variable. Let's say it finds the value `4'; it adds this to the running total (initialized to zero), yielding a running total indentation of 4 spaces.

Next @ccmode{} goes to buffer position 29 and asks for the current column. This brace is in column zero, so @ccmode{} adds `0' to the running total. Since there is only one syntactic component on the list for this line, indentation calculation is complete, and the total indentation for the line is 4 spaces.

Here's another example:


    1: int add( int val, int incr, int doit )
    2: {
    3:     if( doit )
    4:         {
    5:             return( val + incr );
    6:         }
    7:     return( val );
    8: }

If we were to hit TAB on line 4 in the above example, the same basic process is performed, despite the differences in the syntactic component list. Remember that the list for this line is:


((substatement-open . 46))

Here, @ccmode{} first looks up the substatement-open symbol in c-offsets-alist. Let's say it finds the value `4'. This yields a running total of 4. @ccmode{} then goes to buffer position 46, which is the `i' in if on line 3. This character is in the fourth column on that line so adding this to the running total yields an indentation for the line of 8 spaces.

Simple, huh?

Actually, the mode usually just does The Right Thing without you having to think about it in this much detail. But when customizing indentation, it's helpful to understand the general indentation model being used.

As you configure @ccmode{}, you might want to set the variable c-echo-syntactic-information-p to non-nil so that the syntactic component list and calculated offset will always be echoed in the minibuffer when you hit TAB.

Minor Modes

@ccmode{} contains two minor-mode-like features that you should find useful while you enter new C code. The first is called auto-newline mode, and the second is called hungry-delete mode. These minor modes can be toggled on and off independently, and @ccmode{} can be configured so that it starts up with any combination of these minor modes. By default, both of these minor modes are turned off.

The state of the minor modes is always reflected in the minor mode list on the modeline of the @ccmode{} buffer. When auto-newline mode is enabled, you will see `C/a' on the mode line (6). When hungry delete mode is enabled you would see `C/h' and when both modes are enabled, you'd see `C/ah'.

@ccmode{} provides keybindings which allow you to toggle the minor modes on the fly while editing code. To toggle just the auto-newline state, hit C-c C-a (c-toggle-auto-state). When you do this, you should see the `a' indicator either appear or disappear on the modeline. Similarly, to toggle just the hungry-delete state, use C-c C-d (c-toggle-hungry-state), and to toggle both states, use C-c C-t (c-toggle-auto-hungry-state).

To set up the auto-newline and hungry-delete states to your preferred values, you would need to add some lisp to your `.emacs' file that called one of the c-toggle-*-state functions directly. When called programmatically, each function takes a numeric value, where a positive number enables the minor mode, a negative number disables the mode, and zero toggles the current state of the mode.

So for example, if you wanted to enable both auto-newline and hungry-delete for all your C file editing, you could add the following to your `.emacs' file:


(add-hook 'c-mode-common-hook
	  '(lambda () (c-toggle-auto-hungry-state 1)))

Auto-newline insertion

Auto-newline minor mode works by enabling certain electric commands. Electric commands are typically bound to special characters such as the left and right braces, colons, semi-colons, etc., which when typed, perform some magic formatting in addition to inserting the typed character. As a general rule, electric commands are only electric when the following conditions apply:

Hanging Braces

When you type either an open or close brace (i.e. { or }), the electric command c-electric-brace gets run. This command has two electric formatting behaviors. First, it will perform some re-indentation of the line the brace was typed on, and second, it will add various newlines before and/or after the typed brace. Re-indentation occurs automatically whenever the electric behavior is enabled. If the brace ends up on a line other than the one it was typed on, then that line is also re-indented.

The insertion of newlines is controlled by the c-hanging-braces-alist variable. This variable contains a mapping between syntactic symbols related to braces, and a list of places to insert a newline. The syntactic symbols that are useful for this list are: class-open, class-close, defun-open, defun-close, inline-open, inline-close, brace-list-open, brace-list-close, brace-list-intro, brace-list-entry, block-open, block-close, substatement-open, statement-case-open, extern-lang-open, extern-lang-close, namespace-open, and namespace-close. See section Syntactic Symbols, for a more detailed description of these syntactic symbols.

The value associated with each syntactic symbol in this association list is called an ACTION which can be either a function or a list. See section Custom Brace and Colon Hanging, for a more detailed discussion of using a function as a brace hanging ACTION.

When the ACTION is a list, it can contain any combination of the symbols before and after, directing @ccmode{} where to put newlines in relationship to the brace being inserted. Thus, if the list contains only the symbol after, then the brace is said to hang on the right side of the line, as in:


// here, open braces always `hang'
void spam( int i ) {
    if( i == 7 ) {
        dosomething(i);
    }
}

When the list contains both after and before, the braces will appear on a line by themselves, as shown by the close braces in the above example. The list can also be empty, in which case no newlines are added either before or after the brace.

For example, the default value of c-hanging-braces-alist is:


(defvar c-hanging-braces-alist '((brace-list-open)
                                 (substatement-open after)
                                 (block-close . c-snug-do-while)
                                 (extern-lang-open after)))

which says that brace-list-open braces should both hang on the right side, and allow subsequent text to follow on the same line as the brace. Also, substatement-open and extern-lang-open braces should hang on the right side, but subsequent text should follow on the next line. Here, in the block-close entry, you also see an example of using a function as an ACTION.

A word of caution: it is not a good idea to hang top-level construct introducing braces, such as class-open or defun-open. Emacs makes an assumption that such braces will always appear in column zero, hanging such braces can introduce performance problems. See section Performance Issues, for more information.

Hanging Colons

Using a mechanism similar to brace hanging (see section Hanging Braces), colons can also be made to hang using the variable c-hanging-colons-alist. The syntactic symbols appropriate for this assocation list are: case-label, label, access-label, member-init-intro, and inher-intro. Note however that for c-hanging-colons-alist, ACTIONs as functions are not supported. See also section Custom Brace and Colon Hanging for details.

In C++, double-colons are used as a scope operator but because these colons always appear right next to each other, newlines before and after them are controlled by a different mechanism, called clean-ups in @ccmode{}. See section Clean-ups, for details.

Hanging Semi-colons and commas

Semicolons and commas are also electric in @ccmode{}, but since these characters do not correspond directly to syntactic symbols, a different mechanism is used to determine whether newlines should be automatically inserted after these characters. See section Customizing Semi-colons and Commas, for details.

Other electric commands

A few other keys also provide electric behavior. For example # (c-electric-pound) is electric when typed as the first non-whitespace character on a line. In this case, the variable c-electric-pound-behavior is consulted for the electric behavior. This variable takes a list value, although the only element currently defined is alignleft, which tells this command to force the `#' character into column zero. This is useful for entering C preprocessor macro definitions.

Stars and slashes (i.e. * and /, c-electric-star and c-electric-slash respectively) are also electric under certain circumstances. If a star is inserted as the second character of a C style block comment on a comment-only line, then the comment delimiter is indented as defined by c-offsets-alist. A comment-only line is defined as a line which contains only a comment, as in:


void spam( int i ) 
{
        // this is a comment-only line...
    if( i == 7 )                             // but this is not
    {
        dosomething(i);
    }
}

Likewise, if a slash is inserted as the second slash in a C++ style line comment (also only on a comment-only line), then the line is indented as defined by c-offsets-alist.

Less-than and greater-than signs (c-electric-lt-gt) are also electric, but only in C++ mode. Hitting the second of two < or > keys re-indents the line if it is a C++ style stream operator.

Clean-ups

Clean-ups are a mechanism complementary to colon and brace hanging. On the surface, it would seem that clean-ups overlap the functionality provided by the c-hanging-*-alist variables, and similarly, clean-ups are only enabled when auto-newline minor mode is enabled. Clean-ups are used however to adjust code "after-the-fact", i.e. to eliminate some whitespace that is inserted by electric commands, or whitespace that contains intervening constructs.

You can configure @ccmode{}'s clean-ups by setting the variable c-cleanup-list, which is a list of clean-up symbols. By default, @ccmode{} cleans up only the scope-operator construct, which is necessary for proper C++ support. Note that clean-ups are only performed when the construct does not occur within a literal (see section Auto-newline insertion), and when there is nothing but whitespace appearing between the individual components of the construct.

There are currently only five specific constructs that @ccmode{} can clean up, as indicated by these symbols:

Hungry-deletion of whitespace

Hungry deletion of whitespace, or as it more commonly called, hungry-delete mode, is a simple feature that some people find extremely useful. In fact, you might find yourself wanting hungry-delete in all your editing modes!

In a nutshell, when hungry-delete mode is enabled, hitting the Backspace key(9) will consume all preceding whitespace, including newlines and tabs. This can really cut down on the number of Backspace's you have to type if, for example you made a mistake on the preceding line.

By default, when you hit the Backspace key @ccmode{} runs the command c-electric-backspace, which deletes text in the backwards direction. When deleting a single character, or when Backspace is hit in a literal (see section Auto-newline insertion), or when hungry-delete mode is disabled, the function contained in the c-backspace-function variable is called with one argument (the number of characters to delete). This variable is set to backward-delete-char-untabify by default.

Similarly, hitting the Delete key runs the command c-electric-delete. When deleting a single character, or when Delete is hit in a literal, or when hungry-delete mode is disabled, the function contained in the c-delete-function variable is called with one argument (the number of characters to delete). This variable is set to delete-char by default.

However, if delete-key-deletes-forward is nil, or your Emacs does not support separation of Backspace and DEL, then c-electric-delete simply calls c-electric-backspace.

Auto-fill mode interaction

One other note about minor modes is worth mentioning here. CC Mode now works much better with auto-fill mode (a standard Emacs minor mode) by correctly auto-filling both line (e.g. C++ style) and block (e.g. C style) oriented comments. When auto-fill-mode is enabled, line oriented comments will also be auto-filled by inserting a newline at the line break, and inserting `//' at the start of the next line.

When auto-filling block oriented comments, the behavior is dependent on the value of the variable c-comment-continuation-stars. When this variable is nil, the old behavior for auto-filling C comments is in effect. In this case, the line is broken by closing the comment and starting a new comment on the next line.

If you set c-comment-continuation-stars to a string, then a long C block comment line is broken by inserting a newline at the line break position, and inserting this string at the beginning of the next comment line. The default value for c-comment-continuation-stars is `* ' (a star followed by a single space)(10).

Commands

Indentation Commands

Various commands are provided which allow you to conveniently re-indent C constructs. There are several things to note about these indentation commands. First, when you change your programming style, either interactively or through some other means, your file does not automatically get re-indented. When you change style parameters, you will typically need to reformat the line, expression, or buffer to see the effects of your changes.

Second, changing some variables have no effect on existing code, even when you do re-indent. For example, the c-hanging-* variables and c-cleanup-list only affect new code as it is typed in on-the-fly, so changing c-hanging-braces-alist and re-indenting the buffer will not adjust placement of braces already in the file.

Third, re-indenting large portions of code is currently rather inefficient. Improvements have been made since previous releases of @ccmode{}, and much more radical improvements are planned, but for now you need to be aware of this (11). Some provision has been made to at least inform you as to the progress of the re-indentation. The variable c-progress-interval controls how often a progress message is displayed. Set this variable to nil to inhibit progress messages, including messages normally printed when indentation is started and completed.

Also, except as noted below, re-indentation is always driven by the same mechanisms that control on-the-fly indentation of code. See section New Indentation Engine, for details.

To indent a single line of code, use TAB (c-indent-command). The behavior of this command is controlled by the variable c-tab-always-indent. When this variable is t, TAB always just indents the current line. When nil, the line is indented only if point is at the left margin, or on or before the first non-whitespace character on the line, otherwise something else happens(12). If the value of c-tab-always-indent is something other than t or nil (e.g. 'other), then a real tab character(13) is inserted only when point is inside a literal (see section Auto-newline insertion), otherwise the line is indented.

To indent an entire balanced brace or parenthesis expression, use M-C-q (c-indent-exp). Note that point should be on the opening brace or parenthesis of the expression you want to indent.

Another very convenient keystroke is C-c C-q (c-indent-defun) when re-indents the entire top-level function or class definition that encompasses point. It leaves point at the same position within the buffer.

To indent any arbitrary region of code, use M-C-\ (indent-region). This is a standard Emacs command, specially tailored for C code in a @ccmode{} buffer. Note that of course, point and mark must delineate the region you want to indent.

While not strictly an indentation function, M-C-h (c-mark-function) is useful for marking the current top-level function or class definition as the current region.

Other Commands

@ccmode{} contains other useful command for moving around in C code.

M-x c-beginning-of-defun
Moves point back to the least-enclosing brace. This function is analogous to the Emacs built-in command beginning-of-defun, except it eliminates the constraint that the top-level opening brace must be in column zero. See beginning-of-defun for more information. Depending on the coding style being used, you might prefer c-beginning-of-defun to beginning-of-defun. If so, consider binding C-M-a to the former instead. For backwards compatibility reasons, the default binding remains in effect.
M-x c-end-of-defun
Moves point to the end of the current top-level definition. This function is analogous to the Emacs built-in command end-of-defun, except it eliminates the constraint that the top-level opening brace of the defun must be in column zero. See beginning-of-defun for more information. Depending on the coding style being used, you might prefer c-end-of-defun to end-of-defun. If so, consider binding C-M-e to the former instead. For backwards compatibility reasons, the default binding remains in effect.
C-c C-u (c-up-conditional)
Move point back to the containing preprocessor conditional, leaving the mark behind. A prefix argument acts as a repeat count. With a negative argument, move point forward to the end of the containing preprocessor conditional. When going backwards, #elif is treated like #else followed by #if. When going forwards, #elif is ignored.
C-c C-p (c-backward-conditional)
Move point back over a preprocessor conditional, leaving the mark behind. A prefix argument acts as a repeat count. With a negative argument, move forward.
C-c C-n (c-forward-conditional)
Move point forward across a preprocessor conditional, leaving the mark behind. A prefix argument acts as a repeat count. With a negative argument, move backward.
M-a (c-beginning-of-statement)
Move point to the beginning of the innermost C statement. If point is already at the beginning of a statement, it moves to the beginning of the closest preceding statement, even if that means moving into a block (you can use M-C-b to move over a balanced block). With prefix argument n, move back n - 1 statements. If point is within a comment, or next to a comment, this command moves by sentences instead of statements. When called from a program, this function takes three optional arguments: the numeric prefix argument, a buffer position limit (used as a starting point for syntactic parsing and as a limit for backward movement), and a flag to indicate whether movement should be by statements (if nil) or sentence (if non-nil).
M-e (c-end-of-statement)
Move point to the end of the innermost C statement. If point is at the end of a statement, move to the end of the next statement, even if it's inside a nested block (use M-C-f to move to the other side of the block). With prefix argument n, move forward n - 1 statements. If point is within a comment, or next to a comment, this command moves by sentences instead of statements. When called from a program, this function takes three optional arguments: the numeric prefix argument, a buffer position limit (used as a starting point for syntactic parsing and as a limit for backward movement), and a flag to indicate whether movement should be by statements (if nil) or sentence (if non-nil).
M-x c-forward-into-nomenclature
A popular programming style, especially for object-oriented languages such as C++ is to write symbols in a mixed case format, where the first letter of each word is capitalized, and not separated by underscores. E.g. `SymbolsWithMixedCaseAndNoUnderlines'. This command moves point forward to next capitalized word. With prefix argument n, move n times.
M-x c-backward-into-nomenclature
Move point backward to beginning of the next capitalized word. With prefix argument n, move n times. If n is negative, move forward.
C-c : (c-scope-operator)
In C++, it is also sometimes desirable to insert the double-colon scope operator without performing the electric behavior of colon insertion. C-c : does just this.
M-q (fill-paragraph)
The command is used to fill a block style (C) or line style (C++) comment, in much the same way that text in the various text modes can be filled(14). You should never attempt to fill non-comment code sections; you'll end up with garbage! Two variables control how C style block comments are filled, specifically how the comment start and end delimiters are handled. The variable c-hanging-comment-starter-p controls whether comment start delimiters which appear on a line by themselves, end up on a line by themselves after the fill. When the value is nil, the comment starter will remain on its own line(15). Otherwise, text on the next line will be put on the same line as the comment starter. This is called hanging because the following text hangs on the line with the comment starter(16) The variable c-hanging-comment-ender-p controls the analogous behavior for the block comment end delimiter. When the value is nil, the comment ender will remain on its own line after the file(17). Otherwise, the comment end delimiter will be placed at the end of the previous line.

Customizing Indentation

The variable c-offsets-alist contains the mappings between syntactic symbols and the offsets to apply for those symbols. You should never modify this variable directly though. Use the function c-set-offset instead (see below for details).

The c-offsets-alist variable is where you customize all your indentations. You simply need to decide what additional offset you want to add for every syntactic symbol. You can use the command C-c C-o (c-set-offset) as the way to set offsets, both interactively and from your mode hook. Also, you can set up styles of indentatio. Most likely, you'll find one of the pre-defined styles will suit your needs, but if not, this section will describe how to set up basic editing configurations. See section Styles, for an explanation of how to set up named styles.

As mentioned previously, the variable c-offsets-alist is an association list of syntactic symbols and the offsets to be applied for those symbols. In fact, these offset values can be any of an integer, a function or lambda expression, a variable name, or one of the following symbols: +, -, ++, --, *, or /. These symbols describe offset in multiples of the value of the variable c-basic-offset. By defining a style's indentation in terms of this fundamental variable, you can change the amount of whitespace given to an indentation level while leaving the same relationship between levels. Here are the values that the special symbols correspond to:

+
c-basic-offset times 1
-
c-basic-offset times -1
++
c-basic-offset times 2
--
c-basic-offset times -2
*
c-basic-offset times 0.5
/
c-basic-offset times -0.5

So, for example, because most of the default offsets are defined in terms of +, -, and 0, if you like the general indentation style, but you use 4 spaces instead of 2 spaces per level, you can probably achieve your style just by changing c-basic-offset like so (in your `.emacs' file):


(setq c-basic-offset 4)

This would change


int add( int val, int incr, int doit )
{
  if( doit )
    {
      return( val + incr );
    }
  return( val );
}

to


int add( int val, int incr, int doit )
{
    if( doit )
        {
            return( val + incr );
        }
    return( val );
}

To change indentation styles more radically, you will want to change the value associated with the syntactic symbols in the c-offsets-alist variable. First, I'll show you how to do that interactively, then I'll describe how to make changes to your `.emacs' file so that your changes are more permanent.

Interactive Customization

As an example of how to customize indentation, let's change the style of this example(18):


1: int add( int val, int incr, int doit )
2: {
3:   if( doit )
4:     {
5:       return( val + incr );
6:     }
7:   return( val );
8: }

to:


1: int add( int val, int incr, int doit )
2: {
3:   if( doit )
4:   {
5:     return( val + incr );
6:   }
7:   return( val );
8: }

In other words, we want to change the indentation of braces that open a block following a condition so that the braces line up under the conditional, instead of being indented. Notice that the construct we want to change starts on line 4. To change the indentation of a line, we need to see which syntactic components affect the offset calculations for that line. Hitting C-c C-s on line 4 yields:


((substatement-open . 44))

so we know that to change the offset of the open brace, we need to change the indentation for the substatement-open syntactic symbol. To do this interactively, just hit C-c C-o (c-set-offset). This prompts you for the syntactic symbol to change, providing a reasonable default. In this case, the default is substatement-open, which is just the syntactic symbol we want to change!

After you hit return, @ccmode{} will then prompt you for the new offset value, with the old value as the default. The default in this case is `+', but we want no extra indentation so enter `0' and RET. This will associate the offset 0 with the syntactic symbol substatement-open in the c-offsets-alist variable.

To check your changes quickly, just hit C-c C-q (c-indent-defun) to reindent the entire function. The example should now look like:


1: int add( int val, int incr, int doit )
2: {
3:   if( doit )
4:   {
5:     return( val + incr );
6:   }
7:   return( val );
8: }

Notice how just changing the open brace offset on line 4 is all we needed to do. Since the other affected lines are indented relative to line 4, they are automatically indented the way you'd expect. For more complicated examples, this may not always work. The general approach to take is to always start adjusting offsets for lines higher up in the file, then re-indent and see if any following lines need further adjustments.

Permanent Customization

To make your changes permanent, you need to add some lisp code to your `.emacs' file, but first you need to decide whether your styles should be global in every buffer, or local to each specific buffer.

If you edit primarily one style of code, you may want to make the @ccmode{} style variables have global values so that every buffer will share the style settings. This will allow you to set the @ccmode{} variables at the top level of your `.emacs' file, and is the way @ccmode{} works by default.

If you edit many different styles of code at the same time, you might want to make the @ccmode{} style variables have buffer local values. If you do this, then you will need to set any @ccmode{} style variables in a hook function (e.g. off of c-mode-common-hook instead of at the top level of your `.emacs' file). The recommended way to do this is to set the variable c-style-variables-are-local-p to t before @ccmode{} is loaded into your Emacs session.

@ccmode{} provides several hooks that you can use to customize the mode according to your coding style. Each language mode has its own hook, adhering to standard Emacs major mode conventions. There is also one general hook and one package initialization hook:

The language hooks get run as the last thing when you enter that language mode. The c-mode-common-hook is run by all supported modes before the language specific hook, and thus can contain customizations that are common across all languages. Most of the examples in this section will assume you are using the common hook(19).

Here's a simplified example of what you can add to your `.emacs' file to make the changes described in the previous section (section Interactive Customization) more permanent. See the Emacs manuals for more information on customizing Emacs via hooks. See section Sample .emacs file, for a more complete sample `.emacs' file.


(defun my-c-mode-common-hook ()
  ;; my customizations for all of c-mode and related modes
  (c-set-offset 'substatement-open 0)
  ;; other customizations can go here
  )
(add-hook 'c-mode-common-hook 'my-c-mode-common-hook)

For complex customizations, you will probably want to set up a style that groups all your customizations under a single name.

Styles

Most people only need to edit code formatted in just a few well-defined and consistent styles. For example, their organization might impose a "blessed" style that all its programmers must conform to. Similarly, people who work on GNU software will have to use the GNU coding style on C code. Some shops are more lenient, allowing a variety of coding styles, and as programmers come and go, there could be a number of styles in use. For this reason, @ccmode{} makes it convenient for you to set up logical groupings of customizations called styles, associate a single name for any particular style, and pretty easily start editing new or existing code using these styles.

Built-in Styles

If you're lucky, one of @ccmode{}'s built-in styles might be just what you're looking for. These include:

If you'd like to experiment with these built-in styles you can simply type the following in a @ccmode{} buffer:


C-c . STYLE-NAME RET

C-c . runs the command c-set-style. Note that all style names are case insensitive, even the ones you define.

Setting a style in this way does not automatically re-indent your file. For commands that you can use to view the effect of your changes, see section Commands.

Once you find a built-in style you like, you can make the change permanent by adding some lisp to your `.emacs' file. Let's say for example that you want to use the `ellemtel' style in all your files. You would add this:


(defun my-c-mode-common-hook ()
  ;; use Ellemtel style for all C like languages
  (c-set-style "ellemtel")
  ;; other customizations can go here
  )
(add-hook 'c-mode-common-hook 'my-c-mode-common-hook)

Note that for BOCM compatibility, `gnu' is the default style, and any non-style based customizations you make (i.e. in c-mode-common-hook in your `.emacs' file) will be based on `gnu' style unless you do a c-set-style as the first thing in your hook. The variable c-indentation-style always contains the buffer's current style name, as a string.

Adding Styles

If none of the built-in styles is appropriate, you'll probably want to add a new style definition. Styles are kept in the c-style-alist variable, but you should never modify this variable directly. Instead, @ccmode{} provides the function c-add-style that you can use to easily add new styles or change existing styles. This function takes two arguments, a stylename string, and an association list description of style customizations. If stylename is not already in c-style-alist, the new style is added, otherwise the style is changed to the new description. This function also takes an optional third argument, which if non-nil, automatically applies the new style to the current buffer.

The sample `.emacs' file provides a concrete example of how a new style can be added and automatically set. See section Sample .emacs file.

File Styles

The Emacs manual describes how you can customize certain variables on a per-file basis by including a Local Variable block at the end of the file. So far, you've only seen a functional interface to @ccmode{} customization, which is highly inconvenient for use in a Local Variable block. @ccmode{} provides two variables that make it easier for you to customize your style on a per-file basis. It works via the standard Emacs hook variable hack-local-variables-hook.

The variable c-file-style can be set to a style name string. When the file is visited, @ccmode{} will automatically set the file's style to this style using c-set-style.

Another variable, c-file-offsets, takes an association list similar to what is allowed in c-offsets-alist. When the file is visited, @ccmode{} will automatically institute these offets using c-set-offset.

Note that file style settings (i.e. c-file-style) are applied before file offset settings (i.e. c-file-offsets). Also, if either of these are set in a file's local variable section, all the style variable values are made local to that buffer.

Advanced Customizations

For most users, @ccmode{} will support their coding styles with very little need for more advanced customizations. Usually, one of the standard styles defined in c-style-alist will do the trick. At most, perhaps one of the syntactic symbol offsets will need to be tweaked slightly, or maybe c-basic-offset will need to be changed. However, some styles require a more flexible framework for customization, and one of the real strengths of @ccmode{} is that the syntactic analysis model provides just such a framework. This allows you to implement custom indentation calculations for situations not handled by the mode directly.

Note that the style controlling variables can either have global values, or can be buffer local (e.g. different in every buffer). If all the C files you edit tend to have the same style, you might want to keep the variables global. If you tend to edit files with many different styles, you will have to make the variables buffer local. The variable c-style-variables-are-local-p controls this.

When c-style-variables-are-local-p is non-nil, then the style variables will have a different settable value for each buffer, otherwise all buffers will share the same values. By default, its value is nil (i.e. global values). You must set this variable before @ccmode{} is loaded into your Emacs session, and once the variables are made buffer local, they cannot be made global again (unless you restart Emacs of course!)

Custom Indentation Functions

The most flexible way to customize @ccmode{} is by writing @dfn{custom indentation functions} and associating them with specific syntactic symbols (see section Syntactic Symbols). @ccmode{} itself uses custom indentation functions to provide more sophisticated indentation, for example when lining up C++ stream operator blocks:


1: void main(int argc, char**)
2: {
3:   cout << "There were "
4:     << argc
5:     << "arguments passed to the program"
6:     << endl;
7: }

In this example, lines 4 through 6 are assigned the stream-op syntactic symbol. Here, stream-op has an offset of +, and with a c-basic-offset of 2, you can see that lines 4 through 6 are simply indented two spaces to the right of line 3. But perhaps we'd like @ccmode{} to be a little more intelligent so that it aligns all the `<<' symbols in lines 3 through 6. To do this, we have to write a custom indentation function which finds the column of first stream operator on the first line of the statement. Here is sample lisp code implementing this:


(defun c-lineup-streamop (langelem)
  ;; lineup stream operators
  (save-excursion
    (let* ((relpos (cdr langelem))
           (curcol (progn (goto-char relpos)
                          (current-column))))
      (re-search-forward "<<\\|>>" (c-point 'eol) 'move)
      (goto-char (match-beginning 0))
      (- (current-column) curcol))))

Custom indent functions take a single argument, which is a syntactic component cons cell (see section Syntactic Analysis). The function returns an integer offset value that will be added to the running total indentation for the line. Note that what actually gets returned is the difference between the column that the first stream operator is on, and the column of the buffer relative position passed in the function's argument. Remember that @ccmode{} automatically adds in the column of the component's relative buffer position and we don't the column offset added in twice.

Now, to associate the function c-lineup-streamop with the stream-op syntactic symbol, we can add something like the following to our c++-mode-hook(22):


(c-set-offset 'stream-op 'c-lineup-streamop)

Now the function looks like this after re-indenting (using C-c C-q):


1: void main(int argc, char**)
2: {
3:   cout << "There were "
4:        << argc
5:        << "arguments passed to the program"
6:        << endl;
7: }

Custom indentation functions can be as simple or as complex as you like, and any syntactic symbol that appears in c-offsets-alist can have a custom indentation function associated with it. @ccmode{} comes with several standard custom indentation functions, not all of which are used by the default styles.

Custom Brace and Colon Hanging

Syntactic symbols aren't the only place where you can customize @ccmode{} with the lisp equivalent of callback functions. Brace "hanginess" can also be determined by custom functions associated with syntactic symbols on the c-hanging-braces-alist variable. Remember that ACTION's are typically a list containing some combination of the symbols before and after (see section Hanging Braces). However, an ACTION can also be a function which gets called when a brace matching that syntactic symbol is entered.

These ACTION functions are called with two arguments: the syntactic symbol for the brace, and the buffer position at which the brace was inserted. The ACTION function is expected to return a list containing some combination of before and after. The function can also return nil. This return value has the normal brace hanging semantics.

As an example, @ccmode{} itself uses this feature to dynamically determine the hanginess of braces which close "do-while" constructs:


void do_list( int count, char** atleast_one_string )
{
    int i=0;
    do {
        handle_string( atleast_one_string[i] );
        i++;
    } while( i < count );
}

@ccmode{} assigns the block-close syntactic symbol to the brace that closes the do construct, and normally we'd like the line that follows a block-close brace to begin on a separate line. However, with "do-while" constructs, we want the while clause to follow the closing brace. To do this, we associate the block-close symbol with the ACTION function c-snug-do-while:


(defun c-snug-do-while (syntax pos)
  "Dynamically calculate brace hanginess for do-while statements.
Using this function, `while' clauses that end a `do-while' block will
remain on the same line as the brace that closes that block.

See `c-hanging-braces-alist' for how to utilize this function as an
ACTION associated with `block-close' syntax."
  (save-excursion
    (let (langelem)
      (if (and (eq syntax 'block-close)
               (setq langelem (assq 'block-close c-syntactic-context))
               (progn (goto-char (cdr langelem))
                      (if (= (following-char) ?{)
                          (forward-sexp -1))
                      (looking-at "\\<do\\>[^_]")))
          '(before)
        '(before after)))))

This function simply looks to see if the brace closes a "do-while" clause and if so, returns the list `(before)' indicating that a newline should be inserted before the brace, but not after it. In all other cases, it returns the list `(before after)' so that the brace appears on a line by itself.

During the call to the brace hanging ACTION function, the variable c-syntactic-context is bound to the full syntactic analysis list.

Note that for symmetry, colon hanginess should be customizable by allowing function symbols as ACTIONs on the c-hanging-colon-alist variable. Since no use has actually been found for this feature, it isn't currently implemented!

Customizing Semi-colons and Commas

You can also customize the insertion of newlines after semi-colons and commas, when the auto-newline minor mode is enabled (see section Minor Modes). This is controlled by the variable c-hanging-semi&comma-criteria, which contains a list of functions that are called in the order they appear. Each function is called with zero arguments, and is expected to return one of the following values:

If every function in the list is called without a determination being made, then no newline is added. The default value for this variable is a list containing a single function which inserts newlines only after semi-colons which do not appear inside parenthesis lists (i.e. those that separate for-clause statements).

Here's an example of a criteria function, provided by @ccmode{}, that will prevent newlines from being inserted after semicolons when there is a non-blank following line. Otherwise, it makes no determination. To use, add this to the front of the c-hanging-semi&comma-criteria list.


(defun c-semi&comma-no-newlines-before-nonblanks ()
  (save-excursion
    (if (and (eq last-command-char ?\;)
             (zerop (forward-line 1))
             (not (looking-at "^[ \t]*$")))
        'stop
      nil)))

The default value of c-hanging-semi&comma-criteria is a list containing just the function c-semi&comma-inside-parenlist, which suppresses newlines after semicolons inside parenthesis lists (e.g. for-loops). In addition to c-semi&comma-no-newlines-before-nonblanks described above, @ccmode{} also comes with the criteria function c-semi&comma-no-newlines-for-oneline-inliners, which suppresses newlines after semicolons inside one-line inline method definitions (i.e. in C++ or Java).

Other Special Indentations

In `gnu' style (see section Built-in Styles), a minimum indentation is imposed on lines inside top-level constructs. This minimum indentation is controlled by the variable c-label-minimum-indentation. The default value for this variable is 1.

One other customization variable is available in @ccmode{}: c-special-indent-hook. This is a standard hook variable that is called after every line is indented by @ccmode{}. You can use it to do any special indentation or line adjustments your style dictates, such as adding extra indentation to constructors or destructor declarations in a class definition, etc. Note however, that you should not change point or mark inside your c-special-indent-hook functions (i.e. you'll probably want to wrap your function in a save-excursion).

Setting c-special-indent-hook in your style definition is handled slightly differently than other variables. In your style definition, you should set the value for c-special-indent-hook to a function or list of functions, which will be appended to c-special-indent-hook using add-hook. That way, the current setting for the buffer local value of c-special-indent-hook won't be overridden.

Normally, the standard Emacs command M-; (indent-for-comment) will indent comment only lines to comment-column. Some users however, prefer that M-; act just like TAB for purposes of indenting comment-only lines; i.e. they want the comments to always indent as they would for normal code, regardless of whether TAB or M-; were used. This behavior is controlled by the variable c-indent-comments-syntactically-p. When nil (the default), M-; indents comment-only lines to comment-column, otherwise, they are indented just as they would be if TAB were typed.

Syntactic Symbols

Here is a complete list of the recognized syntactic symbols as described in the c-offsets-alist variable, along with a brief description. More detailed descriptions follow below.

Most syntactic symbol names follow a general naming convention. When a line begins with an open or close brace, the syntactic symbol will contain the suffix -open or -close respectively.

Usually, a distinction is made between the first line that introduces a construct and lines that continue a construct, and the syntactic symbols that represent these lines will contain the suffix -intro or -cont respectively. As a sub-classification of this scheme, a line which is the first of a particular brace block construct will contain the suffix -block-intro.

Let's look at some examples to understand how this works. Remember that you can check the syntax of any line by using C-c C-s.


  1: void
  2: swap( int& a, int& b )
  3: {
  4:     int tmp = a;
  5:     a = b;
  6:     b = tmp;
  7:     int ignored =
  8:         a + b;
  9: }

Line 1 shows a topmost-intro since it is the first line that introduces a top-level construct. Line 2 is a continuation of the top-level construct introduction so it has the syntax topmost-intro-cont. Line 3 shows a defun-open since it is the brace that opens a top-level function definition. Line 9 is a defun-close since it contains the brace that closes the top-level function definition. Line 4 is a defun-block-intro, i.e. it is the first line of a brace-block, enclosed in a top-level function definition.

Lines 5, 6, and 7 are all given statement syntax since there isn't much special about them. Note however that line 8 is given statement-cont syntax since it continues the statement begun on the previous line.

Here's another example, which illustrates some C++ class syntactic symbols:


   1: class Bass
   2:     : public Guitar,
   3:       public Amplifiable
   4: {
   5: public:
   6:     Bass()
   7:         : eString( new BassString( 0.105 )),
   8:           aString( new BassString( 0.085 )),
   9:           dString( new BassString( 0.065 )),
  10:           gString( new BassString( 0.045 ))
  11:     {
  12:         eString.tune( 'E' );
  13:         aString.tune( 'A' );
  14:         dString.tune( 'D' );
  15:         gString.tune( 'G' );
  16:     }
  17:     friend class Luthier;
  18: }

As in the previous example, line 1 has the topmost-intro syntax. Here however, the brace that opens a C++ class definition on line 4 is assigned the class-open syntax. Note that in C++, classes, structs, and unions are essentially equivalent syntactically (and are very similar semantically), so replacing the class keyword in the example above with struct or union would still result in a syntax of class-open for line 4 (24). Similarly, line 18 is assigned class-close syntax.

Line 2 introduces the inheritance list for the class so it is assigned the inher-intro syntax, and line 3, which continues the inheritance list is given inher-cont syntax.

Hitting C-c C-s on line 5 shows the following analysis:


((inclass . 1) (access-label . 67))

The primary syntactic symbol for this line is access-label as this a label keyword that specifies access protection in C++. However, because this line is also a top-level construct inside a class definition, the analysis actually shows two syntactic symbols. The other syntactic symbol assigned to this line is inclass. Similarly, line 6 is given both inclass and topmost-intro syntax:


((inclass . 58) (topmost-intro . 60))

Line 7 introduces a C++ member initialization list and as such is given member-init-intro syntax. Note that in this case it is not assigned inclass since this is not considered a top-level construct. Lines 8 through 10 are all assigned member-init-cont since they continue the member initialization list started on line 7.

Line 11's analysis is a bit more complicated:


((inclass . 1) (inline-open))

This line is assigned a syntax of both inline-open and inclass because it opens an in-class C++ inline method definition. This is distinct from, but related to, the C++ notion of an inline function in that its definition occurs inside an enclosing class definition, which in C++ implies that the function should be inlined. If though, the definition of the Bass constructor appeared outside the class definition, the construct would be given the defun-open syntax, even if the keyword inline appeared before the method name, as in:


class Bass
    : public Guitar,
      public Amplifiable
{
public:
    Bass();
}

inline
Bass::Bass()
    : eString( new BassString( 0.105 )),
      aString( new BassString( 0.085 )),
      dString( new BassString( 0.065 )),
      gString( new BassString( 0.045 ))
{
    eString.tune( 'E' );
    aString.tune( 'A' );
    dString.tune( 'D' );
    gString.tune( 'G' );
}

Returning to the previous example, line 16 is given inline-close syntax, while line 12 is given defun-block-open syntax, and lines 13 through 15 are all given statement syntax. Line 17 is interesting in that its syntactic analysis list contains three elements:


((friend) (inclass . 58) (topmost-intro . 380))

The friend syntactic symbol is a modifier that typically does not have a relative buffer position.

Template definitions introduce yet another syntactic symbol:


   1: ThingManager <int,
   2:    Framework::Callback *,
   3:    Mutex> framework_callbacks;

Here, line 1 is analyzed as a topmost-intro, but lines 2 and 3 are both analyzed as template-args-cont lines.

Here is another (totally contrived) example which illustrates how syntax is assigned to various conditional constructs:


   1: void spam( int index )
   2: {
   3:     for( int i=0; i<index; i++ )
   4:     {
   5:         if( i == 10 )
   6:         {
   7:             do_something_special();
   8:         }
   9:         else
  10:             do_something( i );
  11:     }
  12:     do {
  13:         another_thing( i-- );
  14:     }
  15:     while( i > 0 );
  16: }

Only the lines that illustrate new syntactic symbols will be discussed.

Line 4 has a brace which opens a conditional's substatement block. It is thus assigned substatement-open syntax, and since line 5 is the first line in the substatement block, it is assigned substatement-block-intro syntax. Lines 6 and 7 are assigned similar syntax. Line 8 contains the brace that closes the inner substatement block. It is given the syntax block-close, as are lines 11 and 14.

Line 9 is a little different -- since it contains the keyword else matching the if statement introduced on line 5, it is given the else-clause syntax. Note also that line 10 is slightly different too. Because else is considered a conditional introducing keyword (25), and because the following substatement is not a brace block, line 10 is assigned the substatement syntax.

One other difference is seen on line 15. The while construct that closes a do conditional is given the special syntax do-while-closure if it appears on a line by itself. Note that if the while appeared on the same line as the preceding close brace, that line would have been assigned block-close syntax instead.

Switch statements have their own set of syntactic symbols. Here's an example:


   1: void spam( enum Ingredient i )
   2: {
   3:     switch( i ) {
   4:     case Ham:
   5:         be_a_pig();
   6:         break;
   7:     case Salt:
   8:         drink_some_water();
   9:         break;
  10:     default:
  11:         {
  12:             what_is_it();
  13:             break;
  14:         }
  15:     }
  14: }

Here, lines 4, 7, and 10 are all assigned case-label syntax, while lines 5 and 8 are assigned statement-case-intro. Line 11 is treated slightly differently since it contains a brace that opens a block -- it is given statement-case-open syntax.

There are a set of syntactic symbols that are used to recognize constructs inside of brace lists. A brace list is defined as an enum or aggregate initializer list, such as might statically initialize an array of structs. For example:


  1: static char* ingredients[] =
  2: {
  3:     "Ham",
  4:     "Salt",
  5:     NULL
  6: }

Following convention, line 2 in this example is assigned brace-list-open syntax, and line 3 is assigned brace-list-intro syntax. Likewise, line 6 is assigned brace-list-close syntax. Lines 4 and 5 however, are assigned brace-list-entry syntax, as would all subsequent lines in this initializer list.

External language definition blocks also have their own syntactic symbols. In this example:


   1: extern "C" 
   2: {
   3:     int thing_one( int );
   4:     int thing_two( double );
   5: }

line 2 is given the extern-lang-open syntax, while line 5 is given the extern-lang-close syntax. The analysis for line 3 yields: ((inextern-lang) (topmost-intro . 14)), where inextern-lang is a modifier similar in purpose to inclass.

Similarly, C++ namespace constructs have their own associated syntactic symbols. In this example:


   1: namespace foo
   2: {
   3:     void xxx() {}
   4: }

line 2 is given the namespace-open syntax, while line 4 is given the namespace-close syntax. The analysis for line 3 yields: ((innamespace) (topmost-intro . 17)), where innamespace is a modifier similar in purpose to inextern-lang and inclass.

A number of syntactic symbols are associated with parenthesis lists, a.k.a argument lists, as found in function declarations and function calls. This example illustrates these:


   1: void a_function( int line1,
   2:                  int line2 );
   3: 
   4: void a_longer_function(
   5:     int line1,
   6:     int line2
   7:     );
   8: 
   9: void call_them( int line1, int line2 )
  10: {
  11:     a_function(
  12:         line1,
  13:         line2
  14:         );
  15: 
  16:     a_longer_function( line1,
  17:                        line2 );
  18: }

Lines 5 and 12 are assigned arglist-intro syntax since they are the first line following the open parenthesis, and lines 7 and 14 are assigned arglist-close syntax since they contain the parenthesis that closes the argument list.

Lines that continue argument lists can be assigned one of two syntactic symbols. For example, Lines 2 and 17 are assigned arglist-cont-nonempty syntax. What this means is that they continue an argument list, but that the line containing the parenthesis that opens the list is not empty following the open parenthesis. Contrast this against lines 6 and 13 which are assigned arglist-cont syntax. This is because the parenthesis that opens their argument lists is the last character on that line.

Note that there is no arglist-open syntax. This is because any parenthesis that opens an argument list, appearing on a separate line, is assigned the statement-cont syntax instead.

A few miscellaneous syntactic symbols that haven't been previously covered are illustrated by this C++ example:


   1: void Bass::play( int volume )
   2: const
   3: {
   4:     /* this line starts a multi-line
   5:      * comment.  This line should get `c' syntax */
   6: 
   7:     char* a_multiline_string = "This line starts a multi-line \
   8: string.  This line should get `string' syntax.";
   9: 
  10:   note:
  11:     {
  12: #ifdef LOCK
  13:         Lock acquire();
  14: #endif // LOCK
  15:         slap_pop();
  16:         cout << "I played "
  17:              << "a note\n";
  18:     }
  19: }

The lines to note in this example include:

Multi-line C preprocessor macros are now (somewhat) supported. At least CC Mode now recognizes the fact that it is inside a multi-line macro, and it properly skips such macros as syntactic whitespace. In this example:


   1: #define LIST_LOOP(cons, listp)                           \
   2:   for (cons = listp; !NILP (cons); cons = XCDR (cons))   \
   3:      if (!CONSP (cons))                                  \
   4:        signal_error ("Invalid list format", listp);      \
   5:      else

line 1 is given the syntactic symbol cpp-macro. This first line of a macro is always given this symbol. The second and subsequent lines (e.g. lines 2 through 5) are given the cpp-macro-cont syntactic symbol, with a relative buffer position pointing to the # which starts the macro definition.

In Objective-C buffers, there are three additional syntactic symbols assigned to various message calling constructs. Here's an example illustrating these:


  1: - (void)setDelegate:anObject
  2:           withStuff:stuff
  3: {
  4:     [delegate masterWillRebind:self
  5:               toDelegate:anObject
  6:               withExtraStuff:stuff];
  7: }

Here, line 1 is assigned objc-method-intro syntax, and line 2 is assigned objc-method-args-cont syntax. Lines 5 and 6 are both assigned objc-method-call-cont syntax.

Two other syntactic symbols can appear in old style, non-prototyped C code (26):


  1: int add_three_integers(a, b, c)
  2:      int a;
  3:      int b;
  4:      int c;
  5: {
  6:     return a + b + c;
  7: }

Here, line 2 is the first line in an argument declaration list and so is given the knr-argdecl-intro syntactic symbol. Subsequent lines (i.e. lines 3 and 4 in this example), are given knr-argdecl syntax.

Performance Issues

C and its derivative languages are highly complex creatures. Often, ambiguous code situations arise that require @ccmode{} to scan large portions of the buffer to determine syntactic context. Such pathological code(27) can cause @ccmode{} to perform fairly badly. This section identifies some of the coding styles to watch out for, and suggests some workarounds that you can use to improve performance.

Because @ccmode{} has to scan the buffer backwards from the current insertion point, and because C's syntax is fairly difficult to parse in the backwards direction, @ccmode{} often tries to find the nearest position higher up in the buffer from which to begin a forward scan. The farther this position is from the current insertion point, the slower the mode gets. Some coding styles can even force @ccmode{} to scan from the beginning of the buffer for every line of code!

One of the simplest things you can do to reduce scan time, is make sure any brace that opens a top-level construct(28) always appears in the leftmost column. This is actually an Emacs constraint, as embodied in the beginning-of-defun function which @ccmode{} uses heavily. If you insist on hanging top-level open braces on the right side of the line, then you might want to set the variable defun-prompt-regexp to something reasonable (29), however that "something reasonable" is difficult to define, so @ccmode{} doesn't do it for you.

A special note about defun-prompt-regexp in Java mode: while much of the early sample Java code seems to encourage a style where the brace that opens a class is hung on the right side of the line, this is not a good style to pursue in Emacs. @ccmode{} comes with a variable c-Java-defun-prompt-regexp which tries to define a regular expression usable for this style, but there are problems with it. In some cases it can cause beginning-of-defun to hang(30). For this reason, it is not used by default, but if you feel adventurous, you can set defun-prompt-regexp to it in your mode hook. In any event, setting and rely on defun-prompt-regexp will definitely slow things down!

You will probably notice pathological behavior from @ccmode{} when working in files containing large amounts of C preprocessor macros. This is because Emacs cannot skip backwards over these lines as quickly as it can comment.

Previous versions of @ccmode{} had potential performance problems when recognizing K&R style function argument declarations. This was because there are ambiguities in the C syntax when K&R style argument lists are used(31). @ccmode{} has adopted BOCM's convention for limiting the search: it assumes that argdecls are indented at least one space, and that the function headers are not indented at all. With current versions of @ccmode{}, user customization of c-recognize-knr-p is deprecated. Just don't put argdecls in column zero!

You might want to investigate the speed-ups contained in the file `cc-lobotomy.el', which comes as part of the @ccmode{} distribution, but is completely unsupported. As mentioned previous, @ccmode{} always trades speed for accuracy, however it is recognized that sometimes you need speed and can sacrifice some accuracy in indentation. The file `cc-lobotomy.el' contains hacks that will "dumb down" @ccmode{} in some specific ways, making that trade-off of accurancy for speed. I won't go into details of its use here; you should read the comments at the top of the file, and look at the variable cc-lobotomy-pith-list for details.

Frequently Asked Questions

Q. How do I re-indent the whole file?

A. Visit the file and hit C-x h to mark the whole buffer. Then hit ESC C-\.

Q. How do I re-indent the entire function? ESC C-x doesn't work.

A. ESC C-x is reserved for future Emacs use. To re-indent the entire function hit C-c C-q.

Q. How do I re-indent the current block?

A. First move to the brace which opens the block with ESC C-u, then re-indent that expression with ESC C-q.

Q. Why doesn't the RET key indent the line to where the new text should go after inserting the newline?

A. Emacs' convention is that RET just adds a newline, and that C-j adds a newline and indents it. You can make RET do this too by adding this to your c-mode-common-hook (see the sample `.emacs' file section Sample .emacs file):


(define-key c-mode-base-map "\C-m" 'newline-and-indent)

This is a very common question. If you want this to be the default behavior, don't lobby me, lobby RMS! :-)

Q. I put (c-set-offset 'substatement-open 0) in my `.emacs' file but I get an error saying that c-set-offset's function definition is void.

A. This means that @ccmode{} wasn't loaded into your Emacs session by the time the c-set-offset call was reached, mostly likely because @ccmode{} is being autoloaded. Instead of putting the c-set-offset line in your top-level `.emacs' file, put it in your c-mode-common-hook, or simply add the following to the top of your `.emacs' file:


(require 'cc-mode)

See the sample `.emacs' file section Sample .emacs file for details.

Q. How do I make strings, comments, keywords, and other constructs appear in different colors, or in bold face, etc.?

A. "Syntax Colorization" is a standard Emacs feature, controlled by font-lock-mode. It is not part of @ccmode{}.

Q. M-a and M-e used to move over entire balanced brace lists, but now they move into blocks. How do I get the old behavior back?

A. Use C-M-f and C-M-b to move over balanced brace blocks. Use M-a and M-e to move by statements, which will move into blocks.

Getting the latest CC Mode release

@ccmode{} is now standard with the latest versions of Emacs 19 and XEmacs 19. It is also the standard for Emacs 20 and XEmacs 20. You would typically just use the version that comes with your X/Emacs. These may be slightly out of date due to release schedule skew, so you should always check the canonical site for the latest version.


    World Wide Web:

        http://www.python.org/ftp/emacs/

    Anonymous FTP:

        ftp://ftp.python.org/pub/emacs/

There are many files under these directories; you can pick up the entire distribution (named cc-mode.tar.gz; a gzip'd tar file), or any of the individual files, including PostScript documentation.

If you do not have World Wide Web, or anonymous ftp access, you can get the distribution through an anonymous ftp-to-mail gateway, such as the one run by DEC at:


ftpmail@decwrl.dec.com

To get @ccmode{} via email, send the following message in the body of your mail to that address:


reply <a valid net address back to you>
connect ftp.python.org
binary
uuencode
chdir pub/emacs
get cc-mode.tar.gz

or just send the message "help" for more information on ftpmail. Response times will vary with the number of requests in the queue. I am in no way connected to this service, so I make no claims or guarantees about its availability!

Sample .emacs file

;; Here's a sample .emacs file that might help you along the way.  Just
;; copy this region and paste it into your .emacs file.  You may want to
;; change some of the actual values.

(defconst my-c-style
  '((c-tab-always-indent        . t)
    (c-comment-only-line-offset . 4)
    (c-hanging-braces-alist     . ((substatement-open after)
                                   (brace-list-open)))
    (c-hanging-colons-alist     . ((member-init-intro before)
                                   (inher-intro)
                                   (case-label after)
                                   (label after)
                                   (access-label after)))
    (c-cleanup-list             . (scope-operator
                                   empty-defun-braces
                                   defun-close-semi))
    (c-offsets-alist            . ((arglist-close . c-lineup-arglist)
                                   (substatement-open . 0)
                                   (case-label        . 4)
                                   (block-open        . 0)
                                   (knr-argdecl-intro . -)))
    (c-echo-syntactic-information-p . t)
    )
  "My C Programming Style")

;; Customizations for all of c-mode, c++-mode, and objc-mode
(defun my-c-mode-common-hook ()
  ;; add my personal style and set it for the current buffer
  (c-add-style "PERSONAL" my-c-style t)
  ;; offset customizations not in my-c-style
  (c-set-offset 'member-init-intro '++)
  ;; other customizations
  (setq tab-width 8
        ;; this will make sure spaces are used instead of tabs
        indent-tabs-mode nil)
  ;; we like auto-newline and hungry-delete
  (c-toggle-auto-hungry-state 1)
  ;; keybindings for all supported languages.  We can put these in
  ;; c-mode-base-map because c-mode-map, c++-mode-map, objc-mode-map,
  ;; java-mode-map, and idl-mode-map inherit from it.
  (define-key c-mode-base-map "\C-m" 'newline-and-indent)
  )

(add-hook 'c-mode-common-hook 'my-c-mode-common-hook)

Limitations and Known Bugs

Mailing Lists and Submitting Bug Reports

To report bugs, use the C-c C-b (c-submit-bug-report) command. This provides vital information I need to reproduce your problem. Make sure you include a concise, but complete code example. Please try to boil your example down to just the essential code needed to reproduce the problem, and include an exact recipe of steps needed to expose the bug. Be especially sure to include any code that appears before your bug example, if you think it might affect my ability to reproduce it.

Bug reports are now sent to the following email addresses: cc-mode-help@python.org and bug-gnu-emacs@gnu.org; the latter is mirrored on the Usenet newsgroup gnu.emacs.bug. You can send other questions and suggestions (kudos? ;-) to cc-mode-help@python.org, or help-gnu-emacs@gnu.org which is mirrored on newsgroup gnu.emacs.help.

If you want to get announcements of new CC Mode releases, send the word subscribe in the body of a message to cc-mode-announce-request@python.org. Announcements will also be posted to the Usenet newsgroup gnu.emacs.sources. Note that the cc-mode-victims@python.org mailing list was recently decommissioned.

Concept Index

Jump to: - - . - a - b - c - d - e - f - g - h - i - j - k - l - m - n - o - p - r - s - t - u - w

-

  • -block-intro syntactic symbols
  • -close syntactic symbols
  • -cont syntactic symbols
  • -intro syntactic symbols
  • -open syntactic symbols
  • .

  • .emacs file
  • a

  • access-label syntactic symbol
  • Adding Styles
  • Advanced Customizations
  • announcement mailing list
  • arglist-close syntactic symbol
  • arglist-cont syntactic symbol
  • arglist-cont-nonempty syntactic symbol
  • arglist-intro syntactic symbol
  • Auto-fill mode interaction
  • Auto-newline insertion
  • b

  • basic-offset (c-)
  • beta testers mailing list
  • block-close syntactic symbol, block-close syntactic symbol
  • block-open syntactic symbol, block-open syntactic symbol
  • BOCM
  • brace lists
  • brace-list-close syntactic symbol, brace-list-close syntactic symbol
  • brace-list-entry syntactic symbol, brace-list-entry syntactic symbol
  • brace-list-intro syntactic symbol, brace-list-intro syntactic symbol
  • brace-list-open syntactic symbol, brace-list-open syntactic symbol
  • BSD style
  • Built-in Styles
  • c

  • c syntactic symbol
  • c-basic-offset
  • c-hanging- functions
  • c-indent-exp
  • c-set-offset
  • case-label syntactic symbol
  • `cc-compat.el' file
  • `cc-lobotomy.el' file
  • `cc-mode-18.el' file
  • class-close syntactic symbol, class-close syntactic symbol
  • class-open syntactic symbol, class-open syntactic symbol
  • Clean-ups, Clean-ups
  • Commands
  • comment-intro syntactic symbol
  • comment-only line, comment-only line
  • cpp-macro syntactic symbol
  • cpp-macro-cont syntactic symbol
  • Custom Brace and Colon Hanging
  • Custom Indentation Functions, Custom Indentation Functions, Custom Indentation Functions
  • customizing brace hanging
  • customizing colon hanging
  • Customizing Indentation
  • Customizing Semi-colons and Commas, Customizing Semi-colons and Commas, Customizing Semi-colons and Commas
  • d

  • defun-block-intro syntactic symbol
  • defun-close syntactic symbol, defun-close syntactic symbol
  • defun-open syntactic symbol, defun-open syntactic symbol
  • do-while-closure syntactic symbol
  • e

  • electric characters
  • electric commands
  • Ellemtel style
  • else-clause syntactic symbol
  • extern-lang-close syntactic symbol, extern-lang-close syntactic symbol
  • extern-lang-open syntactic symbol, extern-lang-open syntactic symbol
  • f

  • File Styles
  • Frequently Asked Questions
  • friend syntactic symbol
  • func-decl-cont syntactic symbol
  • g

  • Getting Connected
  • Getting the latest CC Mode release
  • GNU style
  • h

  • Hanging Braces
  • Hanging Colons
  • Hanging Semi-colons and commas
  • hooks
  • Hungry-deletion of whitespace
  • i

  • in-class inline methods
  • inclass syntactic symbol, inclass syntactic symbol
  • indent-exp (c-)
  • Indentation Calculation
  • Indentation Commands
  • inextern-lang syntactic symbol
  • inher-cont syntactic symbol
  • inher-intro syntactic symbol
  • inline-close syntactic symbol, inline-close syntactic symbol
  • inline-open syntactic symbol, inline-open syntactic symbol
  • innamespace syntactic-symbol
  • Interactive Customization
  • Introduction
  • j

  • Java style
  • java-mode
  • k

  • K&R style
  • knr-argdecl syntactic symbol
  • knr-argdecl-intro syntactic symbol
  • l

  • label syntactic symbol
  • Limitations and Known Bugs
  • Linux style
  • literal, literal, literal, literal
  • local variables
  • m

  • Mailing Lists and Submitting Bug Reports
  • member-init-cont syntactic symbol
  • member-init-intro syntactic symbol
  • Minor Modes
  • modifier syntactic symbol
  • multi-line macros
  • n

  • namespace-close symbol
  • namespace-close syntactic-symbol
  • namespace-open symbol
  • namespace-open syntactic-symbol
  • New Indentation Engine
  • o

  • objc-method-args-cont syntactic symbol
  • objc-method-call-cont syntactic symbol
  • objc-method-intro syntactic symbol
  • Other Commands
  • Other electric commands
  • p

  • Performance Issues
  • Permanent Customization
  • Python style
  • r

  • relative buffer position, relative buffer position
  • s

  • Sample .emacs file
  • set-offset (c-)
  • statement syntactic symbol
  • statement-case-intro syntactic symbol
  • statement-case-open syntactic symbol, statement-case-open syntactic symbol
  • statement-cont syntactic symbol
  • stream-op syntactic symbol, stream-op syntactic symbol
  • string syntactic symbol
  • Stroustrup style
  • Styles
  • substatement
  • substatement syntactic symbol
  • substatement-block-intro syntactic symbol
  • substatement-open syntactic symbol, substatement-open syntactic symbol
  • substatment block
  • Syntactic Analysis
  • syntactic component
  • syntactic component list
  • syntactic symbol
  • Syntactic Symbols
  • syntactic whitespace, syntactic whitespace, syntactic whitespace
  • t

  • TAB
  • topmost-intro syntactic symbol
  • topmost-intro-cont syntactic symbol
  • u

  • User style
  • w

  • Whitesmith style
  • Command Index

    Since all @ccmode{} commands are prepended with the string `c-', each appears under its c-<thing> name and its <thing> (c-) name. Jump to: a - b - c - d - e - f - h - i - j - l - m - n - o - s - t - u - v

    a

  • add-style (c-)
  • b

  • backward-conditional (c-)
  • backward-delete-char-untabify
  • backward-into-nomenclature (c-)
  • beginning-of-defun, beginning-of-defun
  • beginning-of-defun (c-)
  • beginning-of-statement (c-)
  • c

  • c++-mode
  • c-add-style
  • c-backward-conditional
  • c-backward-into-nomenclature
  • c-beginning-of-defun
  • c-beginning-of-statement
  • c-electric-backspace
  • c-electric-brace
  • c-electric-delete
  • c-electric-lt-gt
  • c-electric-pound
  • c-electric-slash
  • c-electric-star
  • c-end-of-defun
  • c-end-of-statement
  • c-forward-conditional
  • c-forward-into-nomenclature
  • c-hanging-braces-alist
  • c-indent-command
  • c-indent-defun, c-indent-defun
  • c-indent-exp
  • c-indent-one-line-block
  • c-lineup-arglist
  • c-lineup-arglist-close-under-paren
  • c-lineup-arglist-intro-after-paren
  • c-lineup-C-comments
  • c-lineup-close-paren
  • c-lineup-comment
  • c-lineup-dont-change
  • c-lineup-math
  • c-lineup-multi-inher
  • c-lineup-ObjC-method-args
  • c-lineup-ObjC-method-args-2
  • c-lineup-ObjC-method-call
  • c-lineup-runin-statements
  • c-lineup-streamop, c-lineup-streamop
  • c-mark-function
  • c-mode
  • c-scope-operator
  • c-semi&comma-inside-parenlist
  • c-semi&comma-no-newlines-before-nonblanks
  • c-semi&comma-no-newlines-for-oneline-inliners
  • c-set-offset, c-set-offset
  • c-set-style
  • c-show-syntactic-information
  • c-snug-do-while
  • c-submit-bug-report
  • c-toggle-auto-hungry-state
  • c-toggle-auto-state
  • c-toggle-hungry-state
  • c-up-conditional
  • c-version
  • d

  • defun-prompt-regexp
  • delete-char
  • e

  • electric-backspace (c-)
  • electric-brace (c-)
  • electric-delete (c-)
  • electric-lt-gt (c-)
  • electric-pound (c-)
  • electric-slash (c-)
  • electric-star (c-)
  • end-of-defun
  • end-of-defun (c-)
  • end-of-statement (c-)
  • f

  • fill-paragraph
  • forward-conditional (c-)
  • forward-into-nomenclature (c-)
  • h

  • hanging-braces-alist (c-)
  • i

  • idl-mode
  • indent-command (c-)
  • indent-defun (c-), indent-defun (c-)
  • indent-exp (c-)
  • indent-for-comment
  • indent-one-line-block (c-)
  • indent-region
  • j

  • java-mode
  • l

  • lineup-arglist (c-)
  • lineup-arglist-close-under-paren (c-)
  • lineup-arglist-intro-after-paren (c-)
  • lineup-C-comments (c-)
  • lineup-close-paren (c-)
  • lineup-comment (c-)
  • lineup-dont-change (c-)
  • lineup-math (c-)
  • lineup-multi-inher (c-)
  • lineup-ObjC-method-args (c-)
  • lineup-ObjC-method-args-2 (c-)
  • lineup-ObjC-method-call (c-)
  • lineup-runin-statements (c-)
  • lineup-streamop (c-), lineup-streamop (c-)
  • m

  • mark-function (c-)
  • n

  • newline-and-indent
  • o

  • objc-mode
  • s

  • scope-operator (c-)
  • semi&comma-inside-parenlist (c-)
  • semi&comma-no-newlines-before-nonblanks (c-)
  • semi&comma-no-newlines-for-oneline-inliners (c-)
  • set-offset (c-), set-offset (c-)
  • set-style (c-)
  • show-syntactic-information (c-)
  • snug-do-while (c-)
  • submit-bug-report (c-)
  • t

  • tab-to-tab-stop
  • toggle-auto-hungry-state (c-)
  • toggle-auto-state (c-)
  • toggle-hungry-state (c-)
  • u

  • up-conditional (c-)
  • v

  • version (c-)
  • Key Index

    Jump to: # - < - > - b - c - d - e - m - r - t

    #

  • #
  • <

  • <
  • >

  • >
  • b

  • Backspace
  • c

  • C-c .
  • C-c :
  • C-c C-a
  • C-c C-b
  • C-c C-d
  • C-c C-n
  • C-c C-o
  • C-c C-p
  • C-c C-q, C-c C-q, C-c C-q, C-c C-q
  • C-c C-s, C-c C-s
  • C-c C-t
  • C-c C-u
  • C-j
  • C-u
  • C-x h
  • d

  • DEL
  • e

  • ESC a
  • ESC C-\
  • ESC C-q
  • ESC C-u
  • ESC C-x
  • ESC e
  • ESC q
  • m

  • M-;
  • M-C-\
  • M-C-h
  • M-C-q
  • r

  • RET
  • t

  • TAB, TAB
  • Variable Index

    Since all @ccmode{} variables are prepended with the string `c-', each appears under its c-<thing> name and its <thing> (c-) name. Jump to: b - c - d - e - f - h - i - j - l - m - o - p - r - s - t

    b

  • backspace-function (c-)
  • basic-offset (c-)
  • c

  • c++-mode-hook
  • c-backspace-function
  • c-basic-offset
  • c-cleanup-list
  • c-comment-continuation-stars
  • c-comment-only-line-offset
  • c-default-style
  • c-delete-function
  • c-echo-syntactic-information-p
  • c-electric-pound-behavior
  • c-file-offsets
  • c-file-style
  • c-hanging-braces-alist, c-hanging-braces-alist
  • c-hanging-colon-alist
  • c-hanging-colons-alist
  • c-hanging-comment-ender-p
  • c-hanging-comment-starter-p
  • c-hanging-semi&comma-criteria
  • c-indent-comments-syntactically-p
  • c-indentation-style
  • c-initialization-hook
  • c-insert-tab-function
  • c-Java-defun-prompt-regexp
  • c-label-minimum-indentation
  • c-mode-common-hook, c-mode-common-hook
  • c-mode-hook
  • c-offsets-alist, c-offsets-alist, c-offsets-alist, c-offsets-alist, c-offsets-alist, c-offsets-alist, c-offsets-alist, c-offsets-alist
  • c-progress-interval
  • c-recognize-knr-p
  • c-special-indent-hook
  • c-style-alist, c-style-alist
  • c-style-variables-are-local-p, c-style-variables-are-local-p, c-style-variables-are-local-p
  • c-syntactic-context
  • c-tab-always-indent
  • cc-lobotomy-pith-list
  • cleanup-list (c-)
  • comment-column
  • comment-continuation-stars (c-)
  • comment-line-break-function
  • comment-only-line-offset (c-)
  • d

  • default-style (c-)
  • delete-function (c-)
  • delete-key-deletes-forward
  • e

  • echo-syntactic-information-p (c-)
  • electric-pound-behavior (c-)
  • f

  • file-offsets (c-)
  • file-style (c-)
  • h

  • hanging-braces-alist (c-), hanging-braces-alist (c-)
  • hanging-colon-alist (c-)
  • hanging-colons-alist (c-)
  • hanging-comment-ender-p (c-)
  • hanging-comment-starter-p (c-)
  • hanging-semi&comma-criteria (c-)
  • i

  • idl-mode-hook
  • indent-comments-syntactically-p (c-)
  • indent-tabs-mode
  • indentation-style (c-)
  • initialization-hook (c-)
  • insert-tab-function (c-)
  • j

  • Java-defun-prompt-regexp (c-)
  • java-mode-hook
  • l

  • label-minimum-indentation (c-)
  • m

  • mode-common-hook (c-)
  • o

  • objc-mode-hook
  • offsets-alist (c-), offsets-alist (c-), offsets-alist (c-), offsets-alist (c-), offsets-alist (c-), offsets-alist (c-), offsets-alist (c-), offsets-alist (c-)
  • p

  • progress-interval (c-)
  • r

  • recognize-knr-p (c-)
  • s

  • special-indent-hook (c-)
  • style-alist (c-), style-alist (c-)
  • style-variables-are-local-p
  • style-variables-are-local-p (c-), style-variables-are-local-p (c-)
  • syntactic-context (c-)
  • t

  • tab-always-indent (c-)

  • This document was generated on 6 November 2000 using texi2html 1.56k.