Misplaced Pages

List (abstract data type): Difference between revisions

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
Browse history interactively← Previous editNext edit →Content deleted Content addedVisualWikitext
Revision as of 13:50, 2 August 2009 editJeffrey Mall (talk | contribs)Pending changes reviewers16,035 editsm Reverted edits by 117.242.24.128 to last revision by Paddy3118 (HG)← Previous edit Revision as of 23:13, 2 August 2009 edit undoJorge Stolfi (talk | contribs)Autopatrolled, Extended confirmed users, Rollbackers27,608 edits Repositioned image since it shows an implementation of a list, not an abstract list. Fixed caption. Typos and tweaks.Next edit →
Line 1: Line 1:
In ], a '''list''' or '''sequence''' is an ] that implements an ordered collection of ], where the same value may occur more than once. It is a computer representation of the ] concept of a finite ], that is, a ]. Each instance of a value in the list is usually called an '''item''', '''entry''', or '''element''' of the list; if the same value occurs multiple times, each occurrence is considered a distinct item.
]

In ], a '''list''' or '''sequence''' is an ] that implements an ordered collection of ], where the same value may occur more than once. It is a computer representation of the ] concept of a finite ]. Each instance of a value is often called an '''item''', '''entry''', or '''element''' of the list; if the same value occurs multiple times, each occurrence is considered a distinct item.


]
The name '''list''' is also used for several concrete ] that can be used to implement abstract lists, especially ]s. The name '''list''' is also used for several concrete ] that can be used to implement abstract lists, especially ]s.


The so-called '''static''' list structures allow only inspection and enumeration of the values. In ''']''' or '''dynamic''' lists, items may be inserted, replaced, or deleted during the list's existence. The so-called '''static''' list structures allow only inspection and enumeration of the values. A ''']''' or '''dynamic''' list may allow items to be inserted, replaced, or deleted during the list's existence.


Many ]s provide support for '''list data types''', and have special syntax and semantics for lists and list operations. Often a list can be constructed by written the items in sequence, separated by ]s, ]s, or ]s, within a pair of delimiters such as ] '()', ], '', ] '{}', or ] '<>'. Some languages may allow list types to be ] or ] like ]s. In ]s, lists are usually provided as ]s of subclasses of a generic "list" class. List data types are often implemented using ]s or linked lists of some sort, but other ] may be more appropriate for some applications. In some contexts, such as in ] programming, the term list may refer specifically to a linked list rather than an array. Many ]s provide support for '''list data types''', and have special syntax and semantics for lists and list operations. Often a list can be constructed by writing the items in sequence, separated by ]s, ]s, or ]s, within a pair of delimiters such as ] '()', ], '', ] '{}', or ] '<>'. Some languages may allow list types to be ] or ] like ]s. In ]s, lists are usually provided as ]s of subclasses of a generic "list" class. List data types are often implemented using ]s or linked lists of some sort, but other ] may be more appropriate for some applications. In some contexts, such as in ] programming, the term list may refer specifically to a linked list rather than an array.


In ] and ], abstract lists are usually defined ] by four operations: ''nil'' that yelds the empty list, ''cons'', which adds an item at the beginning of a list, ''first'', that returns the first element of a list, and ''rest'' that returns a lst minus its first element. Formally, ] can be defined as abstract lists with elements of ]. In ] and ], abstract lists are usually defined ] by four operations: ''nil'' that yelds the empty list, ''cons'', which adds an item at the beginning of a list, ''first'', that returns the first element of a list, and ''rest'' that returns a lst minus its first element. Formally, ] can be defined as abstract lists with elements of ].

Revision as of 23:13, 2 August 2009

In computer science, a list or sequence is an abstract data structure that implements an ordered collection of values, where the same value may occur more than once. It is a computer representation of the mathematical concept of a finite sequence (mathematics), that is, a tuple. Each instance of a value in the list is usually called an item, entry, or element of the list; if the same value occurs multiple times, each occurrence is considered a distinct item.

A singly-linked list structure, implementing a list with 3 integer elements.

The name list is also used for several concrete data structures that can be used to implement abstract lists, especially linked lists.

The so-called static list structures allow only inspection and enumeration of the values. A mutable or dynamic list may allow items to be inserted, replaced, or deleted during the list's existence.

Many programming languages provide support for list data types, and have special syntax and semantics for lists and list operations. Often a list can be constructed by writing the items in sequence, separated by commas, semicolons, or spaces, within a pair of delimiters such as parentheses '()', brackets, '', braces '{}', or angle brackets '<>'. Some languages may allow list types to be indexed or sliced like array types. In object-oriented programming languages, lists are usually provided as instances of subclasses of a generic "list" class. List data types are often implemented using arrays or linked lists of some sort, but other data structures may be more appropriate for some applications. In some contexts, such as in Lisp programming, the term list may refer specifically to a linked list rather than an array.

In type theory and functional programming, abstract lists are usually defined inductively by four operations: nil that yelds the empty list, cons, which adds an item at the beginning of a list, first, that returns the first element of a list, and rest that returns a lst minus its first element. Formally, Peano's natural numbers can be defined as abstract lists with elements of unit type.

Operations

Implementation of the list data structure may provide some of the following operations:

  • a constructor for creating an empty list;
  • an operation for testing whether or not a list is empty;
  • an operation for prepending an entity to a list
  • an operation for determining the first component (or the "head") of a list
  • an operation for referring to the list consisting of all the components of a list except for its first (this is called the "tail" of the list.)

Characteristics

Lists have the following properties:

  • The size of lists. It indicates how many elements there are in the list.
  • Equality of lists:
    • In mathematics, sometimes equality of lists is defined simply in terms of object identity: two lists are equal if and only if they are the same object.
    • In modern programming languages, equality of lists is normally defined in terms of structural equality of the corresponding entries, except that if the lists are typed, then the list types may also be relevant.
  • Lists may be typed. This implies that the entries in a list must have types that are compatible with the list's type. It is common that lists are typed when they are implemented using arrays.
  • Each element in the list has an index. The first element has index 0 (or some other predefined integer). Subsequent elements have indices that are 1 higher than the previous element. The last element has index (initial index) + (size) − 1.
    • It is possible to retrieve the element at a particular index.
    • It is possible to traverse the list in the order of increasing index.
    • It is possible to change the element at a particular index to a different value, without affecting any other elements.
    • It is possible to insert an element at a particular index. The indices of previous elements at that and higher indices are increased by 1.
    • It is possible to remove an element at a particular index. The indices of previous elements at that and higher indices are decreased by 1.

Applications

As the name implies, lists can be used to store a list of records. The items in a list can be sorted or unsorted for the purpose of fast search (binary search for instance) or fast inserting.

Implementations

Lists are typically implemented either as linked lists (either singly or doubly-linked) or as arrays, usually variable length or dynamic arrays.

The standard way of implementing lists, originating with the programming language Lisp, is to have each element of the list contain both its value and a pointer indicating the location of the next element in the list. This results in either a linked list or a tree, depending on whether the list has nested sublists. Some older Lisp implementations (such as the Lisp implementation of the Symbolics 3600) also supported "compressed lists" (using CDR coding) which had a special internal representation (invisible to the user). The Lists can be manipulated using iteration or recursion. The former is often preferred in imperative programming languages, while the latter is the norm in functional languages.

Programming language support

Some languages do not offer a list data structure, but offer the use of associative arrays or some kind of table to emulate lists. For example, Lua provides tables. Although Lua stores lists that have numerical indices as arrays internally, they still appear as hash tables.

In Lisp, lists are the fundamental data type and can represent both program code and data. In most dialects, the list of the first three prime numbers could be written as (list 2 3 5). In several dialects of Lisp, including Scheme, a list is a collection of pairs, consisting of a value and a pointer to the next pair (or null value), making a singly-linked list.

Applications

Because in computing, lists are easier to realize than sets, a finite set in mathematical sense can be realized as a list with additional restrictions, that is, duplicate elements are disallowed and such that order is irrelevant. If the list is sorted, it speeds up determining if a given item is already in the set but in order to ensure the order, it requires more time to add new entry to the list. In efficient implementations, however, sets are implemented using self-balancing binary search trees or hash tables, rather than a list.

Abstract definition

The abstract list type L with elements of some type E is defined by the following functions:

nil: () → L
cons: E × LL
first: LE
rest: LL

with the axioms

first(cons(e,l)) = e
rest(cons(e,l)) = l

for any element e and any list l. It is implicit that

cons(e,l) ≠ l
cons(e,l) ≠ e
cons(e1,l1) = cons(e2,l2) iff e1 = e2 and l1 = e2

Note that first(nil()) and last(nil()) are not defined.

These axioms are equivalent to those of the abstract stack data type.

In type theory, the above definition is more simply regarded as an inductive type defined in terms of constructors: nil and cons. In algebraic terms, this can be represented as the transformation 1 + A × LL. first and rest are then obtained by pattern matching on the cons constructor and separately handling the nil case.

The list monad

In type theory and functional programming, the list type forms a monad with the following functions:

r e t u r n : A A = a c o n s a n i l {\displaystyle \mathrm {return} \colon A\to A^{*}=a\mapsto \mathrm {cons} \,a\,\mathrm {nil} }
f m a p : ( A B ) ( A B ) = f l { n i l if  l = n i l c o n s ( f a ) ( f m a p f l ) if  l = c o n s a l {\displaystyle \mathrm {fmap} \colon (A\to B)\to (A^{*}\to B^{*})=f\mapsto l\mapsto {\begin{cases}\mathrm {nil} &{\mbox{if }}l=\mathrm {nil} \\\mathrm {cons} \,(f\,a)(\mathrm {fmap} f\,l')&{\mbox{if }}l=\mathrm {cons} \,a\,l'\end{cases}}}
j o i n : A A = l { n i l if  l = n i l a p p e n d a ( j o i n l ) if  l = c o n s a l {\displaystyle \mathrm {join} \colon {A^{*}}^{*}\to A^{*}=l\mapsto {\begin{cases}\mathrm {nil} &{\mbox{if }}l=\mathrm {nil} \\\mathrm {append} \,a\,(\mathrm {join} \,l')&{\mbox{if }}l=\mathrm {cons} \,a\,l'\end{cases}}}

where append is defined as:

a p p e n d : A A A = l 1 l 2 { l 2 if  l 1 = n i l c o n s a ( a p p e n d l 1 l 2 ) if  l 1 = c o n s a l 1 {\displaystyle \mathrm {append} \colon A^{*}\to A^{*}\to A^{*}=l_{1}\mapsto l_{2}\mapsto {\begin{cases}l_{2}&{\mbox{if }}l_{1}=\mathrm {nil} \\\mathrm {cons} \,a\,(\mathrm {append} \,l_{1}'\,l_{2})&{\mbox{if }}l_{1}=\mathrm {cons} \,a\,l_{1}'\end{cases}}}

Note that fmap, join and append are well-defined, since they're applied to progressively deeper arguments at each recursive call. The bind function is then simply:

b i n d : A ( A B ) B = l f j o i n ( f m a p f l ) {\displaystyle \mathrm {bind} \colon A^{*}\to (A\to B^{*})\to B^{*}=l\mapsto f\mapsto \mathrm {join} \,(\mathrm {fmap} \,f\,l)}

The list type is an additive monad, with nil as the monadic zero and append as monadic sum.

Lists form a monoid under the append operation. The identity element of the monoid is the empty list, nil. In fact, this is the free monoid of finite sequences over the set of list elements.

See also

Data structures
Types
Abstract
Arrays
Linked
Trees
Graphs
Data types
Uninterpreted
Numeric
Pointer
Text
Composite
Other
Related
topics
Categories: