In a traditional textual file, several convention noncompatible exist for stoquer the end of line or the end of paragraph. These various conventions originate in the differences of the computing systems, in particular: Mac, Back /Windows, Unix.

These differences, in particular when they are mixed can pose problems of use with certain software, that can in particular empécher count the lines in a uniform way.

Representation in the form of byte

Various codings are used to mark the returns to the line.

ASCII Unicode EBCDIC* CR 000D 0D 0D 0D LF 000A 0A 25 15 CRLF 000D, 000A 0D, 0A 0D, 25 0D, 15 NEL* 0085 85 15 25 VT 000B 0B 0B 0B FF 000C 0C 0C 0C LS 2028 n/a n/a n/a PS 2029 n/a n/a n/a

Usual returns to the lines

Put aside new functionalities of unicode, and specificities of EDBIC,

The characters used to mark new the line can be noted by: ␍, ␊, ␤.

␍ Carriage Return: Carriage return ␊ Line Feed ␤ New Line

carriage return

See also: Carriage return

The carriage return indicates originally the physical mechanism allowing, on a Typewriter, the carriage to return into on the right obstinate (either, from the point of view of the writer, to return at the beginning of the line).

In Data-processing, the carriage return ( Carriage return or CR) is one of the control characters of the standards of coding of character ASCII and EBCDIC.

On the operating system Mac OS (before Mac OS X), the carriage return marks the fine of a paragraph.

In ASCII, CR is indexed like the character 13 in decimal notation and 0D in hexadecimal notation.

In language C, as in much of other languages which it influenced (, Perl, etc), \ R indicates this character.

CRLF

See also: CRLF

In Data-processing, CRLF , acronym of C arriage R eturn (Carriage return) L innate F eed (Jump of line), is a special character or a sequence of characters which indicates the end of the line of a text. The CRLF is sometimes called Carriage return, because before the existence of the computers, the typewriters made a physical return of the carriage.

CRLF is thus not a character ASCII but the association of 2 characters ASCII, the character 13 follow-up of character 10.

CRLF is often represented by \ R \ N . \ R corresponds to CR (Return) and \ N corresponds to LF (eNd off line).

The CRLF is especially used under Windows, but of other operating systems recognize it. However, certain software has standards which prevent them from recognizing the CRLF. There exist programs which convert the CRLF into another equivalent control character.

LF

See also: Jump of line

In Data-processing, the jump of line (LF, line feed ) is a control character indicating that a line should be jumped. Its code ASCII is 10 (0A in Hexadécimal). The jump of line was in the beginning an order of Imprimante used jointly with the Carriage return ( CR ). After the execution of a CRLF, the print head returns completely on the left and jumps a line, ready to begin a new line of text.

CRLF was adopted thereafter like the end of standard line for the communications network, a choice which, retrospectively, is generally regarded as an error. However, this use was maintained under MS-DOS and its descendant Microsoft Windows and will thus not disappear in a near future. One thus finds in the textual files Windows, into hexadecimal, the chain 0D0A.

Under Unix, a jump of line is more commonly called a new line: on a Operating system based on Unix, a jump of line is interpreted as an instruction having the same effect on a final that CRLF has it on a printer. He was not considered to be necessary to post characters of impression on the computer. The computer programming language C, drawing its origins from Unix, reflects this use: out of C, ' \ n' is the escape character for a new line.

Apple also simplified the CRLF on its operating systems by using CR without LF rather. The operating systems of Apple continued to use the carriage return like end of line until Mac OS X, which is partly based on Unix.

One can sometimes note them by: ␍, ␊, ␤.

The separators unicode

The standard Unicode Standard defines two separators of characters inambigüs: the separator of paragraphs (PS = 2029 bases 16) and the separator of line (LS = 2028 bases 16). In text unicode, the characters PS and LS must be use when ambiguity must be avoided. In the other cases, Unicode defines the treatments to be associated with CR, LF, and CRLF.

In addition, unicode defines certain properties relating to the automatic cutting of the text on various lines.

See too

  • CRLF
  • Carriage return (CR)
  • Jump of line (LF)

  • Fine of ASCII line

References

Random links:Monastery of Aghiou Pavlou | Ellecourt | List homeopathic remedies | Maximilien-Henri of Bavaria | 2nd regiment of zouaves

© 2007-2008 speedlook.com; article text available under the terms of GFDL, from fr.wikipedia.org