General information on how the World Wide Web (WWW) works.

The history of the creation and development of the Internet.

The Internet owes its origins to the US Department of Defense, which in 1969 funded research into ways for computer networks to survive warfare through dynamic message rerouting. The first such network was ARPAnet, which connected three computers in California with one in Utah using a set of rules that was the forerunner of the Internet Protocol (IP).

In 1972, access was opened to universities and research organizations, and the network came to unite 50 universities and research organizations that had contracts with the US Department of Defense.

In 1973 the network expanded to an international scale, bringing in networks located in England and Norway. A decade later, the original rules were extended into a suite of communication protocols that supports both local and wide area networks. This is how TCP/IP was born. Shortly thereafter, the National Science Foundation (NSF) opened NSFnet with the goal of linking five supercomputing centers. With the introduction of TCP/IP, the new network soon replaced ARPAnet as the backbone of the Internet.

What made the Internet so popular, and gave the impetus for turning it into an environment for doing business, was the appearance of the World Wide Web (WWW, 3W), a hypertext system that made navigating the Internet fast and intuitive.

The idea of linking documents through hypertext was first proposed and promoted by Ted Nelson in the 1960s, but the computer technology of that time did not allow it to be implemented. Who knows how things would have turned out if the idea had found application then?

The foundations of what we mean by WWW today were laid in the 1980s by Tim Berners-Lee while creating a hypertext system at the European Laboratory for Particle Physics (the European Center for Nuclear Research, CERN).

As a result of this work, in 1990 the scientific community was presented with the first text browser, which allowed hyperlinked text files to be viewed online. This browser was made available to the general public in 1991, but its adoption outside academia was slow.

A new historical stage in the development of the Internet began with the release in 1993 of the first Unix version of the Mosaic graphical browser, developed in 1992 by Marc Andreessen, a student intern at the National Center for Supercomputing Applications (NCSA), USA.

From 1994, after the release of Mosaic versions for the Windows and Macintosh operating systems, soon followed by the Netscape Navigator and Microsoft Internet Explorer browsers, the popularity of the WWW, and consequently of the Internet, grew explosively among the general public, first in the United States and then around the world.

In 1995, the NSF transferred responsibility for the Internet to the private sector, and since then the Internet has existed as we know it today.

Internet services.

Services are the kinds of facilities that servers on the Internet provide.
Over the history of the Internet there have been different types of services: some are no longer used, others are gradually losing popularity, and others are flourishing.
Let's list the services that have not lost their relevance at the moment:
-World Wide Web (WWW) - a service for searching and viewing hypertext documents, including graphics, sound and video.
-E-mail - a service for transferring electronic messages.
-Usenet, News - newsgroups, a kind of network newspaper or bulletin board.
-FTP - a file transfer service.
-ICQ - a service for real-time communication using the keyboard.
-Telnet - a service for remote access to computers.
-Gopher - a service for accessing information using hierarchical directories.

Among these services, one can distinguish those designed for communication, that is, for the transfer of information between people (E-mail, ICQ), and those whose purpose is to store information and give users access to it.

Among the latter, the leading place in terms of the amount of stored information is occupied by the WWW, since it is the most convenient for users and the most technically advanced. Second place is taken by FTP, since whatever interfaces and conveniences are developed for the user, information is still stored in files, and this service provides access to them. Gopher and Telnet can now be considered "dying out": almost no new information appears on their servers, and the number of such servers and their audience is practically not increasing.

World Wide Web (WWW)

The World Wide Web (WWW) is a hypertext, or rather a hypermedia, information system for finding and accessing Internet resources.

Hypertext is an informational structure that allows you to establish semantic connections between elements of text on a computer screen in such a way that you can easily make transitions from one element to another.
In practice, in hypertext, some words are highlighted by underlining or coloring in a different color. Highlighting a word indicates the presence of a connection between this word and some document, in which the topic associated with the highlighted word is considered in more detail.
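
The connection between a highlighted word and the document it leads to can be illustrated with a short sketch. It assumes the hypertext is written in HTML, where a link is an `a` element with an `href` attribute; the sample page and the file names in it are invented for illustration.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collects (highlighted text, target document) pairs from HTML."""

    def __init__(self):
        super().__init__()
        self.links = []       # finished (text, target) pairs
        self._target = None   # href of the link currently being read
        self._text = []       # text pieces seen inside that link

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._target = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._target is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._target is not None:
            self.links.append(("".join(self._text), self._target))
            self._target = None


# Hypothetical page: two highlighted words, each pointing at another document.
SAMPLE_PAGE = (
    '<p>The <a href="arpanet.html">ARPAnet</a> was followed by '
    '<a href="nsfnet.html">NSFnet</a>.</p>'
)


def extract_links(html_text):
    """Return every (highlighted text, target) pair found in the page."""
    collector = LinkCollector()
    collector.feed(html_text)
    collector.close()
    return collector.links
```

Calling `extract_links(SAMPLE_PAGE)` yields one pair per highlighted word, which is the information a browser needs in order to make the transition when the word is selected.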

Hypermedia is what you get if you replace the word "text" with "any kind of information" in the definition of hypertext: sound, graphics, video.
Such hypermedia links are possible because, along with textual information, any other binary information can be linked, for example encoded sound or graphics. So, if a program displays a map of the world and the user selects a continent on this map with the mouse, the program can immediately present graphic, sound and text information about it.

The WWW system is built on a special data transfer protocol called HTTP (HyperText Transfer Protocol).
The entire content of the WWW system consists of WWW pages.
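
To give a feel for what HTTP looks like on the wire, here is a minimal sketch that builds the text of a request for one page. It assumes HTTP/1.1; the function name `build_get_request` and the host used in the usage note are hypothetical, and nothing is actually sent over the network.

```python
def build_get_request(host, path="/"):
    """Return the bytes of a minimal HTTP/1.1 GET request for one page."""
    lines = [
        f"GET {path} HTTP/1.1",  # method, requested resource, protocol version
        f"Host: {host}",         # which site on the server is wanted
        "Connection: close",     # ask the server to close after replying
        "",                      # an empty line ends the header block
        "",
    ]
    # HTTP separates header lines with CR LF.
    return "\r\n".join(lines).encode("ascii")
```

For example, `build_get_request("www.example.org", "/index.html")` produces the request a client would write to a TCP connection opened to the server.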

WWW pages are hypermedia documents of the World Wide Web system. They are created using the HyperText Markup Language (HTML). One WWW page is actually a set of hypermedia documents located on the same server, interwoven with mutual links and related in meaning (for example, containing information about one educational institution or one museum). Each document of the page, in turn, can contain several on-screen pages of text and illustrations. Each WWW page has its own "home page", a hypermedia document containing links to the main constituent parts of the page. Home page addresses are circulated on the Internet as page addresses.

A collection of web pages linked together by links and designed to achieve a single purpose is called a web site.


Email appeared about 30 years ago. Today it is the most widely used means of information exchange on the Internet. The ability to send and receive e-mail is useful not only for communicating with friends in other cities and countries, but also in a business career. For example, when applying for a job, you can quickly send your resume by e-mail to various companies. In addition, many sites that require registration (online games, online stores, etc.) ask you to specify your e-mail address. In short, e-mail is a very useful and convenient thing.

E-mail (electronic mail) is used to transfer text messages within the Internet, as well as between other e-mail networks (Figure 1).

Using e-mail, you can send messages, receive them in your mailbox, reply to your correspondents' letters, send copies of a letter to several recipients at once, forward a received letter to another address, use logical names instead of addresses, create several subsections of the mailbox for different kinds of correspondence, and include in letters various sound and graphic files, as well as binary files such as programs.
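
Two of these capabilities, addressing one letter to several recipients and including a binary file, can be sketched with Python's standard `email` package. The addresses, subject and file contents below are made up for illustration, and the sketch only composes the letter; delivering it would require a mail server.

```python
from email.message import EmailMessage


def compose_letter(sender, recipients, subject, body, attachment=None):
    """Build one message addressed to several recipients at once."""
    message = EmailMessage()
    message["From"] = sender
    message["To"] = ", ".join(recipients)
    message["Subject"] = subject
    message.set_content(body)
    if attachment is not None:
        filename, data = attachment
        # Binary files travel as attachments; the generic MIME type
        # application/octet-stream is used here for arbitrary bytes.
        message.add_attachment(
            data,
            maintype="application",
            subtype="octet-stream",
            filename=filename,
        )
    return message


# Hypothetical letter with two recipients and a small binary attachment.
letter = compose_letter(
    sender="anna@example.org",
    recipients=["boris@example.org", "clara@example.org"],
    subject="Resume",
    body="Please find my resume attached.",
    attachment=("resume.bin", b"\x00\x01\x02"),
)
```

Printing `letter` shows the headers followed by the text part and the encoded attachment, which is the form in which the letter would be handed to a mail server.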

To use e-mail, the computer must be connected to the network, for example to the telephone network via a modem.
A computer connected to the network is considered a potential sender and receiver of packets. Each host on the Internet, when sending a message to another host, splits it into packets of limited length, typically up to 1500 bytes. Each packet is supplied with the recipient's address and the sender's address. Packets prepared in this way are forwarded through communication channels to other nodes. On receiving a packet, a node examines the recipient's address: if it matches the node's own address, the packet is accepted; otherwise it is sent further. Received packets belonging to the same message are accumulated. Once all packets of a message have been received, they are concatenated and delivered to the recipient. Copies of the packets are kept on the sending node until the receiving node confirms successful delivery of the message. This ensures reliability. The process resembles ordinary mail: to send a letter, the sender only needs to know the addressee's address and the location of the nearest mailbox, and on the way to the addressee the letter passes through several post offices (nodes).
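
The splitting and reassembly just described can be sketched as follows. This is a simplified illustration, not how any real protocol stack is implemented: the `index` and `total` fields, the `Packet` and `Node` names and the addresses are assumptions added so that the pieces can be put back in order, and the node tracks only one message per sender at a time.

```python
from dataclasses import dataclass

MAX_PAYLOAD = 1500  # bytes per packet, the typical size mentioned above


@dataclass
class Packet:
    sender: str      # address of the sending node
    recipient: str   # address of the receiving node
    index: int       # position of this piece within the message (assumed field)
    total: int       # number of pieces in the message (assumed field)
    payload: bytes


def split_message(sender, recipient, message, size=MAX_PAYLOAD):
    """Split a message into addressed packets of at most `size` bytes."""
    chunks = [message[i:i + size] for i in range(0, len(message), size)] or [b""]
    return [
        Packet(sender, recipient, index, len(chunks), chunk)
        for index, chunk in enumerate(chunks)
    ]


class Node:
    def __init__(self, address):
        self.address = address
        self._received = {}  # sender address -> {index: payload}

    def receive(self, packet):
        """Accept a packet addressed to this node.

        Returns the whole message once every piece has arrived, and None
        if the packet is for another node (it would be sent further) or
        if more pieces are still expected.
        """
        if packet.recipient != self.address:
            return None
        pieces = self._received.setdefault(packet.sender, {})
        pieces[packet.index] = packet.payload
        if len(pieces) < packet.total:
            return None
        del self._received[packet.sender]
        return b"".join(pieces[i] for i in range(packet.total))
```

Because each packet carries its position, the receiving node can rebuild the message even if the packets arrive out of order. Acknowledgements and retransmission, which the text mentions as the source of reliability, are left out of this sketch.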

FTP service

FTP stands for File Transfer Protocol, but when FTP is considered as an Internet service, what is meant is not just the protocol but the service itself: access to files in file archives.

On UNIX systems, FTP is a standard TCP-based program that is always supplied with the operating system. Its original purpose is to transfer files between different computers on TCP/IP networks: a server program runs on one computer, and on the second the user runs a client program that connects to the server and sends or receives files over FTP (Figure 2).

Figure 2. FTP protocol diagram

FTP is optimized for file transfers. Therefore, FTP programs have become part of a separate Internet service. An FTP server can be configured so that you can connect to it not only under a specific user name but also under the conventional name anonymous. The client then gets access not to the computer's entire file system but to the set of files on the server that makes up the contents of the anonymous FTP server, a public file archive.
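
As an illustration of an anonymous connection, the sketch below uses Python's standard `ftplib` module. The function names and the default directory are assumptions made for this example; `connect_and_list` needs a reachable server, so any host passed to it (for instance a hypothetical `ftp.example.org`) is a placeholder.

```python
from ftplib import FTP


def list_public_archive(ftp, directory="/"):
    """Log in anonymously and return the names found in one directory."""
    ftp.login()         # no arguments: ftplib logs in as user "anonymous"
    ftp.cwd(directory)  # move to the requested directory of the archive
    return ftp.nlst()   # names of the files and directories found there


def connect_and_list(host, directory="/"):
    """Open a control connection to `host` and list a public directory."""
    with FTP(host) as ftp:
        return list_public_archive(ftp, directory)
```

Keeping the login and listing steps in a function that receives the connection object makes it possible to try the logic against a stand-in object without contacting a real server.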

Today, public file archives are organized mainly as anonymous FTP servers. A huge amount of information and software is available on such servers. Almost everything that can be offered to the public as files is available from anonymous FTP servers: programs (freeware and demo versions), multimedia and, finally, plain texts such as laws, books, articles and reports.

Despite its prevalence, FTP has many disadvantages. FTP client programs are not always convenient or easy to use. It is not always possible to tell what kind of file is in front of you, whether it is the file you are looking for or not. There is no simple, universal search facility for anonymous FTP servers; special programs and services exist for this, but they do not always give the desired results.

FTP servers can also provide password-protected access to files, for example for their clients.

TELNET service

The purpose of the TELNET protocol is to provide a fairly general, bidirectional, eight-bit byte-oriented communications facility. Its primary goal is to allow terminal devices and terminal-oriented processes to communicate with each other. It is envisioned that the protocol may also be used for terminal-to-terminal communication ("linking") and process-to-process communication ("distributed computation").

Figure 3. Telnet terminal window

Although Telnet sessions distinguish between client and server sides, the protocol is actually completely symmetric. After a transport connection (usually TCP) is established, both ends play the role of "network virtual terminals" (NVT) exchanging two types of data:

Application data (that is, data that goes from the user to the server-side text application and back);

Telnet protocol commands, a special case of which are options, which serve to establish the capabilities and preferences of the parties (Figure 3).

Although a Telnet session running over TCP is inherently full duplex, the NVT should be considered a half-duplex device operating in line-buffered mode by default.

Application data passes through the protocol unchanged; that is, at the output of the second virtual terminal we see exactly what was entered at the input of the first. From the protocol's point of view, the data is simply a sequence of bytes (octets), by default belonging to the ASCII set, but with the Binary option enabled, any values at all. Although extensions have been proposed for identifying the character set, they are not used in practice.

All application data octet values except \377 (decimal 255) are transmitted as is. Octet \377 is transmitted as the two-octet sequence \377 \377. This is because the protocol itself uses octet \377 to introduce its commands and options.
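
The doubling rule can be written down in a few lines. This is a minimal sketch: the name `IAC` ("Interpret As Command") comes from the Telnet specification rather than from the text above, the function names are invented for illustration, and the reverse function assumes its input contains application data only, with no protocol commands mixed in.

```python
IAC = 0xFF  # octet \377 (decimal 255)


def escape_application_data(data: bytes) -> bytes:
    """Double every octet 255 so it is not mistaken for a command."""
    return data.replace(bytes([IAC]), bytes([IAC, IAC]))


def unescape_application_data(data: bytes) -> bytes:
    """Reverse the doubling; assumes `data` holds no protocol commands."""
    return data.replace(bytes([IAC, IAC]), bytes([IAC]))
```

All other octet values pass through both functions untouched, which matches the statement that application data is otherwise transmitted as is.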

The protocol provides minimal functionality by default and a set of options that extend it. Under the principle of negotiated options, each option must be agreed on before it is enabled. One party initiates the request, and the other party can either accept or reject the offer. If the request is accepted, the option takes effect immediately. The options are described separately from the protocol itself, and support for them in software is optional. The protocol client (network terminal) is required to reject requests to enable unsupported and unknown options.
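
The accept-or-reject step can be sketched as a function that builds the reply to one incoming request. The command names and numeric codes (IAC, DO, DONT, WILL, WONT, and the option numbers for Binary and Echo) are taken from the Telnet specification, not from the text above; the function name `answer_request` and the idea of passing a set of supported options are assumptions of this example.

```python
IAC, DONT, DO, WONT, WILL = 255, 254, 253, 252, 251
BINARY, ECHO = 0, 1  # option codes for Binary Transmission and Echo


def answer_request(command, option, supported):
    """Return the three-octet reply to an incoming DO or WILL request.

    `supported` is the set of option codes this side is willing to enable.
    """
    if command == DO:       # the peer asks us to enable the option
        reply = WILL if option in supported else WONT
    elif command == WILL:   # the peer offers to enable the option itself
        reply = DO if option in supported else DONT
    else:
        raise ValueError("only DO and WILL requests are handled in this sketch")
    return bytes([IAC, reply, option])
```

With `supported={BINARY}`, a request to enable Binary is accepted and a request to enable Echo is refused, which is the behaviour required for unsupported and unknown options.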

Historically, Telnet served to provide remote access to the command line interface of operating systems. Subsequently, it was used for other text-based interfaces, up to and including MUD games. In theory, both sides of the connection can even be programs rather than people.

Sometimes telnet clients are used to access other protocols based on TCP transport.

The Telnet protocol is used in the FTP control connection, so connecting to an FTP server with a telnet client to debug and experiment is not only possible but also correct (unlike using telnet clients to access HTTP, IRC and most other protocols).

The protocol provides neither encryption nor authentication of data. It is therefore vulnerable to any attack to which its transport, that is, TCP, is vulnerable. For remote access to a system, the SSH network protocol (especially version 2), which was designed with an emphasis on security, is now used instead. It should therefore be borne in mind that a Telnet session is quite insecure unless it is carried out on a fully controlled network or with network-layer protection (various implementations of virtual private networks). Because of its insecurity, Telnet has long been abandoned as a means of managing operating systems.

