Multimedia is defined by the International Standardization Organization (ISO) as the creation, modification and composing a/or representation of products comprising any type of combination of media [1]. 'Medium' is any means through which it is possible to perceive information, to express it, to store it, or to transmit it.
'Network multimedia' is understood as: :
Sound, or frequencies 20-20000 Hz, which we can hear, is too large a range for transmission through low-capacity lines. The human voice uses from this spectrum only a low band in the range 300-4000 Hz. This means that, in the transmission of voice it is calculated mainly with the bandwidth 4 kHz. In 1928, at the Bell labs for the first time they started to code sound into a digital form, while it was found that to get a post-digitalization signal equal to the original, it is necessary to code it with a double frequency range; this means, that if a voice has 4 kHz, then 8 kHz is required. Today, we can on the Internet encounter an audio signal in an AU format that is sampled at 8 kHz. This format is rather obsolete; it does not use any type of the audio compression, so popular in recently. Despite this, there has been up until now only one audio file supported in JAVA applications (the standard of the company SUN). So, at the frequency 20 kHz we use 40 kHz. In the case of a CD, there is used a 44 kHz 16 bit stereo, which results in 44 x 16 bit x 2 = 176 000 bits (175 kHz) per second. This figure is relatively big. One of the possibilities of how to reduce it is compression.
This is the process of converting a wave time course into discrete values. More information on quantization
This is necessary due to the high cost of transmission. It exploits some attributes of human voice, for instance pauses between words, long periods of magnitudes, predictable changes in voice amplitudes; allowing high expectations for good compression.
The Pulse Code Modulation (PCM) is a popular method of coding an analogue signal. It is used in the public telephone network. The PCM samples an analogue signal with the frequency 8000 times per second, using 7-bit or the 8-bit quantization. In this way, we achieve results of 56 kilobits per second (kbps) with 7-bit sampling, and 64 kbps with 8-bit sampling. The PCM format supports only amplitude compression. The ADPCM (Adaptive differential PCM) uses bandwidth compression, which can reduce the bandwidth by half.
A digital picture, or image, comprises pixels in a 2D field, where each pixel represents a level of intensity or color. In the case of a black & white image, one bit is enough for each pixel. Gray-scale images (256 shades of gray) require 8 bits per pixel. Truecolor images demand 8 bits for representing every color from the RGB color model, which means 25 bit per pixel. Because digital video contains a huge amount of data, compression is essential from the view of transmission via a network.
A videoconference of an uncompressed video (30 frames per second) with the resolution 352x288 would require a bandwidth of 73 Mbps (352*288*39*24 bits).
I - the intra frame containing compressed information of the whole image
P - the prediction frame containing changes between two images
B - the image containing changes between the previous and following frame
(this is possible,because the MPEG stores groups of images at the same time, and not gradually one image
after the other).
A typical 30-frame MPEG looks as follows:
IBBPBBPBBPBBPBBIBBPBBPBBPBBPBB
with 1/3 of the data (from the overall size) being I images, 1/3 being P, and 1/3 being B images. The B images are calculated from the I images and the P images. This sequence is a series of displayed images; in the file is stored the sequence I, P, B1, B2, P, B3, B4, ?, when the first I image is displayed on the screen, the P image is read into the memory, then, from the I, P, and B1images is displayed the second image, next from the I, P, and B2 images is displayed the third image, and in the end the P image is displayed.
V origin? : MPEG-2, a standard for digital television (officially designated as ISO/IEC 13818, in 9 parts).
While the MPEG 1 was primarily intended as an audio/video compression technology of VideoCD (films on CD-ROMs), MPEG 2 was intended for digital television and High Definition Television (HDTV). With regard to the compression method used, MPEG 4 appears as a close relative to H.263 and MPEG-1. Documentation indicates that the MPEG-4 should be backward compatible, or even almost compatible with H.263 and MPEG-1.MPEG-4 uses video compression algorithms like hybrid block discrete cosine transformations and the movement compensation coding methods used already in the MPEG-1, MPEG-2, H.261 and H.263
The MPEG-4 format is already supported also directly in the video player for Windows 98 by the version Media player 6.4. The files have an extension '.asf', in the most common dimensions 320x240 with 15 to 30 frames per second. The size of a 120-minute film in MPEG 4 is from 220MB(for a film with the quality 250Kb), 440MB (for a film with the quality 500 kb), up to 880 MB (for a film with the quality 1Mb). Hardware demands comparable to those of the MPEG-1, where a Pentium 150 was enough for playback, are about double (at least PII 300 or K6-2 300).
One of the compression programs is the Sonic Foundry Stream Anywhere v1.0 of 27 September 1999, that can compress from the following formats: MPEG 1, AVI, and MOV, into the format ASF (MPEG4) with the bandwidths of 100, 250, 512, 1024, 3072 Mb for video, and 28.8 and 56 kb for videoconferences.
DivX is MPEG4 Hi-Res Video Codec for coding video in DVD quality, but in a size of
about 700 kbps (2 hours' film on one CD).
You can find more information on the address
http://divx.ctw.cc.
1. Common Intermediate Format (CIF)
2. Quarter CIF (QCIF),
where only QCIF is compulsory.
CIF has a resolution 352x288, and the number of frames per second is most frequently 30. It provides support for applications requiring high transmission speeds, as for instance videoconferences where several people are in the viewing field of a camera.
QCIF The QCIF has a resolution 174x144 (one quarter of the CIF size), and the number of frames per second is typically 7.5 to 15. Compared to CIF, it requires only one quarter of the transmission channel. The transmission band is in the range from 64 to 128.
The Creative, Inc. Sound blaster marked the start of the development of sound cards, enabling 8-bit mono sound. Up until then, sound in a PC was produced through a 'PC speaker'. Later, 16-bit sound cards for stereo sound entered the scene. And then, 3D sound followed.
The following standards were designed for 3D sound over the time:
A3D v 2.0 - ? the standard for 3D sound by Aurel; this simulates sound propagation using a simplified spatial material model of an environment.
EAX (Environmental Audio eXtension) ? the standard of spatial sound, emphasizing 3D sound through the simulation of sound reflections from the environment. It is easily programmable, and contains pre-calculated values of parameters in relation to the surrounding environment.
Dolby Digital (AC-3) -a digital system of coding 5 standard channels (frequency: 20 Hz ? 20 kHz), two front and two back loudspeakers plus a central loudspeaker, and one channel for a subwoofer (a loudspeaker for frequencies 20 ? 100 Hz) into one digital data stream. This system is briefly indicated in 5.1.
HRTF (Head Related Transition Functions) ? the technology of 3D sound simulation using two loudspeakers or headphones.
Though spatial sound can be made on a stereo loudspeaker system, the resulting 3D sound is not comparable with that as made using 4 loudspeakers.
In sound diffusion in an environment, the Doppler effect arises; through which the frequency of sound of a approaching object appears higher than in a stable, standing object; and the opposite, if an object is receding, its frequencies seem lower
Modern sound card chips comprise a processor, however not a classical processor like a Pentium type, but a DSP procesor (Digital Signal Processor) for processing data in real time. The performance of this sort of processor is given in MIPS (million instructions per second). For comparison with a classical processor, the performance of an EMU 10K chip installed in the sound card SB Live! with the declared performance of 1000 MIPS corresponds to the performance of about a 100 MHz Pentium
At present, besides SB Live!, there are other quality sound cards; for instance: cards installed by Aureal Vortex 2 chip with the power of 800 MIPS. A very good and cheap alternative is the sound card MediaForte Quad X-Treme 256 providing, besides output to 4 loudspeakers, also support of the EAX standard.
Bit rate - number of bits which can be transmitted through a channel for one second (bps).
Codec - a hardware or software component establishing the digital coding and decoding of video signal.
Compression - ? a technique enabling the reduction of a data size required for the reproduction of digital or video signal
H.261 - Low level video codec supporting the CIF and QCIT. Known as 'Px64'
H.262 - Low level video codec, similar to MPEG-2
H.263 - Low level video codec supporting the SQCIF, QCIF, CIF, 4CIF, and 16CIF