Voicing Electronic Organs
Home Up Other Publications Contact Me About the Author About this Website " Prog Organ"

 

VOICING ELECTRONIC ORGANS

  by Colin Pykett

Last revised January 2008

Copyright © C E Pykett 2003 - 2008

Abstract  The subject of how today's digital electronic organs are voiced is many-facetted.  However there is one factor usually paraded as a benefit which in fact is a decided hindrance.  This factor is the complete lack of any intrinsic organ-like character in the hardware and software environment of a digital organ, unlike a pipe organ which can only ever sound like what it is.  By contrast, an electronic organ could just as readily emit the sound of a barking dog as an organ pipe.  Therefore this virtually unbounded flexibility demands a detailed understanding of the speaking characteristics of organ pipes  before the organ can be voiced effectively.  Consequently the majority of this article is concerned with this topic.  It describes the types of digital organ available, how they work and the pros and cons of the various systems.  It then covers matters such as how recordings of organ pipes can be made, how these may require pre-processing before being incorporated into an instrument and how sounds can be created without the need to first make recordings.  Spectrum analysis is discussed in detail because its capabilities and limitations are widely misunderstood, particularly when applied to attack and release transients.

  CONTENTS

(click on the desired section to access it)

INTRODUCTION

THE COMMERCIAL SITUATION

TIME DOMAIN SYNTHESIS TECHNIQUES

FREQUENCY DOMAIN SYNTHESIS TECHNIQUES

FURTHER SYNTHESIS TOPICS

    - ADSR

    - Initial Attenuation

    - Polyphony

    - Multitimbrality

    - Real Time Operation

TIME VERSUS FREQUENCY DOMAIN SYNTHESIS - pros and cons

VOICING

    - Making Recordings of Organ Stops            

    - Pre-processing Recorded Waveforms

    - Spectrum Analysis

    - Noise Removal

    - Transient Analysis

    - Wavelet Analysis

SOFTWARE TOOLS

APPENDIX 1 - Digitising Pipe Organ Waveforms

APPENDIX 2 - Additive Synthesis

NOTES & REFERENCES

 

INTRODUCTION 

The processes used in voicing digital electronic organs are not well known and consequently there is much ill informed buzz and speculation about the subject, as evidenced from Internet chat lines for example.  The reasons for the situation include the lack of information available from manufacturers, and an apparent misunderstanding of topics such as spectrum analysis and the physics of organ pipes even on the part of some of those in the electronic music business.  

This article surveys the types of digital organ available and then moves on to how they are voiced.  The most important feature is a detailed understanding of the nature of organ pipe sounds, which is far more significant than the type of digital system used to simulate them.  Without such knowledge no system, no matter how sophisticated, will achieve success.  This is because any digital organ, unlike its pipe counterpart, is not intrinsically organ-like in any way.  Until it is programmed in minute detail to emit the sounds of organ pipes it could just as easily become a digital dog-barking machine.  Therefore this article considers in detail the processes necessary to identify and extract the essential features from the sounds of real pipes, including their transient behaviour.  

The material in this article requires no more than a high school understanding of science, together with a similar level of computer literacy. Some musical understanding and experience of the pipe organ is also assumed.  Appendix 1 outlines the essentials of how digital signals are generated and handled by computers for those who are unfamiliar with the subject. Appendix 2 covers additive synthesis, a topic which occurs repeatedly throughout the article.  However in both cases the Appendices explain a number of detailed technical points, so they are recommended reading even for those with greater experience.  Spectrum analysis is covered in considerable detail in the article because not only is it pivotal to an understanding of pipe sounds, but it is a subject  widely misunderstood.  

Some of the more detailed aspects can be omitted without a significant loss of comprehension.  These are identified by a smaller typeface.  However this does not imply lesser importance, and anyone seriously into the subject needs to be familiar with them.  

THE COMMERCIAL SITUATION  

Digital electronic organs come mainly in two varieties at the time of writing.  Frequently these are called sampling and real time synthesis instruments.  An example of the former is the Allen organ and of the latter, the Bradford Computing Organ (and those which use derivatives of the Bradford technology).  However this nomenclature is unfortunate and misleading in one respect : the term “real time” is used widely in the field of digital signal processing to describe a system which is capable of responding to or reproducing events at the rate at which they occur in the real world.  Since both types of digital organ and indeed any musical instrument that was ever made necessarily must have this attribute, it cannot be appropriated legitimately to describe just one variety. In any case, it will be pointed out later that at least one type of “real time” organ does not operate strictly in real time at all.  Moreover, a defect of some digital organs in the past which still persists sometimes today is that they cannot keep up with the demands of the player in some circumstances - and this was true particularly of "real time" organs.  This inexcusable attribute makes the use of the term “real time” ridiculous.  The use of the term  to denote a particular kind of electronic organ is usually only for sales and marketing purposes.

The terms time domain and frequency domain synthesis, which are also the property of the wider signal processing community, are much better and we shall use them in this article.  Samplers are examples of time domain synthesisers, and the so-called real time organs to be described use frequency domain methods.  But what are these methods in plain language?  To understand them better it is helpful to take a trip into history.  

In the beginning, starting around the 1920’s, all electronic organs used analogue electronic circuitry.  But in 1971 there appeared a patent [1] which was materially associated with the development and subsequent marketing of the Allen digital organ.  Although it was by no means unique among a plethora of similar patents both before and since, it was in many ways a remarkable document and even today it repays careful study.  Among other things it described a “generator” which could be assigned to any key and which would read out from a memory one or more stored waveforms, at a rate related to the key in question.  When the key was released the generator would be released also, after the waveform had decayed gradually.  All of this was done years before the microprocessor had appeared in general use, and before memory chips were widely available.  In fact the whole thing was a vast hard wired arrangement of simple binary logic elements and the memory was contrived from diode arrays.  In practice the system was realised using a set of novel custom integrated circuits designed by the North American Rockwell Corporation which filed the patent.  This story is interesting not only for its technical content, but because it portrays the evolution of virtually every digital organ since – a new technique protected by patents filed by a specialist firm or other developer, the subsequent development of custom integrated circuits of high complexity (and, later, software as well), and the dissemination of these solely to one or more organ manufacturers under tight commercial licensing conditions.  

A few years later another key patent was filed in the UK in 1976 by the National Research and Development Corporation, a long-defunct invention of the interventionist government of the day, acting for Bradford University.  After a curiously long wait this patent was published in 1980 [2].  It described an alternative system in which the only stored waveform was a sine wave (a pure tone), or even just part of a sine wave, together with lists of numbers describing the harmonic structures of the tones to be simulated.  By then cheap microprocessors had become available, considerably simplifying the design problems, although they were not capable enough to undertake many of the real time computations required.  Consequently this first Bradford Computing Organ design also contained esoteric electronic chips such as binary multipliers.  This system has been used by a number of organ manufacturers and so we see again the same evolutionary pattern as before – a novel design licensed to various organ firms, most of which had played little or no part in the initial development (although in the case of the Bradford system a few did).

It is useful to take stock of the historical context in which these two patents arose.  A large part of the earlier one had nothing to do with the storage and generation of sounds at all, rather it described methods of key scanning and multiplexing which today are taken for granted.  But the point is that when the patent was conferred these techniques (obvious though they were, even at the time) were also then the subject of legal protection.  Thus during the 1970's and 80's other organ manufacturers were prevented, in theory, from even using these relatively trivial techniques let alone the novel method of sound generation itself.  One consequence was that virtually all other firms continued to build analogue organs in this era.  Although a minority of these was very good (but expensive), the rest were cheap and awful.  In the midst of this situation the Bradford patent emerged, because it was able to sidestep the problem of infringement by describing a method of sound generation using frequency domain techniques rather than time domain ones.  In the 1980's a few firms in Europe began to use the Bradford technology, including Ahlborn and Wyvern.  By the mid-1980's the Musicom firm had also appeared which developed a derivative of the Bradford technology, and one of the first commercial users of this system was Copeman Hart in the UK.  A few years later the situation relaxed still further with the expiry of the early time domain patents.  Then firms worldwide were able to use time domain methods freely, and there was a flood of electronic organs in the early 1990's using direct storage of waveforms.  The analogue organ market collapsed equally rapidly at that time.

Perhaps the most important message to take away from this story is that frequency domain synthesis arose during a period when the protection afforded to time domain synthesis was very tight and vigorously defended. Because the Bradford work was funded by the British taxpayer it is reasonable to assume that it was driven by a desire to sidestep the time domain legal straitjacket and open the door wider to UK industry.  Against this background some of the claims one still hears about the alleged superiority of frequency domain methods cause some amusement; all too frequently they have little basis in technical fact.

The situation depicted above has been repeated several times, particularly after the original patents expired.  Both of the systems mentioned have been the subject of continuous development and in addition others have appeared.  Today, various specialist firms source the hardware and software necessary for the systems, and these are usually updated regularly.  It is rare for the electronic organ manufacturer himself to assemble his products from scratch as he used to do in the analogue days.  But from a business perspective, the fact that some manufacturers have made the decision to procure their enabling technology from a single source might imply a strategy with some risk.  This could have unfortunate implications not only for themselves but for their customers in the long term. 

So we have seen that the Bradford system began to be used by a few manufacturers in the 1980’s, and shortly afterwards the technology also became available in an alternative form from Musicom.  Some firms, such as Wyvern, apparently continued with the Bradford system even though their implementation (at least as described in 1999 [3]) is somewhat dated by current standards.  The systems are modular in that they can be expanded to suit various sizes of organ, measured in terms of keyboards and stops.  Depending partly on the size of the organ and partly on how much flexibility the customer is prepared to pay for, these organs incorporate a number of “music modules” (the name might vary depending on the manufacturer), each of which can provide typically 64 independent note generators.   A single module of this size could only cope with a single department of a few stops without running the risk of missing notes, another undesirable phenomenon exhibited by the cheaper organs of either type.

The Allen digital organ, and it is probably safe to say most others, continues to use time domain methods of sound production.  This technique also characterises most if not all pop music instruments such as synthesisers and computer sound cards. It therefore has an enormous user base world wide, with correspondingly wide acceptance and a large amount of pooled experience.  The pros and cons of the systems continue to excite debate, not always conducted objectively or with sufficient insight, among manufacturers, dealers, performers, owners and others with an interest.  

There is a third method of sound production in use today, though so far its impact on the digital organs available appears to be minimal.  It is called physical modelling because it is based on a detailed physical analysis of how musical instruments actually work, and the conversion of this understanding into mathematical equations and computer algorithms.  Computers today are so powerful and fast that these equations can be solved in real time to generate the sound that the instrument itself would emit.  The method has its roots in research begun several decades ago, and the clarinet was one of the first to be fully characterised in this manner. Because it has yet to make major inroads into the digital organ field, it will not be discussed further in this article.  

A problem besetting the subject is the paucity of information and documentation available in the public domain from electronic organ manufacturers and those who supply them with their enabling technology.  One can surmise why this is so, but unfortunately it does not serve the interests of customers and it also allows an unhelpful surfeit of misinformation and misunderstanding to flourish.  This can be observed on a daily basis merely by dipping into various Internet discussion lists, or from the letters columns in the organ literature where correspondents frequently plead for clarification.  Fortunately, even digital electronic organs cannot transcend the capabilities of computer science and engineering or the laws of physics, regardless of claim and counter-claim and strenuous efforts by the ad-men.  Therefore by examining the subject at this level we can identify their strengths and weaknesses and the areas which are most important in terms of voicing, regardless of the firms or technologies involved.  It is for this reason also that terms such as time or frequency domain synthesis are used in this article, to avoid confusing matters of fact and principle with particular commercial offerings.  

TIME DOMAIN TECHNIQUES  

We have just seen that commercial music systems for the mass consumer market such as synthesisers use time domain methods, and this is also true of many if not the majority of digital electronic organs.  The characteristic feature of time domain synthesis is the direct storage of waveforms, or time series as they are more rigorously termed, as sequences of binary numbers.  Thus the digital organ described in [1] was a time domain system because it relied on the storage of waveforms in memory, and these were read out and supplied to the loudspeakers when keys were pressed.  Appendix 1 outlines the essentials of digital signal representation. 

FREQUENCY DOMAIN TECHNIQUES  

Frequency domain synthesis techniques are less common than time domain ones in the commercial music world, at least at the mass consumer level, and the only ones to be described are those which use a form of additive synthesis.   These techniques are classified as frequency domain ones in this article because the stored values relate to frequency artefacts (spectra) rather than those based on time (waveforms).  Appendix 2 summarises the process of additive synthesis.  

We have just seen that in time domain synthesis the memory of the system is occupied by samples of the actual waveforms we want to hear.  With frequency domain synthesis the only stored waveform as such is typically one cycle of a sine wave, or even just one quadrant of a sine wave [5].   This is because the only waveform necessary in additive synthesis is a pure sine wave at each harmonic frequency.  For this reason the Bradford patent [2] did not infringe those such as [1] protecting the Allen organ.  But additive synthesis relies on adding various harmonics, so it is also necessary to store tables of numbers representing the strengths of the harmonics making up the spectrum of an organ stop.  Thus for a diapason, typically 15 to 30 numbers would be needed for each spectrum.  However, as with the time domain system, each stop requires several spectra to enable the variation in tone quality across the keyboard to be captured.  

 

Figure 3 shows a conventional representation of an amplitude spectrum.  Each vertical line represents a harmonic and its height is the amplitude or strength of that harmonic.  The harmonic amplitudes in this example are in dB (decibels), and although this representation is standard practice it can give rise to confusion.  The use of the term “amplitude” is important here as it implies that an increase of 6 dB means a doubling of amplitude.  An alternative representation uses harmonic intensities or powers (again the words are important as they have an exact meaning), which are the square of amplitude.  In this case a doubling of intensity means a change of 3 dB.  More explanation of decibel notation is available in [6].  Therefore you have to look closely when decibels are used to find out which type of spectrum – amplitude or power – is implied, otherwise major errors can ensue in the subsequent synthesis.  Sometimes the term SPL (Sound Pressure Level) appears on the dB axis and this denotes an amplitude measure, not intensity or power.

Some further remarks about nomenclature are appropriate at this point.  People often speak of a spectrum as “the FFT” of a waveform, or of “an FFT program” that will generate a spectrum.  “FFT” means Fast Fourier Transform, and to use it in this way is misleading and non-rigorous.  We can get into difficulties if we do not understand the terminology properly, particularly when dealing with waveforms such as the starting transient sounds of organ pipes, a subject we shall come onto in due course. 

The reason why Fourier’s name crops up is because he was the French mathematician whose  remarkable insights some 200 years ago led to today’s understanding that all periodic sounds can be expressed as a sum of pure tones, constituting the fundamental and upper harmonics of the sound.  The adjective periodic is important here because it means not only that each repetition cycle of the waveform is identical, but that the waveform lasts for many such cycles (the mathematics actually requires an infinite number).   What is called the complex spectrum of such a waveform (not the same as Figure 3) is then given by the Discrete Fourier Transform (DFT) of the waveform, assuming we are dealing with a digitally sampled version of it in a computer.  Fifty years or so ago it took far too long to compute the DFT directly because computers in those days were considered fast if they had a clock rate above 1 MHz.  Therefore Cooley and Tukey published a paper describing how the process could be speeded up by using their Fast Fourier Transform algorithm.  Often it matters little today whether one uses the DFT or the FFT, so fast have computers become.  In fact the FFT can be slower than the straightforward DFT in some circumstances if one is not careful because of the data-shuffling and manipulation overhead it entails compared with the DFT.  

The DFT and the FFT both give the same result, which is not what we conventionally speak of as the spectrum of the waveform such as that pictured in Figure 3.  The output of the complex DFT or FFT contains not only the amplitudes of the harmonics but their relative phases as well.  It also detects both positive and negative frequencies, which we shall not go into. Because the ear is insensitive to phase and negative frequency, we can throw this information away and only use the amplitude and positive frequency information.  But to do this it is necessary to compute what is called the modulus of the complex transform.  An economical way to proceed in terms of the programming, execution time and storage required is to compute the real rather than the complex transform.  This allows the transform length to be only half of what it would otherwise be.  Although these details cannot be explained exhaustively here, they are all important and therefore the overall process of deriving a spectrum is more complicated than simply computing the FFT alone.   

What’s in a name, I hear you say?  Well, incorrect usage reveals vague understanding.  In digital organs the vagueness can lead to inadequate synthesis of starting transients and inefficient additive synthesis to name but two problems, and both are addressed later on.  A further inefficiency, if not in the organs themselves then in the waveform editing processes used to voice them, is related to the apparently widely held belief that the FFT always has to operate on a data length (number of samples) which is a power of 2:  256, 512, 1024, etc. Such an FFT algorithm is called a radix 2 transform.  This is a curious hangover from the original implementation of the technique which did indeed recommend powers of 2.  It is a needless restriction however, and the FFT can use any radix.  Even today one of the most efficient algorithms available was developed at the Stanford Research Institute over 30 years ago; this used a mixed radix transform, thereby allowing virtually any data length to be used.  It also employed a particularly elegant and rapid method of doing the mathematics which avoided the need to repeatedly calculate or look up trigonometric functions.  Moreover, some particularly fast implementations of the FFT have used a prime radix transform in which the data lengths are prime numbers.    Since most, if not all, modern waveform editors still impose the radix 2 restriction it is evident that developments such as these are not well known to the commercial electronic music community.  

In frequency domain synthesis each of the individual spectra making up one stop can be assigned to a different keygroup, a range of notes, though in the commercial systems we have mentioned each is usually assigned to a single note which is sometimes referred to as a voicing point.  The computer system then usually derives an interpolated spectrum for every other note from the two nearest voicing points.  Interpolation is a mathematical technique but it really means blending or (to use an image processing term) morphing, the production of something new from several constituents. If a note keyed corresponds precisely to a voicing point then only that spectrum is used further in synthesising the output.  If the note lies between two voicing points, an interpolated (blended) spectrum is calculated in which each harmonic has an amplitude which is some arithmetical function of the same harmonic in the two spectra at the nearest voicing points. In some systems the mathematical form of the interpolation function can itself be specified.  Interpolation disguises sudden audible discontinuities between voicing points, although the operation of the interpolator can sometimes be detected by playing chromatic scales up and down the keyboard – one can hear the changes in tone quality between the voicing points, and indeed often find out where the voicing points are.  This illustrates at once the blatant crudeness of such instruments compared to real pipe organs, where such phenomena are of course absent.

The spectra themselves cannot be used directly for generating sound – for this they need to be converted to waveforms.  This is done in the Bradford system, for example, in two stages. The spectra for each note of a stop are first calculated by interpolation as described above when the stop is drawn.  Then a corresponding waveform, often just a single cycle, for each note is derived by additive synthesis from each spectrum and placed in a temporary note memory.  These processes occur whether keys are currently pressed or not.  The system then waits until a key is pressed, at which point the waveform for that note and that stop is read out from the note memory just as in a time domain system.  This method of doing additive synthesis gets over the problem of doing it in real time as the music is played, which would require an almost impossibly fast computational capability.  Therefore some organ systems colloquially called “real time” are in fact anything but!  This is another reason for referring to them here as frequency domain systems.  In some systems the synthesised waveforms can be updated rapidly as the music is played to simulate effects such as “live winding”, in which the speech of pipes in a real organ is affected by the dynamics of the winding system.  This is discussed at more length later on.  It is a feature made much of by some manufacturers and pundits, but in fact it uses only basic computer techniques.  It is no different in principle to the monitor on the humblest PC whose picture content can be changed rapidly in response to the demands of a computer game, say.    

In theory, a more efficient way to perform additive synthesis is to compute the Inverse Fourier Transform (IFT) using an FFT algorithm.  Just as the forward Fourier Transform converts time domain data into the frequency domain, the inverse transform converts frequency data into the time domain.  Therefore it can be used to generate a waveform from a set of harmonic amplitudes, i.e. from a spectrum.  As with the forward transform, additional efficiency is gained by ignoring the phase of each harmonic.  However if the transform length is small, care has to be exercised to ensure that the FFT is actually faster than the "brute force" method described above.

It might be wondered why the frequency domain system is used, because it appears at first sight to be more complicated than the time domain sampler technique.  When it was first developed one answer doubtless related to the patent situation.  Nevertheless it is a reasonable question to ponder on, and the advantages and disadvantages of each method will be considered later. .

FURTHER SYNTHESIS TOPICS  

ADSR

We now leave this discussion of synthesis methods and turn to some related topics which apply to all of them.  The term ADSR occurs repeatedly in electronic music, and it is necessary to understand what it means.  It is an expression which arose in the early days of monophonic analogue synthesisers and it stands for Attack, Decay, Sustain and Release.  

 

In Figure 4 is sketched an ADSR curve, which is the amplitude-versus-time envelope of the sound of a single note of a single stop (although in the case of frequency domain systems it can also relate to a single harmonic of a sound).  When a note is keyed the amplitude increases from zero over a time denoted by the Attack parameter.  The maximum value reached during this phase is sometimes called the Initial Attenuation level.  Then the amplitude might fall to some lower level over a second time period defined by the Decay value.  After this the amplitude remains constant, at a level defined by Sustain, for the time that the key is held down.  Finally when the key is released the sound drops to zero over the Release period.  The Attack, Sustain and Release phases correspond exactly to those in Figure 1 which illustrated how sound is generated from a finite wave sample.  

Note that the Attack, Decay and Release parameters are all expressed as time intervals (typically milliseconds), whereas the Sustain parameter is an amplitude level (typically expressed in dB).  In organ work the Decay phase is more often than not irrelevant as far as the overall sound is concerned, because once the Attack phase is over we do not usually want the amplitude to reduce again before the steady state commences. Such an overshoot can produce a most peculiar effect if overdone.  However it can have importance if a separate ADSR characteristic is applied to each harmonic of the sound in  frequency domain synthesis.  For example, the second harmonic of a diapason pipe often rises more rapidly than the fundamental to a higher peak value, before dropping back to a steady state (Sustain) level.  

Do not confuse the Decay and Release parts of the curve; the ending of a sound when the key is released is governed by the Release value, not the Decay one.  Some odd effects can be produced if the wrong value is adjusted inadvertently when voicing or regulating a stop!   

Initial Attenuation

This term does not enjoy a universally acknowledged definition, although it is used in all synthesis methods in some way or other even though its name may vary. It governs the overall attenuation applied to each waveform and is quite independent of the other parameters in the ADSR characteristic.  It is one of the most useful parameters available in a voicing program for getting the regulation of a stop correct over the compass, that is, the way its loudness varies across the keyboard.  In turn the quality of regulation of a stop materially influences its blending characteristics with the others.  

When a sample is loaded into a time domain digital organ it will always be replayed at some fixed maximum level (referred to here as an Initial Attenuation of 0 dB) unless that level is changed.  This assumes there is no Decay phase, so that the Initial Attenuation and Sustain levels in Figure 4 are one and the same.  This will be appropriate for the vast majority of organ samples.  Naturally, the level of the sample corresponding to one keygroup might not be appropriate for the adjacent ones, so to prevent an audible discontinuity appearing the Initial Attenuation parameter for the corresponding samples has to be adjusted.  It is given some other value such as – 6 dB, meaning that an attenuation by a factor of 2 would be applied to that keygroup, although in practice the correct values have to be found by trial and error listening tests.  

Initial Attenuation is also important for adjusting the relative loudness of the various stops.  If this were not done a Dulciana would sound as loud as a Trombone, and to prevent this the Dulciana might need to be attenuated by 35 dB or more relative to the Trombone.  The computer in an organ has no way of divining what we want it to do unless we tell it explicitly!  

Although this discussion referred to time domain systems it also affects frequency domain ones.  At the point when sounds are emitted from the organ, both systems are replaying waveforms stored in computer memory.  For many purposes it does not matter how they got there.  

Polyphony

It is necessary to say a few words about how digital organs use the stored waveforms or harmonic tables to produce sound when several notes are keyed at once.  One of the most important issues concerns polyphony, which is the ability of an organ to produce many notes simultaneously as opposed to the monophonic operation which characterised the earliest synthesisers.  

The following brief discussion is included to assist an understanding of why polyphony is important, and why unlimited polyphony as in the pipe organ can be difficult to achieve in electronic ones.  It should be appreciated that the hardware and software details mentioned are generic, rather than relating to any particular make of digital organ.

All organs can be considered to use generators to actually produce the sounds, although these are very different from the tone generators used in analogue organs.  Moreover, the actual term used might vary depending on the manufacturer’s preferences.  Each generator can be envisaged as a flexible digital circuit arrangement, or a very fast software module, which can accept any waveform for any stop and output it to any amplifier and loudspeaker.  When a key is pressed the computer determines which waveform has to be used for a particular stop and loads it in some way into a free (currently unused) generator, although usually the waveform would not be physically relocated within the computer’s memory.  Instead the generator would be loaded with the address of the waveform where it resides in memory.  However this is a detail which is not of particular interest at present.   Also one need not assume that each generator necessarily handles the complete waveform for a sound.  Sometimes only specific harmonics are loaded into a generator, and these are then combined afterwards to produce the composite sound.  Such techniques are associated mainly with frequency domain (additive) synthesis.

The computer also loads quite a lot of additional information into the generator relating to the ADSR envelope, Initial Attenuation, subtractive filter parameters (if relevant), looping parameters (if relevant), the output channel to be used, etc before switching the generator on.  The note then sounds until the key is released.  If another key was to be pressed while the earlier note was still sounding it is easy to see that another generator has to be available.  In fact if it is decided that up to 8 notes, say, need to be catered for simultaneously on each department of the organ then there need to be at least 8 generators per department.  But this is an oversimplified view of things, and we need to consider how to handle the use of multiple stops simultaneously as well as multiple notes.  

Multitimbrality

An organ has to cater not only for polyphonic operation on a single stop but on multiple stops simultaneously.  It would be a poor organ on which one could only play one stop at a time.  In electronic music parlance this leads to the concept of multitimbrality, a monstrous word no doubt intended to impress the gullible.  However it merely means the ability of an instrument to sound several stops of different timbres at once.  

In an organ of any pretension it is essential to be able to introduce the subtle variations in tuning which characterise a pipe organ.  If a single note is played on a pipe organ with two unison stops drawn they will almost always be slightly out of tune as revealed by very slow beats.  In a recently tuned organ one beat every few seconds would be typical.  If this feature is not incorporated the sound of the digital organ will be thin and unconvincing, and it can only be achieved by having enough generators to assign to every key currently pressed of each stop currently in use.  Thus if there were 8 stops on a particular department and a polyphonic capability of 8 notes was required, then a total of 64 generators would be required just for this small department.  The generators are quite complex pieces of hardware and software which interface in an intimate manner to the computer itself, so that the program in the computer can control all of their functions.  It is this intrinsic complexity which demands the use of specialised integrated circuits for digital organs, as it would be virtually impossible and definitely uneconomic to build all the generators required using ordinary components.  Each “music module” of the Bradford system contains 64 generators, the same as some modern computer sound cards.  

The ability to add more sound generating hardware to a digital organ at will, such as by specifying the number of “music modules” to be used, means that enough generators can be provided in theory to cater for the size of stop list under consideration.  However in practice, even with the advantages conferred by integration, economies have to be made sometimes.  To ensure that sufficient generators remain free while the organ is being played, some notes on some stops will either not sound at all or they will be treated in a different manner under certain circumstances.  For example if a loud reed is being used at the same time as a quiet flue stop is drawn on the same manual, the quiet stop simply might be ignored by the system in the belief that the organist and his audience would not notice.  Or the waveforms for both the stops might be added together before being loaded as a composite waveform into a single generator, thereby removing the “free phase” effect that we have just seen is so desirable.  Some systems brutally drop notes and/or stops completely and randomly when an overload situation approaches, rather than attempting to fail gracefully in the manner described.  

Similar problems arise if many couplers, particularly octave or suboctave couplers, are used while playing.  A single octave coupler can double the number of generators required as soon as it is drawn.  This explains why such couplers are used rather sparingly in most digital organs.  Also mixtures can represent a considerable resources overhead if the individual ranks of the mixture are assigned to separate generators, to properly simulate independent ranks of pipes.  For this reason mixtures may use a single composite waveform which then only requires a single generator per note played, just as with an ordinary stop.  However when this is done, it is then much more difficult (often impossible) to simulate the slight independence of tuning between the ranks; the mixture will be tempered to an unrealistic degree of exactness in this case.  

Real Time Operation

Obviously, all digital organs have to work in real time.  Unfortunately not all of them do, and sometimes players find that a perceptible time lag arises when the music is fast, highly polyphonic, when many stops are in use, or when some or all of these factors arise at once.  In the worst cases the system can grind to a halt, notes can be lost or they continue to sound indefinitely.  The problems arise because the computers in the system simply cannot always cope with the demands placed on them by the player.  It is fair to say that these problems are probably less frequent today than they appeared to be some years ago, but they are of course completely indefensible in an instrument of any sort.  If an electronic organ is dependent on MIDI, this can introduce similar problems of its own [7].  

TIME VERSUS FREQUENCY DOMAIN – PROS AND CONS  

This section begins a discussion of the advantages and disadvantages of time domain and frequency domain methods of synthesising sounds.  Not all conceivable systems can be considered, so the discussion is limited to those mentioned previously which either store waveforms directly or those which compute the waveforms from stored frequency information.  A number of points needs to be made or repeated first to facilitate the discussion: firstly, it is useful to keep in mind that both types of organ produce the sounds from waveforms stored in memory at the time the sounds are being produced.  The fact that the waveforms in one type of organ are permanently resident whereas in the other they are computed as required is sometimes of lesser importance.  A second point is that, because of this difference, the amount of storage required for the time domain system can be much greater than for the frequency domain one.  Each stored sample might be several seconds in length, and because there are many samples the total amount of memory might exceed that for a system which stores harmonic amplitude information.  In practice this is not usually a disadvantage given the ready availability and cheapness of memory today.  Moreover, the difference is not always as great as might be supposed, because the most sophisticated frequency domain systems also have to store large data tables to simulate the transient portions of the waveforms, and even more if real time fluctuations in quasi-steady-state sounds are simulated (see below).  A third difference is that the computing power required in frequency domain systems is generally greater than in time domain ones, though again the difference might not be as significant as often supposed when very high quality simulation is demanded.  

A major difference in capability between the two systems in principle is that time domain organs can reproduce a previously recorded pipe waveform directly whereas those using frequency domain methods cannot.  In the latter case the waveforms have to be reconstructed from frequency information, and this information therefore has to be derived offline beforehand from the paradigm waveforms or in other ways.  A frequency domain organ does not reproduce an exact copy of what you hear from an organ pipe because it will necessarily have been processed in some way firstIn practice the difference in many cases is less important than sometimes claimed however, because making recordings of organ pipes to the necessary high standard required for direct insertion into a time domain sampler is extremely difficult. Extraneous noises are but one problem to overcome.  In these cases the waveforms have to be cleaned up and then resynthesised offline, and similar processes are involved in both cases.  Another major problem affecting time domain samplers is that of looping the waveform to simulate a note longer than that of the stored sample, and often it is impossible to find loop points on the raw waveform such that the ear cannot detect that looping is taking place.  This can occur because of noise present in the signal, and although the noise can usually be suppressed, it is often undesirable to do this because it is the windy sound of some flue pipes which gives them much of their character.  Even if loop points can be found in these cases, the repetitive "swishing" of the noise as it loops becomes offensive to the ear once it has detected its presence.

Many claims are made today for digital processing techniques which profess to be able to loop "difficult" waveforms undetectably, but the plain fact is that they alter the waveform to such an extent that it can no longer be regarded as a sample of the sound as originally recorded.  About the only way to surmount the looping problem without corrupting the original waveform is to use samples of such a length, in excess of ten seconds or so, that their duration is unlikely to be exceeded in normal playing.  If this should occur then looping will of course have to be used to extend the apparent duration of the samples, but with such long samples it will only be required infrequently.  Although memory is cheap today, the total memory requirement can nevertheless become problematical when hundreds or thousands of such long samples have to be stored, but it can be reduced considerably by carefully selecting the sample rate of each sample in relation to its audio bandwidth.  For example, it is wasteful in terms of memory to use a 44.1 kHz sample rate for samples representing the lowest notes of a pedal Bourdon when a rate of 8 kHz or less would suffice.

Regardless of the length of the stored samples, yet another problem affecting time domain samplers concerns the termination of the sound.  When the key is released it is necessary for the system to jump immediately to the release phase of the sound, and if a characteristic release transient and/or the recorded ambience of the auditorium is required it can be difficult to achieve a seamless and undetectable "join" between the previous steady state phase and release.  Most systems which claim to have solved this problem in fact have shortcomings which are only detected when playing the instrument, though there are ways to achieve it.

Frequency domain synthesis also has peculiar difficulties in some situations, such as when the frequency structure of the waveform is changing rapidly.  This can occur during transient phases of the waveforms.  Capturing a complicated transient directly simply by recording it is sometimes the better, indeed the only, way to incorporate it in a digital organ.  Many transients are so complicated that they are impossible to synthesise in any other way because of issues such as partials which begin as band limited noise rather than as discrete frequencies.  Even analysing such transients can be next to impossible because of the mathematical limitations of spectrum analysis, and if you cannot analyse a transient in the first place then you have no hope of being able to re-synthesise it accurately using frequency domain techniques.  Also, the addition of realistic-sounding wind noise to the tones can often be difficult with these systems.  All these issues raise important details which will be expanded in later sections.  

Sometimes claims are made for the frequency domain system that it is better at mimicking the subtle variations which occur while a real organ pipe is speaking.  Pipes respond sensitively to their environment in myriad ways, and the onset of speech can be affected by factors such as how many other stops are drawn and how many notes are demanded simultaneously.  Pipes can interact through a common wind supply and  acoustically with others nearby, and there is some truth in the statement that a pipe never speaks the same way twice.  Critics of the time domain system point out that to incorporate such effects it would be necessary to store an impracticably large number of samples, whereas the frequency domain system can simulate them by computing the necessary variations to the waveforms while they are sounding.  In practice the truth, as usual, lies between these extremes, and time domain systems have reached a level of considerable sophistication in view of their ubiquity.  It is just as easy (or difficult) to incorporate multiple time domain samples as it is to incorporate multiple tables of harmonic amplitudes or other data.  More than ten years ago at least one make of time domain organ even then was able to simulate the gentle variations, including the “burbling”, of a reed stop while the key was held down for an extended period.  Therefore, particularly today, the ability to include these and many other effects are not solely the province of frequency domain systems.  Both systems require the storage of large amounts of data to properly simulate these subtleties, together with highly sophisticated hardware and software host environments.  In terms of information theory, a time domain system is in all circumstances and in every way equivalent to a frequency domain one.  Neither has an advantage over the other in this respect, and anyone who maintains it does must have a technical insight which implies they should apply for the next Nobel prize.

The characteristics attributable to unsteady winding were delightfully set out by a 19th century description of Austin’s Universal Wind Chest system [8]:  

“Why is a single stop so smooth?  Because the demand for wind is so small, and every stop has been voiced and tuned under the condition of the bellows being inflated and almost in a state of rest.  

To explain what otherwise happens we will suppose the bellows reservoir board to be 6ft by 6ft; this, needing a weight of 7lb per square foot, amounts to 2 ¼ cwt.  It is foolish to suppose that this heavy weight can be divested of its natural law of inertia and momentum.  Imagine now the bellows to be fully inflated and in a state of rest – all is right – equal pressure everywhere.  Now suddenly strike full chords with every stop and coupler out and heavy pedal notes added; these heavy weights on [an] upper board constructed to rise and fall, to take in, and give out any displacement, lag behind at first, obeying the law of inertia; this results in less pressure; but when it has started in its descent it obeys the law of momentum; and it may be that this sudden demand of wind is no longer needed.  This suddenly checks its downwards course, and this is the cause of momentum being added to weight.  The result then will be more pressure.  Also the intermittent thrusts of wind from the feeders, whose duty it is to keep up supply, have an equally disturbing influence from exactly the same cause.”  

The description continues in similar vein to describe robbing caused by the conveyances and trunking but not, curiously, inadequate pallets.  Perhaps this is because even Austin’s system could not dispense with these.  Nevertheless it is fascinating to thus see how fashions have swung from attempts to eradicate these “defects” to today’s strenuous endeavours to simulate them in electronic organs, and to re-introduce them in some pipe instruments!  

Another claim sometimes made in favour of frequency domain organs is that it is easier to voice them.  In order to change the harmonic structure of a stop in these instruments, it is only necessary to alter the tables of stored harmonic amplitudes.  Manufacturers’ voicing software generally facilitates this by allowing the voicer to make these adjustments in real time while a note is sounding, through the use of convenient on-screen displays containing virtual “gain controls” for each harmonic (of which there might be up to 250 or so).  However it is also straightforward to implement such a scheme for time domain organs.  Having altered the harmonic structure of a sound in a time domain organ, it is then necessary to do an inverse Fourier Transform (additive synthesis) to generate the new version of the waveform.  This has to be done each time the slightest adjustment to the harmonic amplitudes is made.  However the speed of modern computers should make this somewhat longer process transparent to the voicer.  If it does not, the voicing software and hardware being used is non-optimum and out of date.    

In practice, one area where frequency domain techniques run out of steam before time domain ones is in the number of waveform recipes which can be stored.  Storage of separate samples (or even multiple samples) for every note of every stop is quite feasible for the cheapest and most basic time domain synthesisers today - even the humble SoundFont based synths used in many computer sound cards can do this.  However the latest Bradford frequency domain organ simulator (BEST) still has difficulty in competing in this vital area according to its inventors [13], and therefore it has to fall back on interpolating between a smaller number of voicing points as described previously.  This shortcoming might reflect the rather dated technology on which the system appears to be hosted (thirty-year-old Z80/S80 low speed 8 bit processors are apparently still used in some commercial organ systems which use Bradford technology).

Today the two techniques appear to be moving closer to each other in that some frequency domain organs also rely on storage of waveform samples, such as BEST.  The addition of wind noise to the sounds of certain stops is achieved in these cases by adding sampled noise.  This is not surprising because noise contains an infinite number of Fourier components; it cannot be synthesised from a spectrum of finite size.  This illustrates another disadvantage of the frequency domain system.  Some starting transients may also be stored directly as waveforms rather than as extremely complex data tables.  

It would be comfortable to be able to say that neither system has a definite nor incontrovertible edge over the other.  Both can simulate organ tones to a high degree of realism, but both can also produce disappointing results.  As with pipe organs themselves it boils down to how well they have been voiced, assuming that the hardware and software environment has the necessary degree of flexibility and sophistication to allow the voicer to flex his muscles as it were.  And, as with pipes, the voicer has to know his business inside out, which is something which only comes from experience and an “ear” for the job. Remember that the frequency domain system arose at a time when the patent situation was weighted heavily against the majority of organ manufacturers.  This fact, rather than spurious technical claims, was largely responsible for its genesis.

But I think it is possible to make an objective choice in favour of one system over the other, and therefore I shall conclude this section on the pros and cons of the two methods by stating my own preference.  Because frequency domain ("real time") organs cannot reproduce an exact copy of the sound emitted by an organ pipe, I find them inferior to time domain ("sampling") systems in practice.  It is not possible to model, and thus program, every conceivable nuance of pipe speech for the same reasons that it is impossible to achieve accurate weather forecasts.  In both cases some factors are difficult to reduce to the mere numbers that computer programs demand, and we do not know about other factors at all.  Also the dated hardware environments on which frequency domain systems are hosted simply cannot compete in capability with those used by the synthesiser world at large for time domain synthesis.   Therefore "real time" organs tend to produce a rather blander and smoother approximation to pipe sounds than "sampling" ones, often quite acceptable but less characterful nevertheless.  The variations of the sounds from one actual pipe to another and those which occur while actual pipes are sounding tend to be captured better by "sampling" than by "real time" organs.  I have not yet heard a "real time" digital organ which comes anywhere near being able to simulate the character of a Schnitger or Silbermann instrument for example.  Manufacturers of these types of organ seem content with the numbing blandness of mid-20th century British and American organs. To illustrate this, the two clips which follow were produced by the "sampling" and "real time" synthesis methods respectively:

"Sampled" sounds - mp3 file (3.8 MB/4m 9s)

(Extracts from Mein junges Leben hat ein End, Sweelinck.  About 1/2 minute from each movement)

 

"Real time" sounds - mp3 file (719 KB/46s)

(Extract from Ave Verum, Mozart)

VOICING  

This section begins a discussion of some of the processes involved in voicing a digital organ.  The term “voicing” is used rather loosely here to include many activities, ranging from the collection of raw sound samples through to voicing operations themselves, but it is assumed the reader will have no difficulty in understanding the meaning of the terms from the contexts in which they appear.  

The voicing process requires two categories of activity.  The first includes such things as making recordings of pipe waveforms, and converting them into tables of harmonic amplitudes and other data to be synthesised back into sounds.  Thus spectrum analysis must be  included in the range of processing functions available so that the harmonic structure of the waveforms can be derived.  Much if not all of the necessary hardware and software can be obtained commercially, though a customised software suite will often be better suited to the job.  The second category includes activities associated with the minutiae of loading all these data into a particular organ and adjusting them to taste within more or less narrow limits.  This requires a software and hardware environment necessarily specific to a particular organ; there are many organs on the market and they are constantly evolving.  Moreover, details concerning this environment can usually only be obtained from the manufacturer at his discretion.  Therefore only the first category of activities can be described here.  However it covers an extremely important range of topics for the reason now to be explained.  

A digital organ when first switched on is nothing but a vast empty vacuum which cannot make the merest squeak on its own. It is about as much use as a pipe organ without any pipes.  The vacuum has to be filled by specifying almost every conceivable characteristic of every stop, sometimes on a note-by-note basis.  This statement is particularly true of frequency domain organs, which are incapable of accepting waveform samples.  In these instruments an enormous amount of data have to be laboriously inserted before even a single new stop can be heard. By “new stop” is meant one which has not been used before in that organ.  For example, merely to hear the steady state sound of a stop (i.e. neglecting its attack transient) the voicing points have to be defined, the harmonics for each voicing point have to be inserted, the way that the interpolator treats the variation of each harmonic between each pair of voicing points has to be specified, etc.  To specify the transients in addition, it is typically necessary to define which frequencies are present in the transients, how the frequencies might change with time and the specific ADSR characteristics for each.  Having done all this, the stop may then require a huge amount of additional data before aspects such as live winding can be reproduced.  In other words it is necessary to input hundreds if not thousands of items of data before a frequency domain organ can emit sounds. 

The time domain organ can be got going much more quickly because a set of samples for a new stop can be loaded into it very rapidly.  Nevertheless, the collateral information necessary to tonally finish and regulate the instrument properly is considerable and consumes a lot of time. 

Compare this situation to that for real pipes.  A pipe can only emit the sound for which it was designed, more or less.  If a diapason pipe is made it will only ever sound like a diapason, whose precise tone is determined by its scale, the cut-up, etc.  All the voicer can do is to ensure it comes onto speech properly and, within narrow limits, emits the sound which its designer and maker intended.  Obviously he could not change a diapason into a reed.  Yet that is exactly what is possible within the vacuum of a digital organ, which contains nothing intrinsically organ-like at all.  It could just as easily emit the sound of a barking dog or a car engine revving as an organ. 

This virtually unbounded flexibility is usually paraded as a benefit but in fact it is  a decided disadvantage.  It is the main reason why digital organ manufacturers have been forced to copy real organ tone.  Without an a priori detailed knowledge of the structure of pipe sounds they could never have conjured out of thin air the myriad parameters necessary to enable a digital organ to sound like a pipe organ.  This is not to deny that with sufficient experience it is possible to create a new sound without recourse to real pipe data.  But most digital organ sounds are developed on the basis of a detailed examination of pipe waveforms and, to reinforce the point again, they could never have got off the ground even in principle were it not for such examination.  Therefore the derivation, analysis and understanding of real pipe sounds materially affects all digital organs and it is the subject of prime importance in voicing.  The subsequent process of loading the results of the studies into a particular make of organ is a mechanical process by comparison.

In principle, therefore, there are two ways to insert a new sound into any digital organ: one can start either from real pipe sounds or the sound can be synthesised from scratch (but with a priori knowledge and experience).  In practice a combination of both methods is also used, often of necessity because of problems such as unacceptable levels of noise or other forms of corruption of real pipe sounds.  Noise can result from the blower, the organ action, traffic, etc.  In these cases the original data have to be processed to remove the offending artefacts.  In what follows we shall refer to these three methods as “real”, “synthetic” and “hybrid” voicing for brevity.  

Many of the steps necessary to insert a new sound into any form of digital organ are similar.  For time domain organs one would first wish to obtain a satisfactory sample of the waveform of interest, including its transient structure, using any of the three methods outlined above.  In the “synthetic” (and sometimes the “hybrid”) case the waveform would typically be generated offline from a table of harmonic amplitudes using an additive synthesis (inverse Fourier Transform) program. The harmonic amplitudes would be produced by the voicer based on his intuition, judgement and experience.  Then the waveform would be loaded into the organ system and parameters such as ADSR adjusted until it matched other samples relating to the same stop, and those of other stops.  Note that the generation of the transient portions of the waveforms in “synthetic” or “hybrid” synthesis is much more difficult than for the steady state portions, hence the frequent reliance on the “real” voicing method in practice.  

For frequency domain organs the waveform itself cannot be loaded into the system.  In the “real” case a spectrum analysis of the harmonic structure of a real pipe sound would be used to populate the necessary tables of harmonic amplitudes. In the “hybrid” case these values would be modified heuristically in some way. In the “synthetic” case the amplitudes would be created from scratch as described above.  In all cases, difficulties related to transient structure will often be encountered when dealing with frequency domain synthesis.  

Making Recordings of Organ Stops

Recordings of real organ pipes are used in the “real” and “hybrid” methods of voicing as defined above.  The advantages of using recordings of the sounds to be simulated include the ability to reproduce the starting and termination transients of the tones, which can be very complex in structure and therefore difficult or impossible to synthesise in any other way.  By simply using a recording of the waveform we do not need to concern ourselves about this complexity; the transients will be reproduced automatically as part and parcel of the process.  However this is only possible with time domain organs, or frequency domain ones which also allow samples to be used.  

Having said this, a major disadvantage is the difficulty of making recordings of sufficiently high quality to be used directly, and most people find it is only after trying to do it that the problems become apparent.  There is also an aesthetic consideration of considerable importance.  Frequently one sees a firm trumpeting the fact that recordings of one famous organ or another have been incorporated into their products, yet they seem not to realise that they are alienating a substantial part of their potential market.  By admitting they are merely adept copyists of first rate organ tone rather than being able to generate it from scratch, they are also admitting doubts about their inability to understand and implement the voicing techniques which are the subject of this article.  Some critics go so far as to argue that producing “pirated” samples from recordings of pipe organs is a form of copyright infringement [9], which does not apply to generating them synthetically.  This is why both methods are discussed here.   

Anyone who has yet to try making recordings of organ pipes will probably be surprised by the difficulty of achieving a quality sufficient for direct use in a digital sampling system.  The many issues involved include the following:

 

bullet

Where do we place the microphone(s) in relation to the pipework of interest?

  

If the recordings were to be made close to the pipes a lot of possibly unwanted noises can be introduced.  High amplitude wind noise is generally undesirable even though you might want it to be reproduced.  This is because the presence of random noise makes the selection of looping points in a time domain system almost impossible on the steady state parts of the waveform – the superimposed noise means that there is in fact no steady state as such, and the audible discontinuity in the noise will be virtually impossible to get rid of.  You cannot loop a noisy waveform!  In signal processing language it is only possible to loop deterministic, not stochastic, waveforms even though both may have stationary statistical properties.  A very long sample can sometimes be used to ease this problem so that the time between loop repetitions is many seconds, but this enormously increases the storage requirement.  Another problem associated with noisy samples is the unpleasant way the noise builds up subjectively as more stops are combined.  

Action thump or other noises are also a problem when recording close to the pipes.  If the starting transients are to be recorded it is often difficult in subsequent editing to remove the thump without  truncating the desired transient. 

Yet another problem involves subjective aspects of ambience.  Do you want the digital organ to sound much as the original pipework does in the body of the building?  If so, there is little point making recordings close to the pipes.  Under these conditions the effect of the digital organ in a smaller room will be totally different to that of the original instrument in the nave of a church, say.  But making recordings some distance away from the pipes introduces its own difficulties such as those caused by standing waves, and these are addressed later.  The difference in the harmonic structure of the tones when recorded at different distances is well understood – very close to a pipe there will invariably be large numbers of harmonics discernible even for stops such as flutes.  But the higher order harmonics decay rapidly in strength with increasing harmonic number, so that away down the building the higher ones become inaudible.  You would not usually want these unimportant harmonics to emanate from the loudspeakers of your digital organ in a small room (in a large one such as a church the issues are different). 

A definitive answer to the question of recording range cannot therefore be given and each case has to be assessed according to its circumstances.  In general you might try making recordings a few metres away from the pipes; the necessary recording level settings then tend to cut down the amount of spectrum distortion caused by reflected waves which are of lower amplitude by the time they reach the microphone.  This problem is discussed more fully later. 

bullet

Reducing Outband Noise  

 

Outband noise means any signal which is not within the frequency range of the pipe being recorded.  It is important to cut out such noise otherwise needless degradation of the sound can result.  Of particular importance is noise below the lowest frequency (the fundamental frequency) of the pipe in question, as there is absolutely no reason why the pass band of the equipment should extend below this point.  This implies the use of variable high pass analogue filters which should be set to the highest frequency which does not attenuate the fundamental.  It is important to insert the filters at the microphone inputs as some high amplitude low frequency noise, such as sub-audio wind fluctuations from the blower, can otherwise dominate the recording level settings.  Such fluctuations can be of very high amplitude if picked up by microphones with an extended low frequency response, and inserting the filters after some stages of amplification might result in saturation occurring which the filters can do nothing about. In fact it is often the blower which is the chief contributor to outband noise, and it can be effectively removed by simple high pass filtering for pipes above a certain pitch.  For lower frequency pipes, such as a quiet 16 foot pedal flue stop, blower noise can be most intrusive and extremely difficult to get rid of.   

bullet

How many and what type of microphones should be used?  

 

If you are familiar with studio recording techniques you will be aware that many separate sound channels are often recorded and later mixed down.   However you will also be aware of the school of thought which teaches that one should only use the minimum number of channels, or even that anything above two channels is definitely undesirable.  The multiple channel doctrine is really one of convenience; it enables the balance of the final mix to be adjusted to suit the tastes of the producer, performers, etc.  But the main technical shortcoming is that each microphone picks up signals from regions which are intended to be the focus for others, and all of them pick up ambience from the auditorium.  The mixing process then results in artificial peaks and troughs in the frequency response of the final mix due to the phase reinforcements and cancellations which result in the mixing process.  In turn this can result in particular notes appearing to be too strong or weak when the recording is auditioned.  For our purposes the effect will introduce spectrum distortions by artificially exaggerating or attenuating certain harmonics.   

 

The use of only one microphone removes this problem but it introduces others, the principal one being the choice of microphone position.  Because the subjective effects vary with listening position in real auditoria the question arises as to why one position is necessarily better than any other – if a hundred people were questioned they would doubtless give at least a hundred different answers..  When recording organ pipes two microphones might be used, separated by a few metres, as this enables one track or the other to be used later if standing wave effects are troublesome.  One should never combine the two tracks by replaying them monaurally, as this leads immediately to spectrum distortions as described above.   

 

As to the choice of microphones, high quality capacitor microphones will give good results; these have an extended and flat response from sub-audio frequencies to 20 kHz or more.   The use of proprietary foam windshields can help attenuate blower pressure fluctuations when recording close to the organ.  A cardioid beam pattern is preferable to an omni directional one, as this helps to reduce standing wave problems.  The axis of peak response would of course be directed towards the pipes being recorded. 

bullet

What recording medium should be used?  

 

Commercial standard analogue magnetic tape systems are almost useless if you are intending to transfer samples directly into a digital organ.  The slightest trace of wow or flutter will be painfully apparent when you try to select loop points on the digitised waveform, and it will usually defeat your best efforts to do so.  Also signal to noise ratios are barely adequate even when using professional noise reduction systems.  However the best quality equipment might just provide passable results, although it is difficult to obtain today outside the second hand market.  If you are only intending to use recordings as a basis for subsequent spectrum analysis of the sounds, such as when the “hybrid” method of voicing is used, then the requirements can be relaxed somewhat.

 

One of the best systems from a technical point of view is DAT (Digital Audio Tape) recorders, but unfortunately these are somewhat expensive.  Many users find that the use of tape, a sequential rather than random access medium, also becomes progressively more irritating when locating or repeatedly replaying tracks. If you do use DAT ensure you select a normal PCM (Pulse Code Modulation) recording mode rather than one which uses compression, otherwise some peculiar distortions of the signals can occur.  MP3 coding is completely useless for high quality work, if you have not discovered this already.

 

A Minidisc system is far cheaper while still providing the signal to noise and other benefits of digital recording.  It is also a random access data storage medium which is much more convenient in use than tape.  However in this case it is less probable that you could select a PCM recording mode and you will almost certainly have to put up with some form of data compression such as ATRAC (Adaptive Transform Acoustic Coding).  The problem with these techniques is that they are optimised on a psycho-acoustics basis mainly for the signal characteristics (crest factor, etc) of pop music, thus the long sustained tones you will want to record from an organ pipe might produce some weird distortions on replay.  However the problem is unlikely to have major implications in practice provided you use the SP (standard play) recording option rather than an LP (long play) mode.  The practical advantages of the Minidisc system are also significant: a small battery powered “walkman” – type recorder together with microphones which can be powered from the recorder itself makes for an extremely compact and convenient recording set-up.