Share to: share facebook share twitter share wa share telegram print page

Database design

Database design is the organization of data according to a database model. The designer determines what data must be stored and how the data elements interrelate. With this information, they can begin to fit the data to the database model.[1] A database management system manages the data accordingly.

Database design is a process that consists of several steps.

Conceptual data modeling

The first step of database design involves classifying data and identifying interrelationships. The theoretical representation of data is called an ontology or a conceptual data model.

Determining data to be stored

In a majority of cases, the person designing a database is a person with expertise in database design, rather than expertise in the domain from which the data to be stored is drawn e.g. financial information, biological information etc. Therefore, the data to be stored in a particular database must be determined in cooperation with a person who does have expertise in that domain, and who is aware of the meaning of the data to be stored within the system.

This process is one which is generally considered part of requirements analysis, and requires skill on the part of the database designer to elicit the needed information from those with the domain knowledge. This is because those with the necessary domain knowledge often cannot clearly express the system requirements for the database as they are unaccustomed to thinking in terms of the discrete data elements which must be stored. Data to be stored can be determined by Requirement Specification.[2]

Determining data relationships

Once a database designer is aware of the data which is to be stored within the database, they must then determine where dependency is within the data. Sometimes when data is changed you can be changing other data that is not visible. For example, in a list of names and addresses, assuming a situation where multiple people can have the same address, but one person cannot have more than one address, the address is dependent upon the name. When provided a name and the list the address can be uniquely determined; however, the inverse does not hold – when given an address and the list, a name cannot be uniquely determined because multiple people can reside at an address. Because an address is determined by a name, an address is considered dependent on a name.

(NOTE: A common misconception is that the relational model is so called because of the stating of relationships between data elements therein. This is not true. The relational model is so named because it is based upon the mathematical structures known as relations.)

Conceptual schema

The information obtained can be formalized in a diagram or schema. At this stage, it is a conceptual schema.

ER diagram (entity–relationship model)

A sample entity–relationship diagram

One of the most common types of conceptual schemas is the ER (entity–relationship model) diagrams.

Attributes in ER diagrams are usually modeled as an oval with the name of the attribute, linked to the entity or relationship that contains the attribute.

ER models are commonly used in information system design; for example, they are used to describe information requirements and / or the types of information to be stored in the database during the conceptual structure design phase.[3]

Logical data modeling

Once the relationships and dependencies amongst the various pieces of information have been determined, it is possible to arrange the data into a logical structure which can then be mapped into the storage objects supported by the database management system. In the case of relational databases the storage objects are tables which store data in rows and columns. In an Object database the storage objects correspond directly to the objects used by the Object-oriented programming language used to write the applications that will manage and access the data. The relationships may be defined as attributes of the object classes involved or as methods that operate on the object classes.

The way this mapping is generally performed is such that each set of related data which depends upon a single object, whether real or abstract, is placed in a table. Relationships between these dependent objects are then stored as links between the various objects.

Each table may represent an implementation of either a logical object or a relationship joining one or more instances of one or more logical objects. Relationships between tables may then be stored as links connecting child tables with parents. Since complex logical relationships are themselves tables they will probably have links to more than one parent.

Normalization

In the field of relational database design, normalization is a systematic way of ensuring that a database structure is suitable for general-purpose querying and free of certain undesirable characteristics—insertion, update, and deletion anomalies that could lead to loss of data integrity.

A standard piece of database design guidance is that the designer should create a fully normalized design; selective denormalization can subsequently be performed, but only for performance reasons. The trade-off is storage space vs performance. The more normalized the design is, the less data redundancy there is (and therefore, it takes up less space to store), however, common data retrieval patterns may now need complex joins, merges, and sorts to occur – which takes up more data read, and compute cycles. Some modeling disciplines, such as the dimensional modeling approach to data warehouse design, explicitly recommend non-normalized designs, i.e. designs that in large part do not adhere to 3NF. Normalization consists of normal forms that are 1NF, 2NF, 3NF, Boyce-Codd NF (3.5NF), 4NF, 5NF and 6NF.

Document databases take a different approach. A document that is stored in such a database, typically would contain more than one normalized data unit and often the relationships between the units as well. If all the data units and the relationships in question are often retrieved together, then this approach optimizes the number of retrieves. It also simplifies how data gets replicated, because now there is a clearly identifiable unit of data whose consistency is self-contained. Another consideration is that reading and writing a single document in such databases will require a single transaction – which can be an important consideration in a Microservices architecture. In such situations, often, portions of the document are retrieved from other services via an API and stored locally for efficiency reasons. If the data units were to be split out across the services, then a read (or write) to support a service consumer might require more than one service calls, and this could result in management of multiple transactions, which may not be preferred.

Physical design

Physical data modeling

The physical design of the database specifies the physical configuration of the database on the storage media. This includes detailed specification of data elements and data types.

Other physical design

This step involves specifying the indexing options and other parameters residing in the DBMS data dictionary. It is the detailed design of a system that includes modules & the database's hardware & software specifications of the system. Some aspects that are addressed at the physical layer:

  • Security – end-user, as well as administrative security.
  • Performance – mainly addressed via indexing for the read/update/delete queries, data type choice for insert queries
  • Replication – what pieces of data get copied over into another database, and how often. Are there multiple-masters, or a single one?
  • High-availability – whether the configuration is active-passive, or active-active, the topology, coordination scheme, reliability targets, etc all have to be defined.
  • Partitioning – if the database is distributed, then for a single entity, how is the data distributed amongst all the partitions of the database, and how is partition failure taken into account.
  • Backup and restore schemes.

At the application level, other aspects of the physical design can include the need to define stored procedures, or materialized query views, OLAP cubes, etc.

Example: relational database data modeling

The following steps are suggestion of the data modeling process for Microsoft Access, a relational DBMS.

  1. Determine the purpose of the database – This helps prepare for the remaining steps.
  2. Find and organize the information required – Gather all of the types of information to record in the database, such as product name and order number.
  3. Divide the information into tables – Divide information items into major entities or subjects, such as Products or Orders. Each subject then becomes a table.
  4. Turn information items into columns – Decide what information needs to be stored in each table. Each item becomes a field, and is displayed as a column in the table. For example, an Employees table might include fields such as Last Name and Hire Date.
  5. Specify primary keys – Choose each table's primary key. The primary key is a column, or a set of columns, that is used to uniquely identify each row. An example might be Product ID or Order ID.
  6. Set up the table relationships – Look at each table and decide how the data in one table is related to the data in other tables. Add fields to tables or create new tables to clarify the relationships, as necessary.
  7. Refine the design – Analyze the design for errors. Create tables and add a few records of sample data. Check if results come from the tables as expected. Make adjustments to the design, as needed.
  8. Apply the normalization rules – Apply the data normalization rules to see if tables are structured correctly. Make adjustments to the tables, as needed.[4]

See also

References

  1. ^ Teorey, T.J., Lightstone, S.S., et al., (2009). Database Design: Know it all.1st ed. Burlington, MA.: Morgan Kaufmann Publishers
  2. ^ Teorey, T.; Lightstone, S. and Nadeau, T.(2005) Database Modeling & Design: Logical Design, 4th edition, Morgan Kaufmann Press. ISBN 0-12-685352-5
  3. ^ Javed, Muhammad; Lin, Yuqing (2018). "Iterative Process for Generating ER Diagram from Unrestricted Requirements". Proceedings of the 13th International Conference on Evaluation of Novel Approaches to Software Engineering. SCITEPRESS – Science and Technology Publications: 192–204. doi:10.5220/0006778701920204. ISBN 978-989-758-300-1.
  4. ^ Database design basics. (n.d.). Database design basics. Retrieved May 1, 2010, from https://support.office.com/en-US/article/Database-design-basics-EB2159CF-1E30-401A-8084-BD4F9C9CA1F5

Further reading

  • S. Lightstone, T. Teorey, T. Nadeau, "Physical Database Design: the database professional's guide to exploiting indexes, views, storage, and more", Morgan Kaufmann Press, 2007. ISBN 0-12-369389-6
  • M. Hernandez, "Database Design for Mere Mortals: A Hands-On Guide to Relational Database Design", 3rd Edition, Addison-Wesley Professional, 2013. ISBN 0-321-88449-3

Read other articles:

Untuk kelompok suku di Filipina, lihat Orang Bisayak. Suku BisayaOrang BisayaBusana tradisional Bisaya Sarawak.Daerah dengan populasi signifikan Brunei: 43,000[1]  MalaysiaSarawak: 23,000[2]Sabah: 22,000[3] USA: 14,000[4]BahasaBahasa Bisaya Sabah, Malaysian, Bahasa Bisaya Brunei, InggrisAgamaKebanyakan Islam dan minoritas signifikan Kristen dan AnimisKelompok etnik terkaitMurut, Kadazan-Dusun, Dayak Bisaya adalah penduduk asli barat laut dan pesisir…

Homeotic protein bicoidIdentifiersOrganismeDrosophila melanogasterSimbolbcdUniProtP09081PencarianStrukturSwiss-modelDomainInterPro (Atas) Gradien protein Bicoid Nuklir dalam embrio Drosophila transgenik tetap yang membawa gen fusi Bicoid-GFP. Gambar milik Julien O. Dubuis dan Thomas Gregor. (Bawah) Protein bicoid-GFP (hijau) dan mRNA bicoid berlabel IKAN (merah) di ujung anterior embrio Drosophila transgenik tetap. Kedua embrio berorientasi dengan kutub anterior di sebelah kiri. Gambar milik Sha…

Questa voce o sezione sull'argomento Svezia non cita le fonti necessarie o quelle presenti sono insufficienti. Puoi migliorare questa voce aggiungendo citazioni da fonti attendibili secondo le linee guida sull'uso delle fonti. Smålandprovincia storica LocalizzazioneStato Svezia RegioneGötaland Contea Kalmar Kronoberg Jönköping TerritorioCoordinate57°06′03.6″N 14°53′52.8″E / 57.101°N 14.898°E57.101; 14.898 (Småland)Coordinate: 57°06′03.…

Tindik lorum. Tindik lorum adalah suatu tindik kelamin yang terletak di perbatasan antara batang penis dengan kantung buah zakar (skrotum).[1][2] Kata lorum adalah lakuran dari kata low (bhs. Inggris, artinya 'rendah') dan frenum (suatu bagian anatomi kelamin pria), karena tindik ini pada dasarnya seperti tindik frenum tapi ditempatkan di posisi terbawah. Tindik ini sekilas mirip dengan hafada tetapi memiliki perbedaan; pada tindik lorum, anting-anting berada di perbatasan dengan…

Apache TrailSutradaraRichard ThorpeRichard RossonProduserSamuel MarxSkenarioMaurice GeraghtyCeritaErnest HaycoxPemeranLloyd NolanDonna ReedWilliam LundiganAnn AyarsConnie GilchristChill WillsPenata musikSol KaplanSinematograferSidney WagnerPenyuntingFrank SullivanPerusahaanproduksiMetro-Goldwyn-MayerDistributorLoew's Inc.Tanggal rilis 24 Juni 1942 (1942-06-24) Durasi66 menitNegaraAmerika SerikatBahasaInggris Apache Trail adalah sebuah film koboi Amerika Serikat tahun 1942 garapan Rich…

Pour les articles homonymes, voir Pierre Ier. Île Pierre IerPeter I Øy (no) Une partie côtière de l'île Pierre Ier. Géographie Continent Antarctique Localisation Mer de Bellingshausen (océan Pacifique) Coordonnées 68° 50′ 00″ S, 90° 35′ 00″ O Superficie 243 km2 Point culminant Pic Lars-Christensen (1 755 m) Géologie Île volcanique Administration Statut Territoire revendiqué par la Norvège Démographie Population Aucun h…

Etika Protestan dan Semangat Kapitalisme (bahasa Inggris: The Protestant Ethic and the Spirit of Capitalism) adalah sebuah buku yang ditulis oleh Max Weber, seorang ekonom dan sosiolog Jerman pada 1904 dan 1905 yang mulai sebagai sebuah seri esai. Edisi awal dalam bahasa Jerman dan berjudul: Die protestantische Ethik und der 'Geist' des Kapitalismus. Terjemahan ke bahasa Inggris dibuat pada 1930 dan beberapa edisi telah diedarkan. Weber menulis bahwa kapitalisme berevolusi ketika etika Protestan…

Su-25 A Georgian Su-25UB Jenis Pesawat serang antigerilya Pembuat Sukhoi Design Bureau Penerbangan perdana 22 February 1975 (T8) Diperkenalkan 19 July 1981 Status In service Pengguna utama RusiaBelarusia UkrainiaKorea Utara Dibuat 1978–2014 Jumlah 1,024 Harga satuan US$11 million Varian Sukhoi Su-28 Sukhoi Su-25 (Kode NATO: Frogfoot) adalah sebuah pesawat tempur dengan tempat duduk tunggal, bermesin ganda dengan pendorong jet diroduksi di Uni Soviet oleh Biro Perancang Sukhoi. Desain…

ألفية: ألفية 3 قرون: القرن 20 – القرن 21 – القرن 22 عقود: عقد 1970  عقد 1980  عقد 1990  – عقد 2000 –  عقد 2010  عقد 2020  عقد 2030 سنين: 2004 2005 2006 – 2007 – 2008 2009 2010 2007 في التقاويم الأخرىتقويم ميلادي2007MMVIIتقويم هجري1427–1428تقويم هجري شمسي1385–1386تقويم أمازيغي2957من بداية روما2760تقو…

Untuk patologis Amerika Serikat, lihat George Kenneth Mallory. George MalloryLahirGeorge Herbert Leigh Mallory(1886-06-18)18 Juni 1886Mobberley, Cheshire, Inggris, Britania RayaMeninggal8–9 Juni 1924 (usia 37)Wilayah Utara, Gunung Everest, TibetSebab meninggalKecelakaan mendaki gunungPenemuan jasad1 Mei 1999AlmamaterMagdalene College, CambridgePekerjaanGuru, Pendaki gunungSuami/istriRuth Dixon Turner ​ ​(m. 1914⁠–⁠1924)​Karier militer…

Basilika San LorenzoBasilika Minor Santo Laurensiusbahasa Italia: Basilica di San LorenzoBasilika San LorenzoBasilika San LorenzoLokasi di Firenze43°46′29.7″N 11°15′13.9″E / 43.774917°N 11.253861°E / 43.774917; 11.253861Koordinat: 43°46′29.7″N 11°15′13.9″E / 43.774917°N 11.253861°E / 43.774917; 11.253861LokasiFirenze, ToskanaNegara ItaliaDenominasiGereja Katolik RomaSejarahDedikasiSanto LaurensiusTanggal konsekrasi3…

Questa voce sull'argomento poeti francesi è solo un abbozzo. Contribuisci a migliorarla secondo le convenzioni di Wikipedia. Jean de La Taille Jean de La Taille (Bondaroy, 1540 – Parigi, 1607) è stato un poeta e drammaturgo francese. Indice 1 Biografia 2 Note 3 Bibliografia 4 Altri progetti 5 Collegamenti esterni Biografia Studiò discipline umanistiche a Parigi con Muretu e legge a Orléans con Anne de Bourg. Nella prima fase della sua carriera fu ugonotto, ma successivamente si conver…

Yangon–Mawlamyine Railway ရန်ကုန်-မော်လမြိုင် ရထားလမ်းOverviewOwnerMyanma RailwaysLocaleYangon Region, Bago Region, Mon StateOperationOperator(s)Myanma RailwaysTechnicalSystem length286.7 km (178.1 mi)Track gauge1,000 mm (3 ft 3+3⁄8 in) Yangon–Mawlamyine Railway (Burmese: ရန်ကုန်-မော်လမြိုင် ရထားလမ်း) is a railway line in Myanmar and is operated by Myanm…

Association football club in Greece Main article: Panionios G.S.S. Football clubPanioniosFull nameΠανιώνιος Γυμναστικός Σύλλογος ΣμύρνηςPaniónios Gymnastikós Sýllogos Smýrnis(Pan-Ionian Gymnastic Club of Smyrna)Nickname(s)Κυανέρυθροι (Red and blue) Ιστορικός (Historic)Πάνθηρες (Panthers)Founded14 September 1890; 133 years ago (1890-09-14) (as Orpheus Smyrni)GroundNea Smyrni StadiumCapacity11,700OwnerParent ClubC…

American TV series or program Pretty WildCastGenreRealityDirected byAmber MazzolaStarring Tess Taylor Alexis Neiers Gabrielle Neiers Andrea Arlington Country of originUnited StatesOriginal languageEnglishNo. of seasons1No. of episodes9ProductionExecutive producers Amber Mazzola Barry Katz Bill Thompson Brian Volk-Weiss Chelsea Handler Dan Levy Tom Brunelle Running time40–44 minutesProduction companies Borderline Amazing Productions Five Five Productions New Wave Entertainment Original releaseN…

American diplomat Joseph R. Donovan Jr.United States Ambassador to IndonesiaIn officeJanuary 12, 2017 – February 14, 2020PresidentBarack ObamaDonald TrumpPreceded byRobert O. Blake Jr.Succeeded bySung Kim Personal detailsAlma materGeorgetown University's School of Foreign ServiceNaval Postgraduate School Donovan greets Secretary of State Pompeo in Jakarta in 2018 Joseph R. Donovan Jr. is a former American diplomat who previously served as United States Ambassador to Indonesia. Joseph …

Opera by Gioachino Rossini Maometto IIOpera by Gioachino RossiniRossini c. 1820LibrettistCesare della ValleLanguageItalianBased onValle's play Anna ErizoPremiere3 December 1820 (1820-12-03)Teatro di San Carlo, Naples Maometto II (or Maometto secondo) is an 1820 opera in two acts by Gioachino Rossini to an Italian libretto by Cesare della Valle. Set in the 1470s during a time of war between the Turks and Venetians, the work was commissioned by the Teatro di San Carlo in Naples. Del…

Company Clerical Medical Investment Group LimitedCompany typeSubsidiaryIndustryFinancial servicesFounded1824; 200 years ago (1824)HeadquartersLondon, England, UKProductslife assurancePensionsInvestmentsParentLloyds Banking GroupWebsitewww.clericalmedical.co.uk Clerical Medical is a British life assurance, pensions and investments company founded in 1824,[1] and a subsidiary of Lloyds Banking Group. History In 1824, Dr George Pinckard formed a committee of physicians and…

Sungai TempurungSungai TempurungLokasiNegaraIndonesiaProvinsiKalimantan SelatanKotaBanjarmasinCiri-ciri fisikMuara sungaiSungai Martapura - lokasiBanjarmasin Timur - koordinat3°18′40″S 114°36′56″E / 3.31099°S 114.61553°E / -3.31099; 114.61553Panjang1.252 m (0,778 mi)Lebar  - minimum19 m (0,012 mi) - rata-rata20 m (0,012 mi) - maksimum21 m (0,013 mi)Informasi lokalZon…

English tenor and academic John PotterPotter performing with Trio MediævalOccupations classical singer academic Years active1970s - presentWebsitejohn-potter.co.uk John Potter is an English tenor and academic. Early life and education John Potter's musical education began as a chorister in the Choir of King's College, Cambridge,[1] after which he became a scholar at The King's School, Canterbury and exhibitioner at Gonville and Caius College, Cambridge. His coaches included lieder …

Kembali kehalaman sebelumnya