Generation of Data with Zipfian Distribution for Database Testing

Edition: Scientific Conference Unitech 2010

Field: Computer Systems

Pages: 339-343

Abstract:
Testing against real data in large databases is often very hard. To test certain aspects of database performance, data must be generated synthetically, using the distribution that best fits the real data. Many papers state that their data was generated using a Zipfian distribution with appropriate z values, but they do not explain the process of generating such a data set; they give only pieces of the complete scenario. We have collected those pieces and compared the published results with our own. In this paper we present the process of generating synthetic data for testing the accuracy of approximate queries in large databases, based on knowledge gathered from those papers and on what we discovered ourselves.
Keywords: Zipfian distribution, cumulative distribution, synthetically generated data, database
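
As a rough illustration of the kind of generation the abstract refers to, the sketch below draws ranks from a Zipfian distribution by inverting its cumulative distribution. It is not the procedure from the paper, which this record does not reproduce. The function names `zipf_cdf` and `generate_zipf`, the parameters `n` (number of distinct values), `z` (skew) and `size` (number of generated values), and the choice P(rank) proportional to 1 / rank^z are all assumptions made for this example.

```python
import bisect
import random
from itertools import accumulate


def zipf_cdf(n, z):
    """Cumulative probabilities for ranks 1..n, assuming P(rank) ~ 1 / rank**z."""
    weights = [1.0 / rank ** z for rank in range(1, n + 1)]
    total = sum(weights)
    cdf = list(accumulate(w / total for w in weights))
    cdf[-1] = 1.0  # guard against floating-point drift in the last entry
    return cdf


def generate_zipf(n, z, size, seed=None):
    """Draw `size` ranks in 1..n by inverse-transform sampling on the CDF."""
    rng = random.Random(seed)
    cdf = zipf_cdf(n, z)
    # For each uniform u in [0, 1), pick the first rank whose cumulative
    # probability is at least u.
    return [bisect.bisect_left(cdf, rng.random()) + 1 for _ in range(size)]


if __name__ == "__main__":
    sample = generate_zipf(n=10, z=1.0, size=20, seed=42)
    print(sample)
```

With z = 0 the weights are all equal and the generator degenerates to a uniform distribution; larger z values concentrate more of the generated rows on the lowest ranks.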
Download citation:

BibTeX format
@inproceedings{article,
  author    = {S. Ilić and D. Radosavljević and V. Stojanović},
  title     = {Generation of Data with Zipfian Distribution for Database Testing},
  booktitle = {Scientific Conference Unitech 2010},
  year      = 2010,
  pages     = {339--343}}
RefWorks Tagged format
RT Conference Proceedings
A1 Siniša Ilić
A1 Dragana Radosavljević
A1 Vidosav Stojanović
T1 Generation of Data with Zipfian Distribution for Database Testing
AD Scientific Conference Unitech, Gabrovo, Bulgaria
YR 2010
Preformatted citation
S. Ilić, D. Radosavljević and V. Stojanović, Generation of Data with Zipfian Distribution for Database Testing, Scientific Conference Unitech, 2010, pp. 339-343