SQL Data Generator 4

Help for older versions available.

Example Python scripts

The page is a list of example IronPython scripts you can use to generate data with the Python script generator. You can use these examples as starting points for creating your own scripts.

Book titles (SQL)

This script generates SQL Server book titles (eg SQL Server Hardware, SQL Server Statistics, Inside the SQL Server Query Optimizer):

  1. # Read an XML file
  2. # Use the CLR XML libraries
  3. import clr
  4. clr.AddReference("System.Xml")
  5. from System.Xml.XPath import XPathDocument, XPathNavigator
  6. # A sample XML file
  7. filename = r"BookStore.xml"
  8. def main(config):
  9. filepath = config["config_path"] + "\\" + filename
  10. return list(books(filepath, column_size=config["column_size"]))
  11. def books(filepath, column_size=50):
  12. # Locate all titles of books in this document
  13. doc = XPathDocument(filepath)
  14. nav = doc.CreateNavigator()
  15. expr = nav.Compile("/bookstore/book/title")
  16. titles = nav.Select(expr)
  17. for title in titles:
  18. # Truncate titles to the column size before yield
  19. yield str(title)[:column_size]

Credit card numbers (VISA style)

This script generates valid VISA style credit card numbers (eg 4289911008500748):

  1. # Generate VISA-style numbers with a Luhn checksum
  2. import clr
  3. clr.AddReference("System")
  4. from System import Random
  5. random = Random()
  6. def choice(l):
  7. return l[random.Next(len(l))]
  8. def completed_number(prefix, length):
  9. # Given a prefix and a desired length, fill in the number up
  10. # to the desired length, using Luhn to compute the checksum
  11. ccnumber = list(prefix)
  12. # Generate digits
  13. for _ in range(length - len(prefix) - 1):
  14. ccnumber.append(choice('0123456789'))
  15. # Calculate sum
  16. sum = pos = 0
  17. reversedCCnumber = list(reversed(ccnumber))
  18. while pos < length - 1:
  19. odd = int(reversedCCnumber[pos]) * 2
  20. if odd > 9: odd -= 9
  21. if pos != (length - 2): sum += int(reversedCCnumber[pos+1])
  22. sum += odd
  23. pos += 2
  24. # Calculate check digit
  25. checkdigit = ((sum / 10 + 1) * 10 - sum) % 10
  26. ccnumber.append(str(checkdigit))
  27. return ''.join(ccnumber)
  28. def main(config):
  29. for _ in range(config["n_rows"]):
  30. # VISA numbers start with a 4
  31. yield completed_number('4', 16)

Initials

This scripts generates lowercase initials (eg lg, pg, cs):

  1. # Combinatoric generator
  2. # The Python iteration and combinatorics library
  3. import itertools
  4. lowercase = "ABCDEFGHIJKLMNOPQRSTUVWXYZ".lower()
  5. def main(config):
  6. return list(combinations())
  7. def combinations():
  8. # Iterate over all pairs of letters in `lowercase`
  9. for pair in itertools.combinations(lowercase, 2):
  10. # and yield them as a string
  11. yield "".join(pair)

ISO 639-3

This script uses a URL to generate ISO 639-3 language codes (eg afa, alg, art):

  1. # Make an HTTP request and read the output
  2. import clr
  3. clr.AddReference("System.Net")
  4. from System.Net import HttpWebRequest
  5. clr.AddReference("System")
  6. from System.IO import StreamReader
  7. # geonames list of iso-laguages
  8. url = "http://download.geonames.org/export/dump/iso-languagecodes.txt"
  9. def main(config):
  10. return list(read_wiki(column_size=config["column_size"]))
  11. def read_wiki(column_size=50):
  12. # Create a CLR web request object
  13. http = HttpWebRequest.Create(url)
  14. http.Timeout = 5000
  15. http.UserAgent = "Red Gate SQL Data Generator"
  16. # Read the response with a CLR StreamReader
  17. response = http.GetResponse()
  18. responseStream = StreamReader(response.GetResponseStream())
  19. html = responseStream.ReadToEnd()
  20. # Yield all lines that start with a *,
  21. # truncated to the column width
  22. for line in html.splitlines():
  23. if not line.startswith("ISO"):
  24. items = line.split('\t')
  25. yield items[0] #ISO 639-3
  26. #yield items[1] #ISO 639-2
  27. #yield items[2] #ISO 639-1
  28. #yield items[3] #Language name

Numbered labels

This script generates numbered labels (eg Label 42, Label 43, Label 44):

  1. # Specify a string stem as constant and append an incrementing number to it
  2. def main(config):
  3. return list(Sequence(config["n_rows"])) # The max number of rows available
  4. def Sequence(max):
  5. for i in range(1, max + 1): # Modify the range start and end points in order to offset the row number
  6. yield "Label {0}".format(i) # Format the output string here

US cities

This script generates US city names (eg New York, Los Angeles, Chicago):

  1. # Open a file and filter the results
  2. filename = r"cities.txt"
  3. def main(config):
  4. filepath = config["config_path"] + "\\" + filename
  5. return list(cities(filepath))
  6. def cities(filepath):
  7. with open(filepath) as cities:
  8. for city in cities:
  9. if city.startswith(tuple("ABDCEFG")):
  10. yield city.strip()

Years of service

This script uses a date of birth column to generate years of service numbers (eg 1, 23, 5):

To use this script, on line 5, replace InsertColumnName with the name of a column in your table that contains dates of birth.

  1. # This example takes a column as a date of birth
  2. # Then work out a random number of years service
  3. # This if this is > now the person is still working at this company
  4. def main(config):
  5. dateOfBirth = InsertColumnName
  6. workingAge = 18
  7. age = (DateTime.Now - dateOfBirth).TotalDays/365
  8. numberofYearsWorking = age - workingAge
  9. #Initialize Randomness only on the first row
  10. if (config["row_count"] == 0):
  11. # could use python random number generator
  12. # and store the value in the config
  13. session.SetupNormalGenerator(False, config["seed"], 1.0, 50, 10, 0.2)
  14. if (numberofYearsWorking > 0) :
  15. yearsOfService = session.GetNextRandomNumber()
  16. if (yearsOfService > numberofYearsWorking) :
  17. #Person still working at this company
  18. return numberofYearsWorking
  19. else:
  20. return yearsOfService
  21. return 0 #Too young

 

 


Didn't find what you were looking for?