by DL Keeshin
March 17, 2025
In my last couple of posts, I covered the kDS Discovery app's design strategy for summarizing interview data. We looked at how structured responses and automated summaries help make sense of interviews. Now, let's switch gears and talk about using large language models to generate context-aware test data.
Instead of relying on static datasets or spending time writing out sample responses by hand, we use a large language model to create realistic, context-aware test data on the fly. This approach makes testing faster, more accurate and reduces manual effort.
Creating realistic test data is a challenge, especially for dynamic apps like kDS Discovery. Traditional methods rely on static datasets or manually crafted responses, which quickly become outdated. Large language models (LLMs) like GPT-4 offer a smarter solution, generating diverse, context-aware test data on demand.
The Python script, generate_test_answer.py leverages GPT-4 to create realistic test answers for interview questions stored in the kDS Discovery test database. This approach not only streamlines the testing process but also ensures that the responses reflect real-world scenarios based on industry descriptions and user roles.
query = """
SELECT email_, role_, respondent_id, industry_description,
subsidiary_, business_unit, question_text, question_id
FROM temp.vw_lookup_for_answer_test
ORDER BY sort_order LIMIT 20;
"""
This SQL query retrieves a sample of 20 questions, ensuring that the test data covers a variety of topics.
prompt = (
f"Based on the following industry_description and role, please generate a sample answer..."
)
The prompt construction ensures that GPT-4 understands the industry context and expected response format.
cur.execute("""
CALL interview.up_insert_answer(%s, %s, %s, %s, %s);
""", (question_id, respondent_id, json.dumps(sample_answer), answer_date, source))
This step inserts the generated answers into the database, making them available for testing.
Leveraging GPT-4 for test data generation is an innovative approach that simplifies and improves the accuracy of test cases in applications like kDS Discovery. By automating this process, we can ensure that the test data remains relevant, scalable, and reflective of real-world scenarios.
As LLMs continue to evolve, their ability to generate structured and meaningful responses will become an invaluable tool for developers looking to streamline testing and data generation workflows.
Also, as mentioned previously, my company, kDS LLC, is actively seeking organizations interested in acquiring a beta version of the kDS Data Source Discovery App. If your organization is interested, please let us know using the contact form below. We'd love to collaborate.
As always, thanks for stopping by!