Hello brother,
I understand you are busy, but I feel you can help me with some SQL interview questions that I am not able to answer.
I am a junior SQL Developer and have been following and referencing codeproject.com whenever I need some help.
A friend who applied for a job at a certain company was asked some questions, and I was able to answer them except for those listed below.
I have copied and pasted them here, and I am requesting your help and guidance in answering these three questions.
I really appreciate your help and time.
Q1- A 100 GB flat file containing 18 columns of fact data is being dropped daily on your server. Describe the architecture (both hardware and software) required to 1) Recognize that a file has arrived and is ready for ingestion 2) Decompress the file and extract it into a staging schema in a reasonable amount of time (100GB = ~300M rows).
Q2- A work table named WorkContracts has the following columns: ContractId, Product, and Value. A dimension named DimContracts has a surrogate key field called ContractKey and the same 3 columns above. ContractId is the unique business key. Write the most efficient statement that produces all rows in WorkContracts that are not in DimContracts.
Q3- A new column, EmailOptOut, has been added to the source OLTP system in the Customer table. Describe the steps required to propagate this new data all the way through the data warehouse system to a new attribute in the Customer dimension in the cube.
Thank you for your time, again.
Joseph
What I have tried:
I answered the other questions but I can't do these. I just need guidance.
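To make Q2 concrete, here is a minimal sketch of one possible reading of it, not a confirmed answer. It assumes "not in DimContracts" means "no dimension row has the same ContractId", since ContractId is stated to be the unique business key. The sample rows are invented for illustration, and SQLite (through Python's sqlite3 module) stands in for whatever database the interviewer had in mind. The statement itself is a plain NOT EXISTS anti-join.

```python
import sqlite3

# In-memory database used only as a stand-in for the real warehouse
# (assumption: the actual system is some other RDBMS; the query is standard SQL).
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE WorkContracts (
        ContractId INTEGER PRIMARY KEY,
        Product    TEXT,
        Value      REAL
    );
    CREATE TABLE DimContracts (
        ContractKey INTEGER PRIMARY KEY,   -- surrogate key
        ContractId  INTEGER UNIQUE,        -- business key
        Product     TEXT,
        Value       REAL
    );
    -- Hypothetical sample data, not from the original question.
    INSERT INTO WorkContracts VALUES (1, 'A', 10.0), (2, 'B', 20.0), (3, 'C', 30.0);
    INSERT INTO DimContracts (ContractId, Product, Value)
        VALUES (1, 'A', 10.0), (3, 'C', 30.0);
""")

# Anti-join on the business key: keep work rows with no matching dimension row.
ANTI_JOIN = """
    SELECT w.ContractId, w.Product, w.Value
    FROM WorkContracts AS w
    WHERE NOT EXISTS (
        SELECT 1
        FROM DimContracts AS d
        WHERE d.ContractId = w.ContractId
    )
    ORDER BY w.ContractId
"""

missing_rows = conn.execute(ANTI_JOIN).fetchall()
print(missing_rows)  # [(2, 'B', 20.0)]
```

If the question instead means rows that differ in any of the three columns, comparing on all of ContractId, Product, and Value (for example with EXCEPT) would be the alternative reading. Which form is "most efficient" depends on the database engine and its indexes, so that part is left open here.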