I have a spreadsheet of employee data (time charged to projects and customers) that is tens of thousands of rows long. Each month, the file is updated to include the most recent month data. Each row represents one employee's charges (each employee generally has multiple charges each month).
I need to identify approximately 10-15 "bad combinations" of data, where essentially employees are charging time to the wrong customer/project, etc. For example, the "QC-Europe" team should not be charging to customer "Ford" and the "Engineering" department should not be using the sub-process of "Design", etc etc.
I'm debating the best approach for this. My current path is to create tables of all "good combinations" and do lookups off of that, but maybe there is a more efficient method? Thanks!
rough file layout:
Employee Dept Sub-Process Customer Jan Feb Mar Apr etcJoe Engineering Build Ford 10 15 8 30