key steps for large data projects