In the world's of today, citizens' data privacy is taken seriously. This is of course due to global awareness of social network impact on society where everyone is publicly exposing personal and professional information. This access to the world and speed to communication comes with a cost - your public data could be used by organisations. That leads to regulations like the European's General Data Protection Regulation (a.k.a. GDPR) that can lead to 4% of the worldwide annualturnover if it is not respected, that is, if enterprises aren't able to explain how privacy data are used and cannot apply the 'Right to be forgotten' to any citizen in Europe! In this talk, we'll cover what is actually Data Science Governance, how important it is for Enterprise having legacy systems, but it is even more true for labs and units working on Data Science project. Hence, the second part of the talk will focus on what Data Science Governance can be delivered in a context of using Apache Spark in Scala.