We are pleased to announce Grid Dynamics and the Fest Group will have an Online Big Data meetup. Join us for a virtual event to learn how companies leverage Big Data experience and advantage analytics to deliver big ideas in building data platforms.
Join us on June 4 at 6 PM MSK
Key takeaways:
- How modern Big Data tools helps to stabilize operations
- How to optimize monitoring, restatement and support on the project
- How to adjust marketing models using data
___________________________________________________________________
Talk 1 -- Vladimir Baev, Senior Big Data Developer, Grid Dynamics --
“Data processing in production: Apache Airflow will orchestrate it for you”.
Apache Airflow is a popular and widely used orchestration platform for building ETL Data Pipelines. The amount of data is increasingly growing and it should be appropriately processed with corresponding SLAs, taking into account possible outages of distributed systems and the nature of data sources. To achieve this, we not only need processing engines, but also orchestration tools to connect all the components and guarantee appropriate scheduling.
We will start with an overview of the Airflow core concepts and take a more in-depth look from the perspective of production requirements and use cases: which Airflow features are extremely useful in the production environment, and which could potentially damage your system?
During the presentation, we are also going to discuss possible extensions and tools, which may simplify the Big Data Engineer’s life.
Moreover, we will take a deeper look from the perspective of development and usage: which features might be very helpful and which might damage your production?
In addition we are going to focus on the following topics:
- Why choose Apache Airflow and do we have alternatives?
- What does the Airflow pipeline look like?
- How to build a flexible orchestration as a code with CI/CD and tests?
- Monitoring, restatement and support
- Integration with clouds and third party services
_______________________________________________________________
Talk 2 -- Oleksandr Fedirko, CEE Regional Head of BigData Practice, GlobalLogic -- “Marketing Data Lake in the Cloud”.
“This is a story of my journey to the world of marketing. I will cover the business area and the products that I was involved in. Will jump to the technical perspective and tools set alongside with the architectural overview. I will cover challenges that we faced while building data lake in the Azure cloud. Then focus on the next steps of evolution.
Marketing things are boring, aren’t they ?
- Starting points
- Challenges on a project
- Next steps and evolution
- Conclusions
RSVP here: https://bit.ly/2zstVN6
Организатор: Grid Dynamics I Big Data community I St.Pet.
Dynamic Talks -- это серия технических митапов с докладами от экспертов в области Java, Big Data, Data Science, Devops, Machine Learning, Artificial Intelligence и других областях. Митапы Dynamic Talks проводятся в США, Польше, Сербии, Украине, России.
Среди наших клиентов - такие лидеры мирового рынка, как Google, American Eagle, eBay, Microsoft, VMware, PayPal, Yahoo!, Macy’s.
Добро пожаловать и следите за обновлениями на сайте и в соцсетях Grid Dynamics --
https://www.facebook.com/lifeatgriddynamics
https://twitter.com/GridDynamics