All Data and AI Weekly #189 - May 12, 2025


( AI, Data, NiFi, Iceberg, Polaris, Streamlit, Flink, Kafka, Python, Java, SQL, Unstructured Data )
https://bsky.app/profile/paasdev.bsky.social
NiFi + AI + AI Data Cloud + Iceberg.
https://www.reddit.com/r/DataEngineeringForAI/hot/
Boston - May 14, 2025 - https://www.dbta.com/DataSummit/2025/Timothy-Spann.aspx
https://github.com/sfc-gh-tspann/DataAIDemos/blob/main/airquality.sql
https://www.slideshare.net/slideshow/14may2025_tspann_fromairqualityunstructureddata-pdf/277680861
https://medium.com/@tim.spann_50517/real-time-enrichment-of-air-quality-data-26564464b2a5
https://www.youtube.com/watch?v=YJhRcXFNv2M
Monthly NYC and YouTube Events
Snowflake Tips
"Tim, I need to back up some data in Snowflake."
Make sure your Time Travel retention time is set high enough: 30 to 60 days usually makes sense, and you may want the full 90. Make a zero-copy clone at the point in time you care about so you can instantly compare the current data against what it was at that point. You can also export your data to cloud storage or replicate it to other accounts. There are lots of options here, so there is no need to worry about data loss. Just travel back in time.
Zero Copy Clone
Storage Considerations
Create clones of databases at/before a point in time
Time Travel to Clone Databases / Schemas / Tables at a Point in Time
Set your retention time in days (up to 90 days)
Replicate Across Accounts / Regions / Clouds
Export Data to Cloud Storage
If you wish to export the data to an S3 stage, you can do that as well.
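The tips above can be sketched as a handful of Snowflake SQL statements. The Python helper below is a minimal sketch, not a definitive backup routine: it only builds the statements as strings so you can review them before running them through your own connection. The function name `backup_statements`, the table name `AIRQUALITY`, the clone name `AIRQUALITY_BACKUP`, and the stage path `backup_stage/airquality/` are all hypothetical. The stage is assumed to exist already and to point at your S3 bucket, and retention beyond 1 day assumes Enterprise Edition or higher.

```python
def backup_statements(table, clone_name, stage_path,
                      retention_days=90, offset_seconds=-3600):
    """Build Snowflake SQL for a basic backup routine (illustrative sketch).

    All object names are supplied by the caller and are interpolated as-is,
    so only pass trusted identifiers.
    """
    # Time Travel retention tops out at 90 days.
    if not 0 <= retention_days <= 90:
        raise ValueError("retention_days must be between 0 and 90")
    # OFFSET is a time difference in seconds from now, so it must not be positive.
    if offset_seconds > 0:
        raise ValueError("offset_seconds must be zero or negative")

    return [
        # 1. Raise the Time Travel retention window on the table.
        f"ALTER TABLE {table} SET DATA_RETENTION_TIME_IN_DAYS = {retention_days};",
        # 2. Zero-copy clone of the table as it was offset_seconds ago.
        f"CREATE TABLE {clone_name} CLONE {table} AT (OFFSET => {offset_seconds});",
        # 3. Rows in the current table that are not in the clone.
        f"SELECT * FROM {table} MINUS SELECT * FROM {clone_name};",
        # 4. Unload the table to an existing stage as Parquet files.
        f"COPY INTO @{stage_path} FROM {table} FILE_FORMAT = (TYPE = PARQUET);",
    ]


if __name__ == "__main__":
    # Hypothetical names, for illustration only.
    for stmt in backup_statements("AIRQUALITY", "AIRQUALITY_BACKUP",
                                  "backup_stage/airquality/"):
        print(stmt)
```

Swap `AT (OFFSET => ...)` for `AT (TIMESTAMP => ...)` or `BEFORE (STATEMENT => ...)` if you know the exact moment or the query ID of the change you are worried about.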
Cool Stuff of the week
⚡️ https://www.youtube.com/watch?v=v3Anx71WNm0&t=568s&pp=ygULIlRpbSBTcGFubiI%3D
❄️ https://medium.com/snowflake/ai-infused-pipelines-with-snowflake-cortex-6a7954f2078d
⚡️ https://medium.com/@orellabac/querying-data-from-neo4j-to-snowflake-1c1ee537aeb6
❄️ https://www.snowflake.com/en/blog/auto-manufacturers-drive-innovation-snowflake/
❄️ https://medium.com/@tim.spann_50517/building-rag-applications-with-cortex-ai-bf0a3d2202db
⚡️ https://github.com/yuanze-lin/Olympus
⚡️ https://github.com/slidevjs/slidev
❄️ https://pytorch.org/blog/press-release-pytorch-foundation-expands-welcomes-projects-vllm-deepspeed/
❄️ https://docs.snowddl.com/getting-started
❄️ https://github.com/sfc-gh-praj/app-app-communication
❄️ https://quickstarts.snowflake.com/guide/getting_started_with_ai_observability/#0
❄️ https://medium.com/@peter.horrigan/so-you-have-your-pat-in-vault-now-what-5757632f8d51
❄️ https://www.snowflake.com/en/blog/new-regions-egress-cost-optimizer/
❄️ https://docs.snowflake.com/en/user-guide/warehouses-gen2
⚡️ https://github.com/emcie-co/parlant
New Models
❄️ https://www.snowflake.com/en/blog/meta-llama-4-now-available-snowflake-cortex-ai/
⚡️ https://huggingface.co/docs/transformers/main/en/model_doc/d_fine
❄️ https://huggingface.co/nvidia/parakeet-tdt-0.6b-v2
⚡️ https://huggingface.co/nvidia/OpenCodeReasoning-Nemotron-7B
Marketplace
⚡️ https://app.snowflake.com/marketplace/providers/GZSTZJL5LMG/Coresignal
Upcoming
May 15 - Overview of Snowflake https://www.snowflake.com/webinars/product-demo/data-cloud-demo-2025-05-15/
May 21 - Zero to Snowflake Hands on Lab https://www.snowflake.com/webinars/virtual-hands-on-labs/zero-to-snowflake-2025-05-21/
May 28 - Transforming Text https://www.snowflake.com/webinar/virtual-hands-on-labs/transforming-text-with-snowflake-cortex-building-intelligent-applications-apac-20250528/
June 19 - Northstar Intro to Snowflake Data Engineering https://www.snowflake.com/webinars/northstar-virtual-2025-06-19/
June 21 - Hybrid Tables for Real-Time https://www.snowflake.com/webinars/product-demo/harnessing-real-time-data-with-snowflake-hybrid-tables-101-2025-05-21
June 25 - Build data engineering pipelines https://www.snowflake.com/webinars/virtual-hands-on-labs/build-data-engineering-pipelines-using-snowpark-in-snowflake-notebooks-2025-06-25/
June 26 - Build a GenAI App https://www.snowflake.com/webinars/virtual-hands-on-labs/build-a-gen-ai-app-in-10-min-with-snowflake-2025-06-26/
In-Person
June 2-5 - Snowflake Summit - SF https://www.snowflake.com/en/summit/?utm_cta=website-events-featured
Very soon:
📊 May 14, 2025 - Boston - https://www.dbta.com/DataSummit/2025/default.aspx
📊 May 22, 2025 - New York City - https://events.sigmacomputing.com/mergespringnyc Sigma Computing - 9th floor
https://github.com/timothyspann
Recent Tim Stuff
https://www.youtube.com/watch?v=4Ojue8TWv6A
Apps, Demos, Examples, Models, Notebooks and Projects
© 2020-2025 Tim Spann https://www.youtube.com/@FLaNK-Stack
(AI + Vectors + LLM + Streaming + IoT)
