FLaNK Weekly for 22 January 2024

Timothy SpannTimothy Spann
2 min read

22-January-2024

IMG_5361

FLaNK Stack Weekly

Tim Spann @PaaSDev

https://pebble.is/PaaSDev

https://vimeo.com/flankstack

https://www.youtube.com/@FLaNK-Stack

https://www.threads.net/@tspannhw

https://medium.com/@tspann/subscribe

Get your new Apache NiFi for Dummies!

https://www.cloudera.com/campaign/apache-nifi-for-dummies.html

https://ossinsight.io/analyze/tspannhw

vector8

CODE + COMMUNITY

Please join my meetup group NJ/NYC/Philly/Virtual.

http://www.meetup.com/futureofdata-princeton/

https://www.meetup.com/futureofdata-newyork/

https://www.meetup.com/futureofdata-philadelphia/

This is Issue #121

https://github.com/tspannhw/FLiPStackWeekly

https://www.linkedin.com/pulse/schedule-2023-tim-spann-/

https://www.cloudera.com/solutions/dim-developer.html

Articles

Writing A Gen AI Processor with Python https://medium.com/@tspann/writing-a-generative-ai-python-processor-ed0655cf4e3f

Codeless Generative AI Pipelines with Chroma Vector DB & Apache NiFi https://medium.com/@tspann/codeless-generative-ai-pipelines-with-chroma-vector-db-apache-nifi-43e77d75952f

Using NiFi to Augment and Enrich LLM Results with Real-Time Contextual Data https://medium.com/@tspann/augmenting-and-enriching-llm-with-real-time-context-b6da7ba4960a

Web AI Testing with Chrome https://developer.chrome.com/blog/supercharge-web-ai-testing

What is TinyML? https://www.ikkaro.net/what-tinyml-is/

Watch that DNS https://rmoff.net/2024/01/16/hosting-on-github-pages-watch-out-for-subdomain-hijacking/

Which Gen AI to Use? https://artificialanalysis.ai/

Implementing RAG with HuggingFace https://medium.com/international-school-of-ai-data-science/implementing-rag-with-langchain-and-hugging-face-28e3ea66c5f7

Kafka on K8 https://engineering.grab.com/kafka-on-kubernetes?

Fix Busted PiP https://medium.com/@RyanHiebert/how-i-fixed-a-pip-compile-dependency-resolution-error-c09305e107e2

NiFi in Kafka Connect https://www.cloudera.com/content/dam/www/marketing/resources/webinars/emea-how-to-run-nifi-flows-in-kafka-kconnect.png.landing.html

Redhat with Cloudera for Generative AI https://www.redhat.com/en/blog/unlocking-power-generative-ai-cloudera-data-platform-and-red-hat-openshift

AI https://blog.cloudera.com/announcing-clouderas-enterprise-artificial-intelligence-partnership-ecosystem/

Videos

Unlocking Financial Data with Real-Time Pipelines (OSACon 2023) https://www.youtube.com/watch?v=Q7gF7m4yFi4&ab_channel=OSACon

Auto Generate NiFi Flows from Natural Language by Mark Payne https://www.youtube.com/watch?v=3oRnUdE7x7w

Looking at the New Features of Apache NiFi (Halifax Community over Code) https://www.youtube.com/watch?v=_orD9aAXk48&ab_channel=TheASF

Utilizing Real-Time Transit Data for Travel Optimization (Halifax Community over Code) Sunday Oct 8 2023, Canada https://www.youtube.com/watch?v=OWQmeF-UeEc&ab_channel=TheASF

Continuous SQL with Kafka and Flink | Timothy Spann (EN) https://www.youtube.com/watch?v=IGs0k240zhU&ab_channel=JAVAPRO

Events

On Demand https://events.dzone.com/dzone/Data-Pipelines-Investigating-the-Modern-Day-Stack?utm_bmcr_source=LinkedIn

Open Source Finance Forum. Virtual. https://resources.finos.org/znglist/osff-2023-virtual-presentations/?c=cG9zdDo5OTEzOTk%3D&utm_campaign=OSFF+NYC+2023&utm_content=269713979&utm_medium=social&utm_source=linkedin&hss_channel=lcp-18473937

Feb 8, 2024: NYC.

https://www.meetup.com/new-york-open-source-data-infrastructure-meetup/events/297484047/

18:00 - 18:30 Welcome: Networking & snacks 18:30 - 18:35 Kickoff: Welcome Aiven 18:35 - 19:00 A Guide to Product Experimentation (Erin Mikail Staples, LaunchDarkly) 19:00 - 19:30 Building Real-time Pipelines: A Case Study with Transit Data (Tim Spann, Cloudera) 19:30 ~ 21:00 Food & networking

Feb 2024: Webinar

https://www.cloudera.com/about/events/webinars/stay-ahead-of-cyber-threats-by-utilizing-data-in-motion.html?utm_medium=virtual-event&utm_source=resources-module&keyplay=ALL&utm_campaign=FY25-Q1-CorporateWebinar-AMER-cyber-threats&cid=701Hr000001pXCQIA2

Feb 28, 2024: NYC. Cloudera Meetup. Flink https://www.meetup.com/futureofdata-princeton/events/298661947/

March 15, 2024: Princeton. IT Professional Conference at Trenton Computer Festival IEEE Information Technology Professional Conference on Friday, March 15th, 2024 https://princetonacm.acm.org/tcfpro/

April 2024: XtremeJ 2024. Virtual. https://xtremej.dev/2023/schedule/

Cloudera Events https://www.cloudera.com/about/events.html

More Events: https://www.linkedin.com/pulse/schedule-2024-tim-spann--y4coe

Code

  • https://github.com/tspannhw/FLaNK-python-watsonx-processor
  • https://github.com/tspannhw/FLaNK-CDW
  • https://github.com/tspannhw/FLaNK-VectorDB
  • https://github.com/tspannhw/FLaNK-RPI5
  • https://github.com/tspannhw/FLaNK-EdgeAI
  • https://github.com/kevinbtalbert/NiFi-Flows-Demos
  • https://github.com/DataSQRL/apirag

Models

  • https://github.com/apple/ml-ferret
  • https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.1-GGUF
  • https://github.com/kevinbtalbert/Electric_and_Utilities_System_Demo
  • https://stability.ai/news/stable-code-2024-llm-code-completion-release
  • https://clay-foundation.github.io/model/
  • https://github.com/speechbrain/speechbrain
  • https://huggingface.co/thenlper/gte-large
  • https://github.com/SeanLee97/AnglE
  • https://huggingface.co/WhereIsAI/UAE-Large-V1
  • https://huggingface.co/stabilityai/stablelm-2-1_6b
  • https://github.com/jzhang38/TinyLlama
  • https://huggingface.co/tiiuae/falcon-7b

Tools

  • https://github.com/timfraedrich/OutRun
  • https://projectnessie.org/
  • https://github.com/KRTirtho/spotube
  • https://textart.sh/
  • https://developer.spotify.com/documentation/web-api
  • https://nightshade.cs.uchicago.edu/downloads.html
  • https://barkeywolf.consulting/posts/barcode-scanner-webassembly/#meet-zbar
  • https://github.com/kffl/speedbump
  • https://github.com/stevekrenzel/pick-ems
  • https://tratt.net/laurie/blog/2024/faster_shell_startup_with_shell_switching.html
  • https://github.com/polyzos/stream-processing-with-apache-flink
  • https://gptcache.readthedocs.io/en/latest/bootcamp/langchain/qa_generation.html
  • https://jliljebl.github.io/flowblade/index.html
  • https://willowprotocol.org/
  • https://nitro.unjs.io/
  • https://github.com/OPCFoundation/UA-EdgeTranslator
  • https://www.open62541.org/
  • https://pypi.org/project/pinecone-client/
  • https://www.plotteus.dev/
  • https://github.com/serversideup/spin
  • https://github.com/Portkey-AI/gateway
  • https://maven.apache.org/docs/4.0.0-alpha-12/release-notes.html
  • https://github.com/openremote/openremote
  • https://github.com/fugue-project/fugue
  • https://github.com/apache/flink-kubernetes-operator
  • https://github.com/dai-shi/excalidraw-claymate
  • https://github.com/whylabs/langkit
  • https://github.com/clastix/kamaji
  • https://github.com/milvus-io/bootcamp
  • https://github.com/deepset-ai/haystack-cookbook
  • https://github.com/sgl-project/sglang
  • https://github.com/georgevetticaden/evernote-ai-chatbot
  • https://github.com/IBM/watsonxdata-python-sdk
  • https://mermaid.live/
  • https://github.com/dennislee22/deepspeed-train-CML
  • https://github.com/gabrielchua/RAGxplorer
  • https://chromeenterprise.google/os/chromeosflex/

© 2020-2024 Tim Spann

image

0
Subscribe to my newsletter

Read articles from Timothy Spann directly inside your inbox. Subscribe to the newsletter, and don't miss out.

Written by

Timothy Spann
Timothy Spann