FLaNK Stack Weekly for 05 August 2023

Timothy Spann

07-August-2023

FLiPN-FLaNK Stack Weekly

Tim Spann @PaaSDev

https://www.threads.net/@tspannhw

https://medium.com/@tspann/subscribe

Get your new Apache NiFi for Dummies!

https://www.cloudera.com/campaign/apache-nifi-for-dummies.html

https://ossinsight.io/analyze/tspannhw

TIMM!!!

https://github.com/huggingface/pytorch-image-models https://huggingface.co/docs/timm/index

CODE + COMMUNITY

Please join my meetup groups: NJ/NYC/Philly/Virtual.

http://www.meetup.com/futureofdata-princeton/

https://www.meetup.com/futureofdata-newyork/

https://www.meetup.com/futureofdata-philadelphia/

This is Issue #97

https://github.com/tspannhw/FLiPStackWeekly

https://www.linkedin.com/pulse/schedule-2023-tim-spann-/

Videos

https://www.youtube.com/watch?v=JdsY5p1GZ38&ab_channel=DatainMotion

https://www.youtube.com/watch?v=0G98z_fs_SQ

https://www.youtube.com/watch?v=NJWb92HRuYY&ab_channel=Kinzorize

https://www.youtube.com/watch?v=V1MEsV1Dkew

https://www.youtube.com/watch?v=VOerTAir9SU&ab_channel=Zilliz

Talks

https://www.slideshare.net/bunkertor/building-realtime-pipelines-with-flank-a-case-study-with-transit-data

Articles

https://medium.com/@tspann/no-code-sentiment-analysis-with-hugging-face-and-apache-nifi-for-article-summaries-cf06d1df1283

https://medium.com/@tspann/tims-quarter-in-streaming-2q-2023-59181e7847b3

https://community.cloudera.com/t5/Community-Articles/Call-a-CML-Deployed-Model-From-Apache-NiFi-in-10-minutes-Or/ta-p/374853

https://medium.com/cloudera-inc/getting-ready-for-apache-nifi-2-0-5a5e6a67f450

https://www.infoq.com/presentations/apache-iceberg-streaming/

https://www.theregister.com/2023/08/01/aws_and_ibm_netezza_come/

https://eugeneyan.com//writing/llm-patterns/

https://www.datanami.com/this-just-in/cloudera-board-appoints-software-industry-veteran-charles-sansbury-as-new-ceo/

https://betterprogramming.pub/frameworks-for-serving-llms-60b7f7b23407

https://longform.asmartbear.com/problem/

https://research.ibm.com/blog/nasa-hugging-face-ibm

https://garystafford.medium.com/building-a-rag-based-conversational-chatbot-with-langflow-and-streamlit-784e2b77bcbe

https://graciano.dev/2023/08/03/weekend-reading-list-187/

https://cldr-steven-matison.github.io//blog/NiFi-For-Dummies/

https://medium.com/towards-generative-ai/unleashing-the-power-of-the-information-retriever-in-the-retrieval-augmented-generation-pipeline-a782c7287e9b

https://kevinbtalbert.github.io/iceberg/pyiceberg-api/

https://aws.amazon.com/blogs/big-data/a-side-by-side-comparison-of-apache-spark-and-apache-flink-for-common-streaming-use-cases/?utm_source=substack&utm_medium=email

https://thenewstack.io/comparing-different-vector-embeddings/

https://medium.com/intuit-engineering/open-source-fuzzy-matcher-finding-data-similarities-in-records-33e4879ef4fd

Throw Back Article

https://community.cloudera.com/t5/Community-Articles/Accessing-Facebook-Page-Data-from-Apache-NiFi-1-2/ta-p/246915

https://github.com/tspannhw/NiFi-Man/blob/main/all.md

Events

https://attend.cloudera.com/ameropendatalakehousewithcdpon?lid=7vxyhds3tlv7

August 23, 2023: NYC. AI. https://www.aicamp.ai/event/eventdetails/W2023082314

October 7-10, 2023: Halifax, CA. Community over Code. https://communityovercode.org/

October 18, 2023: 2-Hours to Data Innovation: Data Flow https://www.cloudera.com/about/events/hands-on-lab-series-2-hours-to-data-innovation.html

November 2, 2023: Evolve NYC https://www.cloudera.com/about/events/evolve/new-york.html#register

November 22, 2023: Big Data Conference. Hybrid. https://bigdataconference.eu/

Cloudera Events https://www.cloudera.com/about/events.html

More Events: https://www.linkedin.com/pulse/schedule-2023-tim-spann-/

Code

  • https://github.com/BrooksIan/CallCMLModel
  • https://github.com/DigitalSal/cdf-workshop
  • https://github.com/mmehra12/HOLWorkshops/tree/main/CDF/Guide#pre-requisites
  • https://github.com/voxel51/fiftyone
  • https://github.com/intuit/fuzzy-matcher
  • https://github.com/intuit/chain-z
  • https://github.com/intuit/Tank
  • https://github.com/intuit/maven-build-scanner

Tools

  • https://seatunnel.apache.org/docs/about
  • https://github.com/Alpha-VLLM/LLaMA2-Accessory
  • https://overturemaps.org/
  • https://github.com/tspannhw/EverythingApacheNiFi
  • https://github.com/Soulter/hugging-chat-api
  • https://ai.meta.com/blog/audiocraft-musicgen-audiogen-encodec-generative-ai-audio/
  • https://ollama.ai/blog/run-llama2-uncensored-locally
  • https://github.com/Dicklesworthstone/llama2_aided_tesseract
  • https://github.com/Skocimis/opensms
  • https://paimon.apache.org/docs/master/engines/flink/
  • https://calcite.apache.org/news/2023/07/26/release-1.35.0/
  • http://minborgsjavapot.blogspot.com/2023/08/java-new-draft-jep-computed-constants.html
  • https://github.com/QwenLM/Qwen-7B
  • https://github.com/apache/flink-kubernetes-operator
  • https://github.com/gorilla-llm/gorilla-cli
  • https://github.com/NielsRogge/Transformers-Tutorials
  • https://huggingface.co/datasets/iamtarun/python_code_instructions_18k_alpaca/viewer/iamtarun--python_code_instructions_18k_alpaca/train?row=10
  • https://learn.sparkfun.com/tutorials/adding-wifi-to-the-nvidia-jetson/all
  • https://github.com/gavv/httpexpect
  • https://github.com/OpenBuddy/OpenBuddy
  • https://github.com/openvinotoolkit/anomalib
  • https://github.com/openbmb/toolbench
  • https://colab.research.google.com/drive/10vhji3FPOAm43zAvjOF4dlgxAeqIkHDx?usp=sharing
  • https://milvus.io/docs/install_standalone-docker.md
  • https://github.com/towhee-io/examples/
  • https://github.com/musabgultekin/functionary
  • https://vllm.readthedocs.io/en/latest/getting_started/quickstart.html
  • http://altexxanet.org/about.html
  • https://jupyter-ai.readthedocs.io/en/latest/users/index.html#prerequisites
  • https://github.com/ibireme/yyjson
  • https://www.tadviewer.com/
  • https://github.com/tconbeer/harlequin
  • https://dlthub.com/docs/intro

© 2020-2023 Tim Spann
