โ Successful Building a GenAI use cases just requires the latest and greatest fr
โ Successful Building a GenAI use cases just requires the latest and greatest frontier model, right?!
๐ In my conversations with customers I often realize that the choice of the best & shiniest model is highly occupying their resources, while thinking and focusing on architecting and building the use cases which ultimately should fulfil users needs gets very little attention. This naturally fuelled by the fast-pace and loud announcements of new frontier models, which come withย superior benchmark results and new capabilities. What we often forget is that the majority of capabilities become commodity among different models very fast. Hence it often makes much more sense to focus on the use cases and building the generative application for the use case in a way it really addresses the userโs need and is able to evolve to utilize new release models where this makes sense. I wrote about this inย my blog post (https://lnkd.in/e_YhCinM.)
๐ Bagrat Ter-Akopyan, Carmen Heger and team. Congrats and thanks for the nice technical write-up.
๐ Bagrat Ter-Akopyan, Carmen Heger and team. Congrats and thanks for the nice technical write-up. It’s amazing to see how you continue to innovate on behalf of your customers. I love the outcome.
๐ What stands out for me from your technical report is your description of the why and how you moved away from your own, already very good, initial RAG implementation. Citing from the report:
“๐๐ฏ๐ฆ ๐ฐ๐ง ๐ฐ๐ถ๐ณ ๐ญ๐ฆ๐ข๐ณ๐ฏ๐ช๐ฏ๐จ๐ด ๐ง๐ณ๐ฐ๐ฎ ๐ต๐ฉ๐ฆ ๐ญ๐ข๐ด๐ต ๐ฃ๐ช๐จ ๐๐๐ ๐ฑ๐ณ๐ฐ๐ซ๐ฆ๐ค๐ต ๐ธ๐ข๐ด ๐ต๐ฉ๐ข๐ต ๐ธ๐ฆ ๐ฏ๐ฆ๐ฆ๐ฅ๐ฆ๐ฅ ๐ต๐ฐ ๐ด๐ฑ๐ฆ๐ฆ๐ฅ ๐ถ๐ฑ ๐ฐ๐ถ๐ณ ๐ฆ๐น๐ฑ๐ฆ๐ณ๐ช๐ฎ๐ฆ๐ฏ๐ต๐ข๐ต๐ช๐ฐ๐ฏ ๐ค๐บ๐ค๐ญ๐ฆ๐ด. ๐๐ฏ ๐ต๐ฉ๐ฆ ๐ต๐ฆ๐ค๐ฉ๐ฏ๐ช๐ค๐ข๐ญ ๐ด๐ช๐ฅ๐ฆ, ๐ต๐ฉ๐ข๐ต ๐ฎ๐ฆ๐ข๐ฏ๐ต ๐ถ๐ด๐ช๐ฏ๐จ ๐ฎ๐ฐ๐ณ๐ฆ ๐ฐ๐ง๐ง-๐ต๐ฉ๐ฆ-๐ด๐ฉ๐ฆ๐ญ๐ง ๐ด๐ฐ๐ญ๐ถ๐ต๐ช๐ฐ๐ฏ๐ด ๐ง๐ฐ๐ณ ๐๐๐ ๐ข๐ณ๐ค๐ฉ๐ช๐ต๐ฆ๐ค๐ต๐ถ๐ณ๐ฆ๐ด ๐ต๐ฉ๐ข๐ต ๐ฉ๐ข๐ท๐ฆ ๐ฃ๐ฆ๐ค๐ฐ๐ฎ๐ฆ ๐ข๐ท๐ข๐ช๐ญ๐ข๐ฃ๐ญ๐ฆ ๐ข๐ฏ๐ฅ ๐ณ๐ฆ-๐ถ๐ด๐ช๐ฏ๐จ ๐ฐ๐ถ๐ณ ๐ฐ๐ธ๐ฏ ๐ฆ๐น๐ช๐ด๐ต๐ช๐ฏ๐จ ๐ด๐ฐ๐ญ๐ถ๐ต๐ช๐ฐ๐ฏ๐ด.” and “๐๐ฐ ๐ฎ๐ข๐ฌ๐ฆ ๐ต๐ฉ๐ฆ ๐ฅ๐ฐ๐ค๐ถ๐ฎ๐ฆ๐ฏ๐ต๐ด ๐ด๐ฆ๐ข๐ณ๐ค๐ฉ๐ข๐ฃ๐ญ๐ฆ ๐ง๐ฐ๐ณ ๐ฐ๐ถ๐ณ ๐๐๐ ๐ด๐บ๐ด๐ต๐ฆ๐ฎ, ๐ธ๐ฆ ๐ถ๐ด๐ฆ๐ฅ ๐๐๐ ๐ฌ๐ฏ๐ฐ๐ธ๐ญ๐ฆ๐ฅ๐จ๐ฆ ๐ฃ๐ข๐ด๐ฆ๐ด. ๐๐ฉ๐ช๐ด ๐จ๐ข๐ท๐ฆ ๐ถ๐ด ๐ด๐ญ๐ช๐จ๐ฉ๐ต๐ญ๐บ ๐ง๐ฆ๐ธ๐ฆ๐ณ ๐ค๐ฐ๐ฏ๐ง๐ช๐จ๐ถ๐ณ๐ข๐ต๐ช๐ฐ๐ฏ ๐ฐ๐ฑ๐ต๐ช๐ฐ๐ฏ๐ด ๐ต๐ฉ๐ข๐ฏ ๐ข ๐ค๐ถ๐ด๐ต๐ฐ๐ฎ ๐ฃ๐ถ๐ช๐ญ๐ฅ ๐ฆ๐ข๐ณ๐ญ๐ช๐ฆ๐ณ ๐ด๐ฐ๐ญ๐ถ๐ต๐ช๐ฐ๐ฏ ๐ฃ๐ถ๐ต ๐ด๐ฑ๐ฆ๐ฅ ๐ถ๐ฑ ๐ช๐ฏ๐จ๐ฆ๐ด๐ต๐ช๐ฐ๐ฏ ๐ฃ๐บ ๐ด๐ฆ๐ท๐ฆ๐ณ๐ข๐ญ ๐ฎ๐ข๐จ๐ฏ๐ช๐ต๐ถ๐ฅ๐ฆ๐ด ๐ธ๐ฉ๐ช๐ค๐ฉ ๐ข๐ญ๐ญ๐ฐ๐ธ๐ฆ๐ฅ ๐ถ๐ด ๐ต๐ฐ ๐ช๐ต๐ฆ๐ณ๐ข๐ต๐ฆ ๐ฑ๐ณ๐ฆ-๐ฑ๐ณ๐ฐ๐ค๐ฆ๐ด๐ด๐ช๐ฏ๐จ ๐ฎ๐ถ๐ค๐ฉ ๐ง๐ข๐ด๐ต๐ฆ๐ณ.”
๐ข If you are on the hunt for an image ๐๐ก๐ ๐ฉ๐๐๐๐ข segmentation model, which is open and you can deploy
๐ข If you are on the hunt for an image ๐๐ก๐ ๐ฉ๐๐๐๐ข segmentation model, which is open and you can deploy on your own, have a look at the just released ๐ฆ๐ฒ๐ด๐บ๐ฒ๐ป๐ ๐๐ป๐๐๐ต๐ถ๐ป๐ด ๐ ๐ผ๐ฑ๐ฒ๐น ๐ฎ (๐ฆ๐๐ ๐ฎ). The model capabilities can be nicely experienced in Metaโs Demo). Read more about the announcement at their announcement page.
๐ทโโ๏ธIf you are looking into deploying the model to build your own application on AWS, ๐๐บ๐ฎ๐๐ผ๐ป ๐ฆ๐ฎ๐ด๐ฒ๐ ๐ฎ๐ธ๐ฒ๐ฟ is a very good alternative for you. Quoting from Metaโs announcement website:
โฐ Early start and back to Zรผrich Airport! ๐ซ Some disturbances grounded me here last night, but I'm
โฐ Early start and back to Zรผrich Airport! ๐ซ Some disturbances grounded me here last night, but I’m looking back to yesterday’s amazing Generative AI hackathon with a smile ๐.
โ The participants were very engaged and built impressive PoCs in no time. My favorite feedback was: “It’s amazingly easy to create an 80% solution in almost no time”, “Based on the learning I built a 2nd use case in just 10 more minutes and got amazing results”, and “It’s not just the Generative AI models, but the platform that allowed me to build RAG-based solutions so easily”. ๐
What a great evening yesterday at the AWS User Group Munich. The quote of the evening for me, yeah I
What a great evening yesterday at the AWS User Group Munich. The quote of the evening for me, yeah I might be a little biased ๐ , was :
“There are multiple model launches in Amazon Bedrock. Launching new models in Bedrock became like new minor versions numbers for databasesย or other services. Hard to keep up with the pace”.
Reflecting on this, I think that is a very true statement. It underlines the pace of innovation in this space and the necessity of choice at your hands if you want to build #GenAI based application for your customers. Hence the need of a service like #AmazonBedrock. Also featured the Bedrock Converse API, which makes switching models even easier.
How do you choose the foundation model for your Generative AI App โ like your car?
How do you choose the foundation model for your Generative AI App โ like your car?
Just published a new blog post on medium:
๐ How do you choose the right foundation model for your Generative AI app? ๐ก It’s like picking the perfect car!ย ๐ Just like car buyers care more about features, price, and user experience, Generative AI app users prioritize functionality, cost, and ease of use over the specific model under the hood.ย ๐ทโโ๏ธ Building a successful Generative AI app requires a well-architected, cost-effective solution that meets customer needs, not just chasing the latest, most powerful model.ย ๐ง Models are crucial, but they’re just one component. A robust architecture with components like data management, model orchestration, and user interfaces is essential.ย ๐ฐ Cost-effectiveness matters. Choose the “Goldilocks” model that’s good enough for your use case, not overpowered (and overpriced).ย โฑ๏ธ Models evolve rapidly. Your app architecture and processes must allow seamless model updates or replacements to stay competitive.ย ๐งฉ Embrace model composability. Use different models for different tasks within your app for optimal cost and performance.ย ๐ฃ๏ธ Just like cars, Generative AI apps aren’t one-size-fits-all. Tailor your solution to your specific use case and customer needs.ย ๐ Don’t get caught up in the hype. Work backward from customer problems to build fantastic, innovative solutions with Generative AI.ย ๐ Enjoy the ride and have fun building your Generative AI app! It’s an exciting journey ahead.
What a very looong weekend in Stockholm. Loved every second of it. Started of with spectating the #S
What a very looong weekend in Stockholm. Loved every second of it. Started of with spectating the #StockholmMarathon. Would have loved to join, but no last minute (literally) bibs available. Only learned about it in the hour before the start. Still had a very good time.
We started the week with a Prompt Engineering on #AmazonBedrock workshop leading participant through the implementation of a marketing use case. Was great to see Oliver Mรถller, Tobias Nitzsche and Chakkree Tipsupaย in joined action. The workshop has been good received and AWS officeโs in Stockholm have been a very welcoming place for all the customers.
Prompt Engineering is possibly the single most valuable skill you have to master if you want to get
Prompt Engineering is possibly the single most valuable skill you have to master if you want to get to production with your Generative AI based application.
As many customers are working hard towards bringing their ideas and proof of concepts to realisations in 2024, this topic hits the keynote stage of the #AWSSummit in Stockholm.
We bring:
- ๐ ๏ธa toolbox of proven tools to build reliable prompts
- ๐ทa mechanism to create reliable results, turning trial&error into engineering
- ๐a ton of learnings from idealo internet GmbH โs journey into production
Looking forward to meet you there ๐
The ones who joined Philipp and me in our session at the #AWSSUMMIT in Berlin last week already got
The ones who joined Philipp and me in our session at the #AWSSUMMIT in Berlin last week already got a preview of the blog post as we run it as a demo. Nice that it is now published and you all can get hands on it. Kudos!
AWS Inferentia2 is a great way to optimize the inference part of your (gen)AI workloads on AWS and the blog post helps you to dive straight into deploying a LLM (in this case Meta’s Llama 3) model. But it is not “just” Llama 3. From Hugging Face recent blog post: “Enabling over 100,000 models on AWS Inferentia2 with Amazon SageMaker” - https://lnkd.in/ePYb6TFs. So there is a good chance that you can benefit from AWS Inferentia 2 today :)
๐กOn my way back from a customer workshop on โPrompt Engineeringโ in Den Haag in the Netherlands. Goo
๐กOn my way back from a customer workshop on โPrompt Engineeringโ in Den Haag in the Netherlands. Good to connect with nature and an upcoming storm and very interesting to learn from customers about their experiences with GenAI. Still on the mission of turning authoring a prompt from a pure art form to more of an engineering approach. A Test driven, automated engineering approach for creating prompts has well resonated with the participants of the workshop - canโt wait to see that implemented. Big kudos to the participants who turned a potential boring presentation in an interactive exchange of ideas. Loved it!