Anyone using AI Locally - as in on their own computer?

If you think the app is not uploading your queries, conversations, files and whatever else you share with it, you don’t know how it all works.
It's not connected to the internet. Not at all. If you watch the video, you download LM Studio, and then you download the AI model you want to use.

The PC is not connected to the internet, so unless the AI works by quantum computing, there is no connection.
 
Then whenever you connect it back, it will start uploading the data. And if the computer it’s on is always offline, the model will never improve.
 
There are 3 types of AI LLM:

1) The one you pay a low fee to use, where you are the product as well. Whatever you do with it is also used to train the model. If you are not paying an arm and a leg, this is likely the one.

2) The one you pay an arm and a leg for to make sure they don't upload. You know you have this one if you have to go through an IT department or pay a lot more than the advertised discount rate, and you have someone to call and some legal paperwork to review.

3) The one you download and run on your own machine. I think so far Llama is the only one that would fit, but as I mentioned above, this is going to be either a pretty meh one that fits a normal laptop, or one that needs some ridiculously expensive laptop to run, like the HP laptop a GPU company demoed their chip on with Llama, which has 96GB of RAM and cost about $3k before the RAM shortage (so probably a $9k laptop now). Anything that can run on a cheap to average laptop is kind of dumb in comparison.
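For a rough sense of why model size decides what hardware you need, here is a back-of-the-envelope sketch. It assumes memory is roughly parameter count times bytes per parameter, plus a flat 20% overhead for context and runtime buffers. The overhead figure and the function name `weights_gb` are my own assumptions for illustration, not numbers from any vendor, and the 70B row is only there as a larger model for contrast.

```python
# Rough estimate of the memory needed to hold a model's weights.
# Assumption: memory ~= parameters x bytes per parameter, times a flat
# overhead factor (1.2 is a guess, not a measured value).

def weights_gb(params_billions: float, bits_per_param: int, overhead: float = 1.2) -> float:
    """Approximate memory footprint in GB (1 GB = 1e9 bytes)."""
    bytes_total = params_billions * 1e9 * bits_per_param / 8
    return bytes_total * overhead / 1e9


if __name__ == "__main__":
    # 8B matches the Llama3:8b size mentioned in this thread.
    for name, size in [("8B", 8), ("70B", 70)]:
        for bits in (16, 8, 4):
            print(f"{name} model at {bits}-bit: ~{weights_gb(size, bits):.1f} GB")
```

Under these assumptions an 8B model quantized to 4 bits comes out to a few GB, while a 70B model at 16 bits needs well over 100 GB, which is roughly the gap between "runs on a normal laptop" and "needs a very expensive one".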
 
I think you're confusing AI as a service (whether it be a web interface like ChatGPT.com or Grok.com, or an API like the ones OpenAI, xAI, etc. provide) with open models you run on your own computer (LM Studio, etc.).
 
@KrisZ :

As @PandaBear and @dogememe said above - I am NOT talking about using an AI service.

I have a locally installed model, and this version is amazing compared to what was available six months ago. I'm running on slightly older hardware, and while it's not super fast, it doesn't need to be for my uses as a "technical writing assistant". What normally takes me half a day, this AI does in less than three minutes.

I don't have to do the "crunching", or hire someone to do the basic information processing and data entry. I do it on a PC at my house and no one has access to the information but me and the customer (the final product). Huge time-saver for me.

So, I'm taking from the lack of responses that very few, if any, of the BITOG crew are using a local LLM?
 
I run LM Studio with various models on both my Windows laptop (Intel Core Ultra 9, 64GB RAM, and an RTX 5080 with 16GB of VRAM) and my 3-year-old MacBook Air M2 with 16GB of unified memory. The same query on the same model will run roughly 2-3x as fast on the Windows box as it will on my Mac. Still pretty impressive to see the Mac flexing its tiny muscles compared to a much more expensive and stout PC.
 
I'm using Mistral 7B and Llama3:8b on my work computer, a Dell Precision 3570 laptop with a Core i7-1255U, Nvidia T550 discrete graphics, and 64GB RAM. I run ollama as a front-end for the models. I have the OS set so ollama.exe runs on the T550, which is something you need to do manually in Windows 11 if the OS doesn't detect that it should use discrete graphics instead of the built-in Intel UHD display adapter.

I experimented with a lot of different models before settling on these ones. Mistral:7B is a little faster but Llama3:8b is more accurate. Both generally speaking run well on this hardware, close to what you'd get on an AI website. Mistral is also kind of frenchy, which is annoying. It's always nagging me about things like copyright law and other various and sundry topics it doesn't need to opine on. Liberté, égalité, fraternité, I guess.
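If you want to put a number on "a little faster", here is a minimal sketch that asks a local ollama server the same question with each model and computes tokens per second. It assumes ollama is listening on its default port (11434) and that both models are already pulled. The helper names (`build_request`, `tokens_per_second`, `compare_models`) are made up for this example.

```python
import json
import urllib.request

# Assumption: ollama is running locally on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_request(model: str, prompt: str) -> bytes:
    """JSON body for a single non-streaming generation request."""
    return json.dumps({"model": model, "prompt": prompt, "stream": False}).encode("utf-8")


def tokens_per_second(response: dict) -> float:
    """eval_count is generated tokens; eval_duration is in nanoseconds."""
    return response["eval_count"] / (response["eval_duration"] / 1e9)


def ask(model: str, prompt: str) -> dict:
    """Send one prompt to the local server and return the parsed JSON reply."""
    req = urllib.request.Request(
        OLLAMA_URL,
        data=build_request(model, prompt),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


def compare_models(models: list, prompt: str) -> None:
    """Print generation speed for each model on the same prompt."""
    for model in models:
        result = ask(model, prompt)
        print(f"{model}: {tokens_per_second(result):.1f} tokens/s")
```

With the server running, something like `compare_models(["mistral:7b", "llama3:8b"], "Explain what a torque wrench does.")` would print one line per model. A single prompt is a crude benchmark, so run it a few times before drawing conclusions.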

Regarding model selection, the big models are slow because they are too big for this hardware. But what was surprising in the beginning was that the smallest models are not always faster, because they have less training to draw on and have to process more before they come up with an answer. You really need an AI model and size that is optimized for your hardware. TinyLlama is one of the better small models and it works ok, not great, on my home machine, a Core i3 with 20GB of RAM. It takes some experimentation, and I ask every model I've tried what the best model for my hardware is, both for my work and home machines.

I'm willing to try others, especially if they are newer. The older models, even if they run well on your hardware, like Google Gemma, they're not very accurate because the tech has moved on, and Gemma was published in 2023.
 
I saw Llama 3 was demoed on a laptop with 96GB, and that's the "kind of lame" version. Most of the LLMs we have at work are not on premises for this specific reason.
Llama3:8b runs pretty well on my work laptop, but it does have discrete graphics.

You can verify Nvidia graphics performance by typing 'nvidia-smi' at the Windows command prompt. Mine goes to 100% GPU when it's answering queries.
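If you'd rather log it than eyeball it, here is a small sketch that polls nvidia-smi in CSV mode and parses the result. The query flags are standard nvidia-smi options. The function names are just for illustration, and the parser assumes every field is a plain number (some GPUs report "[N/A]" for certain fields, which this does not handle).

```python
import subprocess

# nvidia-smi options for machine-readable output:
# GPU utilization (%), memory used (MiB), memory total (MiB).
QUERY = [
    "nvidia-smi",
    "--query-gpu=utilization.gpu,memory.used,memory.total",
    "--format=csv,noheader,nounits",
]


def parse_gpu_line(line: str) -> dict:
    """Turn one CSV line such as '100, 3541, 4096' into a dict of ints."""
    util, used, total = (int(field.strip()) for field in line.split(","))
    return {"util_pct": util, "mem_used_mib": used, "mem_total_mib": total}


def read_gpus() -> list:
    """Run nvidia-smi once and return one dict per GPU."""
    out = subprocess.run(QUERY, capture_output=True, text=True, check=True).stdout
    return [parse_gpu_line(line) for line in out.splitlines() if line.strip()]
```

Calling `read_gpus()` while a query is being answered should show utilization near 100 if the model really is running on the discrete GPU. If it stays near 0, the work is probably landing on the CPU or the integrated graphics instead.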
 
I do, for entertainment only...
I bombed people here with that crap, but not on the forum.
I have hotrod computers.
 
I do think it works pretty well on a laptop. My problem was that Llama 3 was kind of lame (compared to Claude).

Our workplace has switched from Copilot to Claude now. Llama 3 never even got qualified because its output wasn't good enough quality for our work. It makes a lot of "new grad mistakes".
 
What are you trying to accomplish? If you have a decent dedicated GPU (Nvidia RTX series with 16GB+ memory) you can actually use it for basic tasks like analyzing data, small code snippets, or writing short stories. Yes, you can do it with fewer resources, but it becomes less and less useful.

But if you're expecting capability similar to what even a free plan with ChatGPT or Grok etc can offer... it's not going to happen on your computer. Even if you fill your garage with AI servers... the open source models just aren't the same. Sure, there are some really good ones, but you simply don't have the engineering minds and financial resources to make something comparable to the latest version of ChatGPT or Grok etc.
I'm proving this out locally before moving on to an underutilized VxRail cluster. Biggest problem is it doesn't have discrete GPUs, so the models will have to be quantized.

It doesn't need to be general purpose AI, like the big AI sites. It just needs to be trained to do a few things well. Every node in this cluster has 1.5TB of RAM, so, it's possible to throw obscene amounts of memory at it to make up for the limitations of CPU vs a GPU. You can easily load an entire model into memory.

As for engineering talent, well, I tend to believe there are fewer Albert Einstein level talents in this field than people think there are, and most are just building on top of what others have done.
 
You won't know until you give it a try. As people say, you can solve every problem in the world with enough time and money, or in this case memory and time.
 
I understand your feelings!
However, I think AI may save us!

I already feel social media is destroying us. At least with AI, the masses can now easily verify everything being posted on X (Twitter), Facebook, and everywhere else. It’s ridiculous, the information on there and the way it sways people who think less critically.

It’s really laughable the stuff that gets posted online. People suck that stuff up like it’s from some authoritative site when it’s nothing but a propaganda machine churning this stuff out, turning us against each other. AI is going to make it harder for the foreign propaganda machines to move you to vote one way or another. It’s not gonna happen overnight; however, it is happening.
I admire your optimism.

However, I fear that AI is turning out more propaganda machines than solving problems. To wit:

https://www.malwarebytes.com/blog/a...sletter_v1_177514470918&utm_content=Wikipedia
 
Yeah, I am optimistic. There will be faults, as there always are with any new technology. The negatives make for fun discussion, but my feeling is no one reports on the positives because people like to read bad stuff.

Keep in mind we are only at the very beginning of development. Really, I see today's AI technology as antiquated compared to what it is going to look like in 10 years. Already, AI is reviewing imaging scans in the health industry, picking up things that are sometimes missed by humans, like cancers and arterial disease. I believe one of our members in this forum just had a CT Angiogram with AI interpretation? If I am wrong on his particular scan, it does exist. Stuff like that.
I know it's controversial. I'm just saying nothing is going to stop it. It will be in every part of our lives. I'm not saying it shouldn't be alarming, I'm just saying there is good as well as evil. Even if we banned it, we would fall behind all the other superpowers who hate us, and that would be more scary.

If I was younger, yeah, I don't know... I'm glad I grew up when I did.
 