Tbh, if you don’t know what that means, you can’t trust it.
Though, it means that unless it’s running locally on your own hardware and not in the cloud and you haven’t verified the source code directly (or someone else you trust hasn’t) then assume it is nefarious and do not give it any personal or sensitive information you wouldn’t want anyone on the Internet to know.
Name, address, phone number, Bank info, nude photos of yourself, etc. If the info being released could harm you or in some way negatively impact your life, assume it would be sent to China or anywhere else on the world wide web if you ran it without following the previous guidelines.
Any data related to your person. (name, contacts, date of birth, etc.)
Search “PII” or “personally identifiable information” if you want to read more about that.
Thanks @[email protected] for filling OP in! I want to add a few things incase OP is unaware of more than just what you explained:
LLM = large language model, one of the types of AI. Examples: ChatGPT, DeepSeek, Meta’s LLaMA
Open-Source: the program code of the AI is available to look at, in its entirety
If you are not sure if you understand these terms and what frightful_hobgoblin said, then just assume whatever AI you are using is going to share your chat with the company behind it.
open source is also very tricky with LLMs: i’d argue if you can’t recreate it from scratch, it’s not open source… deep seek does not contain all the data necessary to recreate it from scratch: it’s open weights (the model itself can be downloaded and run) but not open source… i’d classify it as free (as in beer) software; not open source
What does this means?
Tbh, if you don’t know what that means, you can’t trust it.
Though, it means that unless it’s running locally on your own hardware and not in the cloud and you haven’t verified the source code directly (or someone else you trust hasn’t) then assume it is nefarious and do not give it any personal or sensitive information you wouldn’t want anyone on the Internet to know.
Could you please explain what “personal or sensitive info” means?
Name, address, phone number, Bank info, nude photos of yourself, etc. If the info being released could harm you or in some way negatively impact your life, assume it would be sent to China or anywhere else on the world wide web if you ran it without following the previous guidelines.
Any data related to your person. (name, contacts, date of birth, etc.) Search “PII” or “personally identifiable information” if you want to read more about that.
‘locally-running’ means it is on your computer, will work without an internet connection
anything you access using the internet is not ‘locally-running’
The comment means don’t send information over the internet that you don’t want to share.
Thanks @[email protected] for filling OP in! I want to add a few things incase OP is unaware of more than just what you explained:
LLM = large language model, one of the types of AI. Examples: ChatGPT, DeepSeek, Meta’s LLaMA
Open-Source: the program code of the AI is available to look at, in its entirety
If you are not sure if you understand these terms and what frightful_hobgoblin said, then just assume whatever AI you are using is going to share your chat with the company behind it.
open source is also very tricky with LLMs: i’d argue if you can’t recreate it from scratch, it’s not open source… deep seek does not contain all the data necessary to recreate it from scratch: it’s open weights (the model itself can be downloaded and run) but not open source… i’d classify it as free (as in beer) software; not open source
Excellent addition, I agree!
That’s the criteria of many FOSS catalogue repositories: they won’t add any software that is not completely reproducible.