Lee Duna@lemmy.nz to

Technology@lemmy.worldEnglish · 1 year ago

Audacity adds AI audio editing capabilities thanks to free Intel OpenVINO plugins

www.notebookcheck.net

515

Audacity adds AI audio editing capabilities thanks to free Intel OpenVINO plugins

www.notebookcheck.net

Lee Duna@lemmy.nz to

Technology@lemmy.worldEnglish · 1 year ago

Audacity has added AI audio editing capabilities thanks to Intel’s free OpenVINO plugins. These plugins add AI-powered noise suppression, speech transcription, music generation and remixing, and music separation to the freeware sound editor and are available for download today.

You must log in or register to comment.

Chat

Blastboom Strice@mander.xyz
link
fedilink
English
arrow-up
60·
edit-2
1 year ago
[Edit: indeed, its actually good that it’s 2gb]

2gb plugin??!

Btw, does it work with tenacity?
- 9point6@lemmy.world
  link
  fedilink
  English
  arrow-up
  64·
  1 year ago
  AI models are often multiple gigabytes, tbh it’s a good sign that it’s not “AI” marketing bullshit (less of a risk with open source projects anyway). I’m pretty wary of “AI” audio software that’s only a few megabytes.
  - interdimensionalmeme@lemmy.ml
    link
    fedilink
    English
    arrow-up
    10·
    1 year ago
    Tensorflowlite models are tiny, but they’re potentially as much an audio revolution as synthetizer were in the 70s. It’s hard to tell if that’s what we’re looking at here.
  - Neato@ttrpg.network
    link
    fedilink
    English
    arrow-up
    4·
    1 year ago
    Why are they that big? Is it more than code? How could you get to gigabytes of code?
    - General_Effort@lemmy.world
      link
      fedilink
      English
      arrow-up
      50·
      1 year ago
      Currently, AI means Artificial Neural Network (ANN). That’s only one specific approach. What ANN boils down to is one huge system of equations.
      
      The file stores the parameters of these equations. It’s what’s called a matrix in math. A parameter is simply a number by which something is multiplied. Colloquially, such a file of parameters is called an AI model.
      
      2 GB is probably an AI model with 1 billion parameters with 16 bit precision. Precision is how many digits you have. The more digits you have, the more precise you can give a value.
      
      When people talk about training an AI, they mean finding the right parameters, so that the equations compute the right thing. The bigger the model, the smarter it can be.
      
      Does that answer the question? It’s probably missing a lot.
    - Aatube@kbin.social
      link
      fedilink
      arrow-up
      15·
      edit-2
      1 year ago
      It’s basically a huge graph/flowchart.
      - acockworkorange@mander.xyz
        link
        fedilink
        English
        arrow-up
        6·
        1 year ago
        It’s really nothing of the sort.
        
        Aatube@kbin.social
        link
        fedilink
        arrow-up
        14·
        1 year ago
        
        Specifying weights, biases and shape definitely makes a graph.
        
        IMO having a lot of more preferred and more deprecated routes is quite close to a flowchart except there’s a lot more routes. The principles of how these work is quite similar.
        
        General_Effort@lemmy.world
        link
        fedilink
        English
        arrow-up
        3·
        1 year ago
        
        There are graph neural networks (meaning NNs that work on graphs), but I don’t think that’s what is used here.
        
        I do not understand what you mean by “routes”. I suspect that you have misunderstood something fundamental.
        
        Aatube@kbin.social
        link
        fedilink
        arrow-up
        5·
        1 year ago
        
        I’m not talking about that. What’s weights, biases and shape if not a graph?
        
        By routes, I mean that the path of the graph doesn’t necessarily converge and that it is often more tree-like.
    - 9point6@lemmy.world
      link
      fedilink
      English
      arrow-up
      8·
      edit-2
      1 year ago
      The current wave of AI is around Large Language Models or LLMs. These are basically the result of a metric fuckton of calculation results generated from running a load of input data in, in different ways. Given these are often the result of things like text, pictures or audio that have been distilled down into numbers, you can imagine we’re talking a lot of data.
      
      (This is massively simplified, by someone who doesn’t entirely understand it themselves)
      - circuitfarmer@lemmy.world
        link
        fedilink
        English
        arrow-up
        1·
        edit-2
        1 year ago
        deleted by creator
    - ඞmir@lemmy.ml
      link
      fedilink
      English
      arrow-up
      7·
      1 year ago
      They’re composed of many big matrices, which scale quadratically in size. A 32x32 matrix is 4x the size of a 16x16 matrix.
- bamboo@lemm.ee
  link
  fedilink
  English
  arrow-up
  33·
  1 year ago
  It seems reasonable given it includes multiple AI models.
- Fisch@lemmy.ml
  link
  fedilink
  English
  arrow-up
  7·
  1 year ago
  2gb is pretty normal for an AI model. I have some small LLM models on my PC and they’re about 7-10gb big. The big ones take up even more space.
- Lexi Sneptaur@pawb.social
  link
  fedilink
  English
  arrow-up
  3·
  1 year ago
  Isn’t tenacity a joke project made by 4channers
  - CaptainBasculin@lemmy.ml
    link
    fedilink
    English
    arrow-up
    15·
    1 year ago
    That fork is sneedacity, which is very dead.
    - Lexi Sneptaur@pawb.social
      link
      fedilink
      English
      arrow-up
      3·
      1 year ago
      Gotcha, thank you for the info. Gotta admit their made-up words are pretty funny
  - RmDebArc_5@lemmy.ml
    link
    fedilink
    English
    arrow-up
    10·
    1 year ago
    Tenacity is a Audacity fork without telemetry
    - m-p{3}@lemmy.ca
      link
      fedilink
      English
      arrow-up
      16·
      1 year ago
      Isn’t the telemetry in Audacity opt-in anyway?
      - Fisch@lemmy.ml
        link
        fedilink
        English
        arrow-up
        3·
        1 year ago
        The fork was created when Audacity was bought and one of the first things the new developers were about to do was add opt-out telemetry. People didn’t like that at all. From what I read in this thread, they ended up adding opt-in telemetry instead.
      - xploit@lemmy.world
        link
        fedilink
        English
        arrow-up
        1·
        1 year ago
        deleted by creator
sapetoku@sh.itjust.works
link
fedilink
English
arrow-up
40·
1 year ago
I’ve been using the OpenVINO plugins for a few weeks and it’s genuinely impressive. Noise cancelling is one thing, but the transcription tool is amazing. I can create subtitles from conference recordings in minutes and create transcripts of recorded zoom calls, etc. and it does it for multiple languages.

That’s the kind of shit I like using AI for.
- mojofrododojo@lemmy.world
  link
  fedilink
  English
  arrow-up
  4·
  1 year ago
  
  music generation and remixing
  
  any insight as to what this is?
edric@lemm.ee
link
fedilink
English
arrow-up
33·
1 year ago
The music separation and speech transcription plug-ins actually sound nice. Obviously that will depend on how reliable they actually are.
- ChunkMcHorkle@lemmy.world
  link
  fedilink
  English
  arrow-up
  15·
  edit-2
  1 month ago
  deleted by creator
Agent641@lemmy.world
link
fedilink
English
arrow-up
33·
1 year ago
Removed by mod
- interdimensionalmeme@lemmy.ml
  link
  fedilink
  English
  arrow-up
  27·
  1 year ago
  We already had a scare with them, but turns out it was very unfair overreaction to the project.
  
  In this case I’m happy as long as it’s hardware platform independent and uses open source released models.
  
  AI music art has been for a long time in the hands of industry moguls and us peasants have had nothing. So I’m happy with anything that puts this power in the hands of the everyman.
  - doctorcrimson@lemmy.world
    link
    fedilink
    English
    arrow-up
    18·
    edit-2
    1 year ago
    Was it unfair? I haven’t been following since they got bought out by spyware?
    
    EDIT: Audacity was acquired by a company called MuseGroup in 2021 who added unnecessary telemetry and they admit that they do provide the data the collect to third parties. Some claim the changes were reverted but I haven’t confirmed that myself so until I see there is no telemetry it’s spyware as far as I’m concerned.
sic_semper_tyrannis@feddit.ch
link
fedilink
English
arrow-up
29·
1 year ago
Use Tenacity instead
- laughterlaughter@lemmy.world
  link
  fedilink
  English
  arrow-up
  25·
  edit-2
  1 year ago
  Why?
  
  Edit: I see now. https://tenacityaudio.org/docs/_content/Motivation.html
- ElPussyKangaroo@lemmy.world
  link
  fedilink
  English
  arrow-up
  6·
  1 year ago
  What’s the difference?
  - nutsack@lemmy.world
    link
    fedilink
    English
    arrow-up
    14·
    1 year ago
    
    In April 2021, Muse Group acquired the famous audio editing applicaiton Audacity. Their goals for Audacity were to bring much needed improvements to Audacity. However, not too long after, there was an attempt to add telemetry to the program
    - exhaust_fan@lemmy.world
      link
      fedilink
      English
      arrow-up
      7·
      1 year ago
      Ok so what’s Tenacity? A fork pre shittification?
      - _dev_null@lemmy.zxcvn.xyz
        link
        fedilink
        English
        arrow-up
        7·
        1 year ago
        Aye, but a little more convoluted. TL;DR: Several new projects forked to avoid the enshitification, and with much time and drama, most of the actually active maintainers joined forces under the Tenacity name+repo. (And 4chan was part of the drama, because of course they were.)
    - ElPussyKangaroo@lemmy.world
      link
      fedilink
      English
      arrow-up
      1·
      1 year ago
      Oof. That’s sad.
Holzkohlen@feddit.de
link
fedilink
English
arrow-up
26·
1 year ago

…and Audacity for Windows 64-bit is required to run these plugins.

Useless.
- EatATaco@lemm.ee
  link
  fedilink
  English
  arrow-up
  29·
  1 year ago
  On lemme I’m often reminded of the vegan joke:
  
  How do you tell if someone is a Linux user? Don’t worry, they’ll tell you.
- arin@lemmy.world
  link
  fedilink
  English
  arrow-up
  3·
  1 year ago
  Having no need for over 4gb of RAM?
homoludens@feddit.de
link
fedilink
English
arrow-up
25·
1 year ago
Windows only :(
- Limonene@lemmy.world
  link
  fedilink
  English
  arrow-up
  56·
  1 year ago
  According to the repo, it builds fine on Linux. They just don’t distribute a binary for it.
  
  https://github.com/intel/openvino-plugins-ai-audacity/issues/27
  - ReallyZen@lemmy.ml
    link
    fedilink
    English
    arrow-up
    7·
    1 year ago
    It’s already on the AUR
    - 🦄🦄🦄@feddit.de
      link
      fedilink
      English
      arrow-up
      4·
      1 year ago
      I fucking love arch and its community
- LanternEverywhere@kbin.social
  link
  fedilink
  arrow-up
  2·
  1 year ago
  Presumably you could use it in a VM running Windows
AcidOctopus@lemmy.ml
link
fedilink
English
arrow-up
19·
1 year ago
I’m sure I used to use Audacity back in the day as a free, quick and dirty editor to splice up audio tracks. I’m talking at least 10 years ago.

Had no idea it was still even a thing.
- TheHarpyEagle@lemmy.world
  link
  fedilink
  English
  arrow-up
  53·
  edit-2
  1 year ago
  It’s honestly pretty much the industry standard for indie creators. There’s nothing super flashy about it, it just does its job very well.
  
  This along with 7-zip and OBS and the like have been pretty impressive success stories for FOSS, even if most of their users don’t even know what that means.
- doctorcrimson@lemmy.world
  link
  fedilink
  English
  arrow-up
  3·
  1 year ago
  They got acquired in 2021 so a lot of people have been very skeptical about it lately.
tabular@lemmy.world
link
fedilink
English
arrow-up
12·
1 year ago
Was the training data ethically sourced (for music generation)?

How do music creators feel about their work potentially being regenerated and used in other’s works?
- ArmokGoB@lemmy.dbzer0.com
  link
  fedilink
  English
  arrow-up
  13·
  1 year ago
  Considering copyright is unethical to begin with…
  - tabular@lemmy.world
    link
    fedilink
    English
    arrow-up
    4·
    1 year ago
    I could almost agree but I think there is value in copyleft: a hack of copyright to ensure users have some of the rights copyright denies when you get a copy/derivative work from another.
    
    With no copyright it’s great that you won’t be sued if you share software but in practice a mere binary isn’t enough (reverse engineering is impractical). We need the source code to be able to change it (or understand what it’s even doing). I won’t support removing all copyright law without a solution.
  - Femsoup [She/Her]@lemmy.blahaj.zone
    link
    fedilink
    English
    arrow-up
    1·
    1 year ago
    deleted by creator
- Elderos@sh.itjust.works
  link
  fedilink
  English
  arrow-up
  9·
  1 year ago
  Define ethically sourced.
  - hansl@lemmy.world
    link
    fedilink
    English
    arrow-up
    20·
    1 year ago
    Free range grass fed.
  - tabular@lemmy.world
    link
    fedilink
    English
    arrow-up
    4·
    1 year ago
    Getting permission to copy each music work for use in training data may be ethically important while the creators are dependant on income from that work to survive, or just as a social contract.
    - Meowoem@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      8·
      1 year ago
      The capitalist mindset really is a weird one, rent seeking is out of control. We’re talking about a tool that allows independent creators and hobby users to improve the quality of their projects but all you can think about is the possibility of getting a couple of dollars in royalties.
      
      Regular users being able to use advanced noise reduction allows regular people to better compete with corporations, it’s the sort of technology which can help displace the monopolies which rule the world. But you’re against it because they didn’t give you 6 cents for listening to your cover version of country roads
      - tabular@lemmy.world
        link
        fedilink
        English
        arrow-up
        4·
        1 year ago
        Consider there is nuance here. I write code and want people to use it but only if they follow the license that means they must share it with others. I liked the idea of AI creating art for me until I considered the tool’s method of creation and the negative effect taking from artists may have.
        
        I suggest supporting independent creators directly instead.
      - General_Effort@lemmy.world
        link
        fedilink
        English
        arrow-up
        2·
        1 year ago
        Completely agree, but one thing:
        
        help displace the monopolies
        
        These monopolies are a social/legal problem. It can’t be solved with technology. The increased FTC action in the US under the Biden administration are really a hopeful sign.
        
        I am worried about the number of people who want to go in the opposite direction, which “ethically sourced” is simply code for.
    - kuneho@lemmy.world
      link
      fedilink
      English
      arrow-up
      6·
      1 year ago
      you are saying this like the music indistry weren’t about resampling/remixing/rethinking existing songs/melodies/phrases already. it always was. and that’s fine! people always gets down to the source if they hear something fancy.
      - tabular@lemmy.world
        link
        fedilink
        English
        arrow-up
        3·
        edit-2
        1 year ago
        I can’t image people always get to the source, my understanding is most music does not have attribution of significant portions copied.
        
        kuneho@lemmy.world
        link
        fedilink
        English
        arrow-up
        1·
        1 year ago
        
        I can’t imagine…
        
        well yeah, there’s a lot of things I can’t imagine either, the world is a strange place
        
        tabular@lemmy.world
        link
        fedilink
        English
        arrow-up
        1·
        1 year ago
        Indeed, but without reason to change my mind it will remain the same.
- summerof69@lemm.ee
  link
  fedilink
  English
  arrow-up
  5·
  1 year ago
  
  How do music creators feel about their work potentially being regenerated and used in other’s works?
  
  They can always discuss that with their psychologists! :)
peopleproblems@lemmy.world
link
fedilink
English
arrow-up
12·
1 year ago
Removed by mod
Sunforged@lemmy.ml
link
fedilink
English
arrow-up
9·
edit-2
1 year ago
Audacity just doesn’t seem worth the trouble after discovering Reaper and how powerful it is for only $60.
- OpenHammer6677@lemmy.world
  link
  fedilink
  English
  arrow-up
  12·
  1 year ago
  I’m a sound engineer and I use different DAWs for different purposes. There’s just no one DAW that does all, so this is a compromise I’m happy to go with.
  
  When I do podcast editing, I use Audacity to split multi-track WAV files and for truncating silence. It’s just waaaay easier to do this there than on Reaper. Plus it has a loopback recording feature built-in which I use for Zoom meeting recordings etc.
  
  I use Pro Tools for audio post, but for most of what I do I’m a Reaper guy. It’s very powerful as you said and it just works.
  
  I know it can be a hassle switching DAWs (muscle memory on shortcuts can get weird), but for me, I like making the most of the strengths of a tool rather than forcing something to do everything.
  - Sunforged@lemmy.ml
    link
    fedilink
    English
    arrow-up
    2·
    1 year ago
    That’s awesome!
    
    I learned DAWs with ProTools back around 2006 in college. Dropped out because I didn’t want to enter a competitive trade where my best opportunities were moving out of state.
    
    Got sucked into another industry and haven’t touched much audio for the past decade. Getting back into it now and started on Audacity but the 2021 buyout had me confused where to land with the Tenacity split. the good/bad of open source I suppose but as a user being in the middle of a split was frustrating and detracting from recording. Finding out about Reaper and talking to people leaving ProTools behind even within the industry was just what I needed when I needed it.
    
    My daughter (11yo) is now getting into DAWs as her current goal is to score an internship at KEXP, being able to share with her all the stuff I learned in school has been so much fun.
- Takapapatapaka@lemmy.world
  link
  fedilink
  English
  arrow-up
  11·
  1 year ago
  I see what you mean, in your case as well as mine, Reaper is far more powerful and so far more adequate to our needs But people do not always search for powerful software. Sometimes they only want something easy to learn, with only basic tasks but well performed and entirely free. When you have these requirements, Audacity is better
  - Sunforged@lemmy.ml
    link
    fedilink
    English
    arrow-up
    7·
    edit-2
    1 year ago
    Audacity is a great learning tool for intro absolutely! When you’re just dipping your toes into recording and editing, free and $60 is a huge difference.
    
    I feel like users that are going to be using any of the features of this plug-in, they’re probably at the point that going to Reaper makes sense.
- ReallyActuallyFrankenstein
  link
  fedilink
  English
  arrow-up
  7·
  1 year ago
  Does Reaper have similar AI tools? Not a dig, a real question.
  - Takapapatapaka@lemmy.world
    link
    fedilink
    English
    arrow-up
    6·
    1 year ago
    Not at the moment, from what I know
🔍🦘🛎@lemmy.world
link
fedilink
English
arrow-up
6·
1 year ago
Awesome, useful features if they work well. I’ll have to try it out.
Kawawete@reddeet.com
link
fedilink
English
arrow-up
6·
1 year ago
I wonder if it can “de-brickwall” music now
- laughterlaughter@lemmy.world
  link
  fedilink
  English
  arrow-up
  6·
  edit-2
  1 year ago
  De-brickwall?
  
  Edit: Googled it.
  - Slovene@feddit.nl
    link
    fedilink
    English
    arrow-up
    10·
    1 year ago
    Wanna share with the rest of the class?
    - Blue_Morpho@lemmy.world
      link
      fedilink
      English
      arrow-up
      14·
      1 year ago
      To make music louder so it stands out, producers amplify the music until the waveform looks like a straight line instead of peaks and valleys of loud and soft
      - Slovene@feddit.nl
        link
        fedilink
        English
        arrow-up
        14·
        1 year ago
        Ah, the loudness wars …
vosagoy@futurology.today
link
fedilink
English
arrow-up
3·
1 year ago
Into the trash it goes
- Terminarchs@slrpnk.net
  link
  fedilink
  English
  arrow-up
  14·
  1 year ago
  Why is that?
  - vosagoy@futurology.today
    link
    fedilink
    English
    arrow-up
    2·
    1 year ago
    Open source projects are now catching up to the AI buzzword. I wonder who could be behind this.
    - Lexi Sneptaur@pawb.social
      link
      fedilink
      English
      arrow-up
      38·
      1 year ago
      AI, like cloud computing, is just a layman’s term for something else. You will not be able to stem the tide of language changing. It just means machine learning now. Just like how cloud computing is just a term for computing in a k8s cluster in someone’s data center.
      - General_Effort@lemmy.world
        link
        fedilink
        English
        arrow-up
        7·
        1 year ago
        Neural nets have been a part of AI ever since the term was coined 70 years ago. The one thing one could complain about is that the term may be narrowing to that specific approach.
        
        Strictly, neural nets are a specific kind of ML and ML is a specific kind of AI. The term AI seems to have gone out of fashion in academia, though.
        
        Lexi Sneptaur@pawb.social
        link
        fedilink
        English
        arrow-up
        4·
        1 year ago
        AI is far too broad of a term, for sure.
    - simple@lemm.ee
      link
      fedilink
      English
      arrow-up
      27·
      1 year ago
      Removed by mod
    - Aatube@kbin.social
      link
      fedilink
      arrow-up
      6·
      edit-2
      1 year ago
      This is Intel’s plug-in, with otherwise no relation to Audacity. Plus, as long as they don’t bundle it, I don’t see a problem with it.
      - vosagoy@futurology.today
        link
        fedilink
        English
        arrow-up
        3·
        1 year ago
        
        intel
        
        Ahhh everything makes sense now
        
        olympicyes@lemmy.world
        link
        fedilink
        English
        arrow-up
        3·
        1 year ago
        This is a case where you didn’t even need to read the article. You just had to read the headline!
        
        Pope-King Joe@lemmy.world
        link
        fedilink
        English
        arrow-up
        2·
        edit-2
        1 year ago
        Seriously, it’s right there!
        
        vosagoy@futurology.today
        link
        fedilink
        English
        arrow-up
        1·
        1 year ago
        tsmt
    - half_built_pyramids@lemmy.world
      link
      fedilink
      English
      arrow-up
      5·
      1 year ago
      Audacity was already in the trash since the buyout and telemetry data collection.
      - Aatube@kbin.social
        link
        fedilink
        arrow-up
        11·
        1 year ago
        I never understood the opposition to anonymized telemetry. While adding an entire network stack for it is certainly quite atrocious, there’s no problem with the principle I can see.
        
        Bizarroland@kbin.social
        link
        fedilink
        arrow-up
        7·
        1 year ago
        Some people prefer to not have their every action watched and observed by some anonymous Big brother.
        
        The people who do not get that are the people who profit from the watching, and the people that are, best case, inconsiderate of the desires and feelings of other people.
        
        It is not normal nor is it natural to claim ownership of other people’s activity.
        
        It is normal and natural to wish to exist without being observed. Privacy is a fundamental human right and companies are taking advantage of the fact that it is not legally enforced.
        
        Hopefully the laws will catch up and make it so that each and every individual opportunity to directly observe a person must be explicitly approved beforehand with a set time limit on the observation, and that all telemetry must be made publically available and transparent, not only during the original acquisition of data but also in each and every single usage of that data after the fact.
        
        It is only fair after all that should accompany wish to observe you that they must also be equally observed.
        
        Aatube@kbin.social
        link
        fedilink
        arrow-up
        9·
        edit-2
        1 year ago
        But if you anonymize the data, does it really mean someone has their every action watched in a harmful way?
        
        Emily (she/her)@lemmy.blahaj.zone
        link
        fedilink
        English
        arrow-up
        5·
        1 year ago
        This is an odd place to grand stand. I’m glad you have ideals, but the fact is Audacity was looking to gather industry standard telemetry data (basic system information and crashes) as an opt-in system. This information is extremely important in fixing bugs and prioritising developer resources.
        
        Bizarroland@kbin.social
        link
        fedilink
        arrow-up
        1·
        1 year ago
        And I could see the forest a whole lot better if all these trees weren’t in the way.
        
        It’s not that one person is doing it it’s that everyone is doing it.
        
        The only way to stop everyone from doing it is to stop everyone from doing it.
        
        vosagoy@futurology.today
        link
        fedilink
        English
        arrow-up
        1·
        1 year ago
        Oy vey

Technology@lemmy.world

technology@lemmy.world

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: [email protected]

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related news or articles.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

Approved Bots

Visibility: Public

This community can be federated to other instances and be posted/commented in by their users.

3.7K users / day
9.33K users / week
18.7K users / month
35.9K users / 6 months
322 local subscribers
69.3K subscribers
14.4K Posts
589K Comments
Modlog