I have a dumb work related chrome thing, i’d like to make it so that when a certain notification sound plays in chromium, my computer does a few things automatically for me

Does anyone know a good way to make this happen?

I imagine it’d have to be setup like:

when chrome starts playing audio && check if that audio matches soundfile.ogg && myscript.sh, but I don’t know any good cli utilities that could get something like that done, and if there are any better ideas!

edit: to avoid X/Y issues i’ve summarized the problem in full here:

  1. I have a work program, this notifies me if I get a call or email, the work program then presents an accept/decline page, and does not proceed until I either accept, decline, or it times out.
  2. I want it to do two different things depending on if it’s a call or email
  3. It provides no notification other than the sound and an “accept” button on the page
  4. I have a chrome window open that does nothing but this, and I never use chrome for anything else
  5. I want to automatically do various things when I receive either this call or email
  6. I want it to be broadly applicable rather than a script designed for the specific website giving me the notification (so not a chrome extension). This prevents me from having to update any code in the event that the backend changes dramatically, and even if the notification sound changes, i’d just record a new sound as the activation noise.
  7. The noise is always the same, and hasn’t changed for many years, and there is a distinct noise between calls and emails
  8. They never overlap, they never play multiple times at the same time, and they never make any noises other than those two. The noises are distinct.

These factors cause me to want to run a script once the noise is recognized, only if the noise is playing in a particular app. I’m using pipewire/hyprland on arch.

My current plan for isolating the noise is to do the following:

pactl load-module module-combine-sink sink_name=‘Work’ slaves=‘easyeffects_sink’

and then set chrome exclusively to play audio on work.

Then set a script to check the sink work for audio that matches what I want. That should be simpler than the other methods i’ve seen to isolate the noise.

  • t0mri@lemmy.ml
    link
    fedilink
    arrow-up
    0
    ·
    edit-2
    6 months ago

    I dunno exactly but, for shell script, playerctl can detect the audio output ig.

    There might be some audio library. May be you can write a daemon or something to watch for events. That way you can you use it on anywhere not just on chrome.

    Can you add more specifications, coz id like to help

  • onlinepersona@programming.dev
    link
    fedilink
    English
    arrow-up
    0
    ·
    6 months ago

    I think it would possibly be easier to write an extension. You can inspire yourself from this extension, intercept media playback, possibly hash the media being played, compare the hash to a known DB you create, and call a script in response to a positive detection.

    CC BY-NC-SA 4.0

    • Communist@lemmy.mlOP
      link
      fedilink
      English
      arrow-up
      0
      ·
      edit-2
      6 months ago

      It’s really not in this case, I can see why people think that since i’ve been vague, but tbh I thought somebody would have already made an easy sound recognition program and I just hadn’t seen it, and that once someone pointed that to me the rest would be easy.

      Here is the entirety of the problem:

      1. I have a work program, this notifies me if I get a call or email, the work program then presents an accept/decline page, and does not proceed until I either accept, decline, or it times out.
      2. I want it to do two different things depending on if it’s a call or email
      3. It provides no notification other than the sound and an “accept” button on the page
      4. I have a chrome window open that does nothing but this, and I never use chrome for anything else
      5. I want to automatically do various things when I receive either this call or email
      6. I want it to be broadly applicable rather than a script designed for the specific website giving me the notification (so not a chrome extension). This prevents me from having to update any code in the event that the backend changes dramatically, and even if the notification sound changes, i’d just record a new sound as the activation noise.
      7. The noise is always the same, and hasn’t changed for many years, and there is a distinct noise between calls and emails
      8. They never overlap, they never play multiple times at the same time, and they never make any noises other than those two. The noises are distinct.

      These factors cause me to want to run a script once the noise is recognized, only if the noise is playing in a particular app. I’m using pipewire/hyprland on arch.

      edit: actually they have, it should be really easy with this: https://github.com/worldveil/dejavu

  • thevoidzero@lemmy.world
    link
    fedilink
    arrow-up
    0
    ·
    6 months ago

    Someone already talked about the XY problem, so I’ll say this.

    Why sound notification instead of notification content? If your notification program (dunst in my case) have pattern matching or calling scripts based on patterns and the script has access to which app, notification title, contents etc. then it’s just about calling something in your bash script.

    And any time you wanna add that functionality to something else, add one more line with a different pattern or add a condition in your script. Comparing text is lot more reliable than audio.

    Of course your use case could be completely different, so maybe give some examples of use case so people can give you different ways to solve that instead of just the one you’re thinking of.

  • bloodfart@lemmy.ml
    link
    fedilink
    arrow-up
    0
    ·
    6 months ago

    My assumption is that you don’t care if your notification gets spoofed, ex. Someone rings a little bell and the script deletes all cookies from porn websites as if the little bell notification played.

    So I think the hardest and best way to do this is to have the script run on a separate device than the sound plays on.

    First record the sound you want to trigger with. Use the script executing device with the microphone and interface you’ll be using in production set up in the location of production to make it easier on yourself.

    Now reduce the bitrate of the target sound a lot. No, more than that, keep going, a little more, that’s perfect.

    Now write something that will capture the last target_sound_length seconds of audio and compare it with the bitcrushed version. Depending on your device, there may be a buffer object in the adc you can interface with, although if it’s running a normal operating system you won’t be able to just get to it without going through the os first.

    If you can go through the chrome notificationing machine, figure out the hook used to trigger the notification you want to respond to and intercept and perform the script. No nyquist needed!

  • ace_garp@lemmy.world
    link
    fedilink
    arrow-up
    0
    ·
    6 months ago

    Expect automates things, based on text-input captured from a terminal.

    Not sure if it has been extended/hacked to take sound as an input.