2

GitHub - jasonjmcghee/plock: From anywhere you can type, query and stream the ou...

 7 months ago
source link: https://github.com/jasonjmcghee/plock
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.
neoserver,ios ssh client

Repository files navigation

Plock

Use an LLM (or anything else that can stream to stdout) directly from literally anywhere you can type. Outputs in real time.

Write a prompt, select it, and (by default) hit Cmd+Shift+.. It will replace your prompt with the output in a streaming fashion.

Also! You can first put something on your clipboard (as in copy some text) before writing / selecting your prompt, and it (by default) Cmd+Shift+/ and it will use the copied text as context to answer your prompt.

For Linux, use Ctrl instead of Cmd.

100% Local by default. (If you want to use an API or something, you can call any shell script you want specified in settings.json)

I show an example settings.json in Settings

Note: Something not work properly? I won't know! Please log an issue or take a crack at fixing it yourself and submitting a PR! Have feature ideas? Log an issue!

Demo showing concept of Triggers, and the new flexible system

68747470733a2f2f63646e2e6c6f6f6d2e636f6d2f73657373696f6e732f7468756d626e61696c732f64336164646463333763326534633864616565663131323535633031333565612d776974682d706c61792e676966

Demo using GPT-3.5 and GPT-4

68747470733a2f2f63646e2e6c6f6f6d2e636f6d2f73657373696f6e732f7468756d626e61696c732f37353632323066336635653234396435623464356237353965396639616464332d776974682d706c61792e676966

If you are going to use this with remote APIs, consider environment variables for your API keys... make sure they exist wherever you launch, or directly embed them (just don't push that code anywhere)

(Original) Demo using Ollama

68747470733a2f2f63646e2e6c6f6f6d2e636f6d2f73657373696f6e732f7468756d626e61696c732f66656432363765363935643134356338386536626666376536333164613865302d776974682d706c61792e676966

(in the video I mention rem, another project I'm working on)

Getting Started

Install ollama and make sure to run ollama pull openhermes2.5-mistral or swap it out in settings for something else.

Launch "plock"

Shortcuts:

Ctrl / Cmd + Shift + .: Replace the selected text with the output of the model.

Ctrl / Cmd + Shift + /: Feed whatever is on your clipboard as "context" and the replace the selected text with the output of the model.

(these two are customizable in settings.json)

Escape: Stop any streaming output

Mac will request access to keyboard accessibility.

Linux (untested), may require X11 libs for clipboard stuff and key simulation using enigo. Helpful instructions

Also system tray icons require some extras

Windows (untested), you'll need to swap out Ollama for something else, as it doesn't support windows yet.

[Settings]

There is a settings.json file which you can edit to change shortcuts, the model, prompts, whether to use shell scripts and what they are, and other settings.

After updating, click the tray icon and select "Load Settings" or restart it.

At any time you can click the tray icon and it will list the settings location. For what it's worth:

On mac, It's at ~/Library/Application Support/today.jason.plock/settings.json.

On linux, I think it's ~/$XDG_DATA_HOME/today.jason.plock/settings.json.

Windows, I think it's ~\AppData\Local\today.jason.plock\settings.json

But clicking the icon is the best way.

Correct me if any of these are wrong.

Using Settings

Take a look at the shortcut keys. A “trigger” can be started with a shortcut. That points to a process (by an 0-index) and a prompt (by a 0-index) to the lists defined in the processes and prompts fields.

a process is either “ollama” or a command (shell on mac). You can use that to call your script.

prompts can use one of the two built in variables $CLIPBOARD and $SELECTION, or any others you define using set_env_var trigger.

next_steps defines what happens to the output, which can be written to the screen (streaming or all at once ), saved to a variable, and/or kick off another trigger.

In the future i want to make it easy to trigger flexibly (cron, push), output wherever / however, easily chain things together.

very very open to feedback

Show Example

{
  : {
    : ,
    : ,
    : 
  },
  : [
    {
      : [
        ,
        
      ]
    },
    {
      : []
    },
    {
      : [
        ,
        
      ]
    },
    {
      : [
        ,
        
      ]
    },
    
  ],
  : [
    {
      : ,
      : 
    },
    {
      : ,
      : 
    },
    {
      : ,
      : 
    },
    {
      : ,
      : 
    }
  ],
  : [
    {
      : ,
      : ,
      : ,
      : [
        {
          : 
        },
        {
          : 
        }
      ],
      : 
    },
    {
      : ,
      : ,
      : ,
      : [
        
      ],
      : 
    },
    {
      : ,
      : ,
      : ,
      : [
        
      ],
      : 
    },
    {
      : ,
      : ,
      : ,
      : [
        
      ],
      : 
    },
    {
      : ,
      : ,
      : ,
      : [
        ,
        {
          : 
        },
        {
          : 
        }
      ],
      : 
    },
    {
      : ,
      : ,
      : ,
      : [],
      : 
    }
  ]
}

Building Plock

If you don't have apple silicon or don't want to blindly trust binaries (you shouldn't), here's how you can build it yourself!

Prerequisites

  • Node.js (v14 or later)
  • Rust (v1.41 or later)
  • Bun (latest version)

Installation Steps

Node.js

Download from: https://nodejs.org/

curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh
source $HOME/.cargo/env

Bun NPM

Whattt?? Why? - well, windows doesn't support bun in github actions afaict. So, I'm using npm instead.

How to Install Node

Project Setup

git clone <repo_url>
cd path/to/project
npm install
npm run tauri dev

Build

npm run tauri build

Another demo

Another demo where I use the perplexity shell script to generate an answer super fast. Not affiliated, was just replying to a thread lol

Screen.Recording.2024-01-21.at.7.21.53.PM.mov

Secrets

Curious folks might be wondering what ocr feature is. I took a crack at taking a screenshot, running OCR, and using that for context, instead of copying text manually. Long story short, rusty-tesseract really dissapointed me, which is awkward b/c it's core to xrem.

If someone wants to figure this out... this could be really cool, especially with multi-modal models.


About Joyk


Aggregate valuable and interesting links.
Joyk means Joy of geeK