Mexes's Profile

Web App: Booru Prompt Gallery - Get Prompts quickly! (V8.0)

APP LINK: Booru Prompt Gallery - By MexesTL;DR: Web tool to quickly get clean prompts from Danbooru, Gelbooru, e621, and Aibooru. It removes irrelevant tags, makes adding and managing multiple tags easier, categorizes tags (Appearance, Clothing, Pose, Background), merges redundant tags, and much more! The prompts are mostly designed for Illustrious and Pony; there are plans to try to implement a natural language system, but it is not implemented yet.Hello! About 4 or 5 months ago, I launched this web app to the public. Since then, it has advanced quite a bit, adding more conveniences, APIs, and useful tools for those who generate AI Art. Because of this, I've decided to remake the article and publish it again to organize everything better and to explain all the new features that might be a bit tricky to understand.What is this?Booru Prompt Gallery, as its name suggests, is a gallery... of prompts. What it does is take posts from different digital art websites (like Danbooru), extract the tags that describe them, clean them up, and sort them to leave them ready for generating images. It is extremely useful for people who train LoRAs or AI Artists on social media.Why did I make it?As a LoRA model trainer, the most time-consuming part for me was testing the model and creating varied, high-quality examples for its release. So, looking to speed up that part, I made this page.Feature BreakdownFrom here on down, I will describe "what it does and how to use it" for all the tools in this app. I tried to accompany all of them with an image to make it easier to understand; as a result, the article looks much longer. It's not as much text as it seems, trust me! (well, maybe it is a bit too much text).Basic OperationYou can filter by specific tags. For example, let's say you want examples of Frieren: simply put "frieren" in the search bar and it will start showing you only examples that contain her. This works for any booru tag. After that, just look for one you like and you can copy it completely. (Note: You won't get the exact image, but you will get all the characteristics seen in the image; this depends on how well tagged it is). If you prefer, you also have the option to copy only certain categories, like clothing, for example.Teach PanelDividing tags into categories couldn't be done by magic, so I designed this interface for people who want to collaborate by categorizing each tag. There is an LLM system running in the background; meaning, suggestions go through an AI to quickly determine if the suggestion is correct or not. If the AI says it's incorrect or isn't sure, it goes to human review (which is me). Thank you very much to everyone who decides to collaborate!Options and FiltersAPI Providers: Depending on the provider you choose, you will see specific content. For example, the "e621" API is for furry content. Generally, I recommend always using Danbooru, as the tagging is more suitable for Illustrious.Search bar: This is where you put what you want to see in the examples. You can enter characters, actions, clothing, etc. Due to API limitations, you can only type 1 or 2 tags, depending on the active options.Blacklist: Here you will place the tags for which you do not want to see examples.Filter button: This is the typical content shield. Toggle it on or off to see (or hide) that type of content. Tags to add: An option to add whatever tags you want to all prompts. Useful if you use LoRAs with trigger words or want to apply styles (realistic, photorealistic, sketch, etc.).Preset saving: Saves the tags to add. Designed for those who manage multiple tag packs.Tags to remove: Removes tags from the final prompt on all cards. For example, tags like "solo" or "realistic", which are sometimes found in prompts and might not be desired.Minimum Tag Count: This option ensures that only prompts with more than a certain amount of tags appear. The higher the number, the more detailed prompts you get; I recommend leaving it around 20-30.Autocomplete: Not sure how the tag appears on Danbooru? There is a simple autocomplete system to help you type them correctly.Mode Buttons1. FavoritesBy hovering over a card, you can add it to your favorites to always have it handy.2. TrendingThis is a screen to see what is most popular for the day. I made this feature mostly for AI Artists. You can click directly on the cards to send them to the search engine, or right-click to copy the prompt.3. MergeBy entering Merge mode, you can combine categories between cards. It is very useful when you want to generate certain characters in different poses, clothes, and backgrounds. You can select more than one category on each card and combine as many cards as you want!In case there is a tag you aren't interested in, you can simply click it and it will be deleted from the final prompt. As you scroll down, there is a paper-shaped button that lets you enter this mode without having to scroll all the way back up to the control panel.4. FeedbackThis button isn't a mode per se, but it's there so you can send me reports, requests, and more. I highly appreciate any reports or feature requests; this helps the web app grow much more and become more useful to everyone!Quick Navigation ControlsRandom Button: Ensures you don't always see the latest results, fetching random content instead. Very useful if you're tired of always seeing the same things in the same order.Refresh Button: Simply reloads the results in case there are new posts, or to restart the random search.History Button: Opens a side window showing a timeline of all the tags you have copied previously.Prompt Generation OptionsInclude Character: Does exactly that: includes character tags in the prompt.Smart tag combination: If the prompt has, for example, "hair, long hair, white hair", this function combines them into a single tag: "white long hair". It is useful to avoid redundancy and not saturate the tokenizer.Global Tag Weights: An option designed for users who like to use weights on their tags.How do tag weights work?All tags are clickable. Clicking one opens a panel where you can send the tag directly to search, but the interesting part is that you can increase or decrease its weight. If you set it, for example, to 1.5, this weight is applied to the tag and it's ready to copy with the new configuration. If you enable the Global Tag Weights option, you will unlock a new feature:If you click the planet icon, you will save that tag as "global".That weight will automatically be applied to all cards containing said tag.You can modify and change all the weights you want. You can also modify the weights directly from the Panel Manager, right next to the option.Image DownloadAs the name implies, you can quickly download the images by hovering over the image and pressing the download button. Useful if you need the image for ControlNet or IP-Adapter.Support the AppDo you like the app and want to support me? You can do so through Buzz or Ko-fi donations.It also helps a lot if you leave your feedback and suggestions.That’s it!Stay hydrated and don’t forget to blink.

Mexes

About the trigger word in LoRa training

Following this series of articles on LoRa training, today it’s time to touch on the subject of the Trigger Word in style LoRAs.I invite you to read the previous article where I touched on the subject of the Text Encoder, as it might help you better understand today’s concepts. Training the Text Encoder in LoRa: Why it Matters for Style | CivitaiNote: This article aims to be easy to understand. I will not use complex technical terminology (like weights, vectors, or matrices) and will even skip over some deep theoretical concepts to simplify understanding.All this time I’ve been training style LoRAs, most of the time I don’t train them with a trigger word. Mostly for the sake of convenience: by not having to worry about whether you are using the keyword or not, you simply apply the LoRa and forget about the rest.However, for this experiment and using the same dataset from the previous article, I trained another LoRa simply by adding a trigger word to all the images. But first, let’s go through the theory before looking at the results and differences.Although the examples given here focus on LoRa style, many concepts apply equally to LoRa character.What is the Trigger Word?The trigger word is the tag you will use to activate your LoRa... forgive the redundancy. The idea is that this tag is like an empty "container" that we will fill with whatever we want to train. Usually, invented words or tags with unique characters (like letters swapped for numbers) are used to ensure that this tag doesn't previously exist in the model's knowledge, thus giving us a clean canvas.There are two ways to use the trigger word:Removing tags: For example, if the style you want to train is a realistic style, what you should do is delete those tags that represent the realistic style and commonly appear with the auto tagger like “nose, lips, realistic, photorealistic”. This will cause those characteristics seen in the images to associate directly with the trigger word since they aren't tagged.Keeping tags: If you leave “nose” and “lips” written in the dataset, surely your results with only the trigger word won't have those same noses or lips seen in the dataset images, but you will have the colors and strokes of your style. If you want to obtain the same facial structure, it will suffice to write "nose" and "lips" in the prompt to get those characteristics. This approach is useful for flexible LoRAs.Is it necessary?Let’s analyze the two approaches:Method without Trigger: By not using a trigger word when we train a style LoRa, the existing tags are modified. The most common one is 1girl, but generally, the style starts to train onto different common tags within the dataset. What happens with this? Well, in ambiguous prompts with fewer than 5 tags, for example, the style will be poor and look diluted. But when using many tags—specifically those mostly used in the dataset—the style will start to look much more present.Method with Trigger: By using a trigger word, we are giving the training a specific word where it can put everything that isn't tagged in the image. In a style, this would be the lineart, the brushstrokes, the color, etc. But one also has to be much more careful with tagging and prioritize a varied dataset to prevent the trigger word from picking up objects, concepts, poses, etc. However, unlike the method without a trigger, we would only need this trigger word for the style to appear with all (or most) of its characteristics (depending on which approach we decided to use with the trigger word).Note: This graph is a simplification; this would have to happen several times within the dataset.The Problem with the Trigger Word:As I mentioned before, our trigger word is a container waiting to be filled with information. How does the model know what information to put in there?Simple: the model looks at what doesn't change between images and associates it with your tag.If we want to train a chair, that chair must appear in all images with the trigger word.The problem: If the dataset isn't very varied, the model might associate unwanted things. Let’s say that in all the photos of the chair, a table also appears in the background. Since the table doesn't change and always accompanies the trigger, the model will think that “ch4ir” means "A chair AND a table." It will start putting the table inside the concept. (In this tag example, it would suffice to tag "table" in the dataset since it’s something the model already has broad knowledge of, which would prevent associating that table with the chair. The real problem occurs with things like poses, gestures, and other things we usually don't tag or that the model doesn't have much knowledge of).Once this is understood, let’s move on to talk about the examples and visual differences.Analysis of ResultsNote: All examples were made with the same configuration and a static seed.1. Presence of StyleOkay, right off the bat, you can say that with the trigger word, the style looks much more present. However, some flaws also appear, such as the combined color of the kimono and the change in the hand gesture compared to the version without a trigger. This could be an indication that our LoRa was surely already overtraining, so we'll overlook that.In these examples, the strength of the style is much more noticeable using the trigger word, and there is little change in the general composition.2. Background problem with trigger wordHere is an example that lets us see the problem we mentioned earlier in the theory. Leaving aside the fact that the intensity difference between "with" and "without" is very large, let's talk about that background in the trigger word example. That background style appears in the vast majority of images in the dataset, and it seems the trigger word has been learning it; specifically that it is a solid color, with a border and a subtle pattern. This is easy to fix. Since the prompt didn't specify what background was wanted, the trigger filled the void with what it saw most (that repeated background). By simply putting a setting (beach, street, city) or a color (white background, green background), it will surely stop appearing.3. Prompt taken from the DatasetThen there is this example where I took a prompt directly from the original dataset. We can see that here there is much less difference between the "without" and "with" trigger versions. This relates to what we explained in the theory section: the style began to associate with different tags within the dataset. This means that, in the version without a trigger, to get a result just as strong as in the version with a trigger, we must use mostly the same descriptive tags that were used in the dataset.So, which one to choose?It seems that the option with a trigger word is the best option, but it involves a bit more work behind the scenes by needing a more varied and much better-tagged dataset. Choose whichever fits your workflow best. For my part, I think I will start training more LoRAs with trigger words.If you are knowledgeable about this topic and notice I’ve made a mistake at any point, please let me know! The last thing I want to do is misinform people, and if that happens, I will edit this article as soon as possible to correct the errors.LoRa ConfigurationBase Model: Illustrious V1.0 Repeats: 5 Epoch: 10 Steps: 990 Batch Size: 4 Clip Skip: 1 UNet learning rate: 0.0005 LR Scheduler: cosine_with_restarts lr_scheduler_num_cycles: 3 Optimizer: AdamW8bit Network Dim: 32 Network Alpha: 16 Min SNR Gamma: 5 Noise Offset: 0.1 Multires noise discount: 0.3 Multires noise iterations: 8 Zero Terminal SNR: True Shuffle caption: TrueOther articles that may interest you:Web App: Booru Prompt Gallery V5.1 | CivitaiWeb App: Booru Tag Gallery | CivitaiApp: Regional Multi Crop - Dataset Tool | CivitaiTools I use for LoRa training | CivitaiTraining the Text Encoder in LoRa: Why it Matters for Style | CivitaiThat’s it!Stay hydrated and don’t forget to blink.

Mexes

Mexes

Web App: Booru Prompt Gallery - Get Prompts quickly! (V8.0)

About the trigger word in LoRa training

Training the Text Encoder in LoRa: Why it Matters for Style

Tools I use for LoRa training

Web App: Danbooru Prompt Gallery