Content:Files: Difference between revisions
|  (→Group:  removing the comment about group tags having to be unique in the DB, since they don't have to be anymore) |  (→Group:  adding some language around assigning files to groups, including the "no lookthrough" rule, and the application of a lower bar of evidence for older and more obscure fansubs) | ||
| Line 32: | Line 32: | ||
| {{eyecatch|Note|If a file is a joint release, like a file by e.g. ANBU-AonE, that joint group has its own group entry. The file should not be registered under either ANBU or AonE, but to the group that combines them both.}} | {{eyecatch|Note|If a file is a joint release, like a file by e.g. ANBU-AonE, that joint group has its own group entry. The file should not be registered under either ANBU or AonE, but to the group that combines them both.}} | ||
| {{eyecatch|Note|Not all RAW groups are credited. See [[which RAW groups to credit]].}} | {{eyecatch|Note|Not all RAW groups are credited. See [[which RAW groups to credit]].}} | ||
| =====Assigning files to a group when files may have been modified===== | |||
| In general, AniDB applies a "last modified" doctrine when it comes to assigning groups; this is done via the "no lookthrough" rule. When assigning groups, assign the file to the last group that modified it; we do not trace it backwards to the "original" group.<br> | |||
| '''--Example:''' | |||
| <pre style="white-space: pre-wrap;">* [AonE] issues a release in 1080p h.264 hi10p.  [Deadfish] then takes [AonE]'s release, and reencodes it from h.264 hi10p to 720p h.264 8p. [Deadfish] is the group that last modified, so the file is assigned to [Deadfish]. We do not "lookthrough" and assign the file to the original [AonE].</pre> | |||
| When determining whether a given file is a remux or not, it is not always clear, especially if the groups are obscure, the groups release anonymously, and/or the files in question are very old.  Here are some common pitfalls/misconceptions: | |||
| * The watermark in the video cannot be relied upon when assigning groups; a reencode would keep the watermark of the original group, rather than reflect the reencoding group. The watermark can generally confirm the original source, for the purposes of attribution in the [[Groupentries:Status#Animegroup_Release_Note_Comments|Animegroup release Notes]]; we do not apply too much weight on this for determining who is the "last modified" group. | |||
| * The names of tracks in the mkv metadata (e.g. name of the subtitle track) cannot be relied upon when assigning groups; mkvmerge by default keeps the existing track name, so if a remuxing group does not change it, the track name would be a false positive for the original group. The track name can generally ''help'' confirm the original source, for the purposes of attribution in the [[Groupentries:Status#Animegroup_Release_Note_Comments|Animegroup release Notes]]; we do not apply too much weight on this for determining who is the "last modified" group. | |||
| =====Level of evidence in determining determining group assignments===== | |||
| AniDB generally applies a high bar of evidence when making changes to the database; the evidence needs to show "beyond reasonable doubt", such as an 90%+ probability, that the change is valid -- this is comparable to the level of evidence required to convict a crime in a common law system. | |||
| However, AniDB also considers the practicality of finding this level of evidence; we recognize that fansubs are generally not well documented, and that this is especially true as the fansubs in question become more obscure or go further back in time.  To remedy this issue, as a general rule, we apply a lower bar of evidence for older fansubs; "the preponderance of the evidence", e.g. "more likely than not" (i.e. 51%+ probability) is sufficient to prove that the change is valid -- this is comparable to the level of evidence required to prove a civil claim in a common law system. | |||
| In summary: | |||
| * Older fansub = lower bar | |||
| * More obscure fansub = lower bar | |||
| ====Release date==== | ====Release date==== | ||
Revision as of 21:39, 15 February 2020
AniDB stores information (metadata) about digital files that at some point in history has been released on the Internet. This information is an integral part of AniDB and for many users the most important reason for using AniDB. Note that AniDB does not, never has and never will, have any copies of these files. Read the general disclaimer for more information on this.
As a user you do of course not have to use any feature related to files; AniDB offers many features that are completely unrelated to them. There is also the concept of generic files: If you want to add an anime to your MyList, but watched it on TV, have it on DVD or just don't care about registering actual files, see Files:Generic files.
Metadata
Technical information
Information that is of technical character and in most cases indisputable after verified by Avdump2. For track level information, see Content:Files:Video, Content:Files:Audio and Content:Files:Subtitle.
Type
The type of file. Possible values: video file, subtitle file, audio file, archive file, linker file, other.
Extension
Short name of the container format, see Wikipedia. All common and many uncommon video, audio, subtitle and archive file formats are supported; contact the staff if you have a file in an unusual file format we don't yet support.
Size
The actual size of the file in bytes.
Check/Hash sums
The hash sums are used for verification and identification purposes and includes CRC32, ED2K, MD5, SHA1 and TTH. Size+ED2K is the unique identifier used in AniDB.
Duration
The playback duration in hours:minutes:seconds.
Release information
Information related to the actual release of the file: group, release date, CRC status and version. If the origin of a file is unknown, i.e. no group, then most of these fields have no meaning and should therefore not be set. Release date can be set for no group files, but only when it can be reliably determined.
Group
AniDB stores the name and group tag of the group/person that produced and released a file. The short name is usually found as tags in file names, for example "[AonE]".
| Note | If a file is a joint release, like a file by e.g. ANBU-AonE, that joint group has its own group entry. The file should not be registered under either ANBU or AonE, but to the group that combines them both. | 
| Note | Not all RAW groups are credited. See which RAW groups to credit. | 
Assigning files to a group when files may have been modified
In general, AniDB applies a "last modified" doctrine when it comes to assigning groups; this is done via the "no lookthrough" rule. When assigning groups, assign the file to the last group that modified it; we do not trace it backwards to the "original" group.
--Example:
* [AonE] issues a release in 1080p h.264 hi10p. [Deadfish] then takes [AonE]'s release, and reencodes it from h.264 hi10p to 720p h.264 8p. [Deadfish] is the group that last modified, so the file is assigned to [Deadfish]. We do not "lookthrough" and assign the file to the original [AonE].
When determining whether a given file is a remux or not, it is not always clear, especially if the groups are obscure, the groups release anonymously, and/or the files in question are very old. Here are some common pitfalls/misconceptions:
- The watermark in the video cannot be relied upon when assigning groups; a reencode would keep the watermark of the original group, rather than reflect the reencoding group. The watermark can generally confirm the original source, for the purposes of attribution in the Animegroup release Notes; we do not apply too much weight on this for determining who is the "last modified" group.
- The names of tracks in the mkv metadata (e.g. name of the subtitle track) cannot be relied upon when assigning groups; mkvmerge by default keeps the existing track name, so if a remuxing group does not change it, the track name would be a false positive for the original group. The track name can generally help confirm the original source, for the purposes of attribution in the Animegroup release Notes; we do not apply too much weight on this for determining who is the "last modified" group.
Level of evidence in determining determining group assignments
AniDB generally applies a high bar of evidence when making changes to the database; the evidence needs to show "beyond reasonable doubt", such as an 90%+ probability, that the change is valid -- this is comparable to the level of evidence required to convict a crime in a common law system.
However, AniDB also considers the practicality of finding this level of evidence; we recognize that fansubs are generally not well documented, and that this is especially true as the fansubs in question become more obscure or go further back in time. To remedy this issue, as a general rule, we apply a lower bar of evidence for older fansubs; "the preponderance of the evidence", e.g. "more likely than not" (i.e. 51%+ probability) is sufficient to prove that the change is valid -- this is comparable to the level of evidence required to prove a civil claim in a common law system.
In summary:
- Older fansub = lower bar
- More obscure fansub = lower bar
Release date
The date when the group released the file. Usually provided on the group's homepage or on the relevant torrent tracker page. Release dates should be added according to the time of release in the UTC time zone, so sources whose timezone is known are preferred over ones displaying dates in unknown timezones.
CRC/hash status
Indicates whether the file entry in AniDB represents a perfect copy of the original file or not. You may request a change of this status if it hasn't been checked yet. Read more about file verification if you are unfamiliar with this subject.
| Note | Not all groups provide checksums for their files. This means that this status has no meaning and will have the value "not checked against official CRC/Hash". | 
Allowed sources for official hash
- Official .sfv (CRC checksum) or .md5 (MD5 checksum) files (good source)
- CRC listing on official page or in the topic of the group's IRC channel (good source)
- CRC tag in the filename:
- If the file is from the fansub group's official torrent or DDL, then the file name is acceptable. (good source)
- CRC tags added afterwards by third parties should not be considered a valid source.
- It should be noted that you should also check the actual calculated CRC of your file and compare it to the CRC in the filename to ensure that you don't yourself have a corrupted file.
 
No-lookthrough rule for CRC/hash verification
CRCs and other hashsums are only considered valid for verification if they are calculated directly from the target file in its entirety. This means that indirect checksums like the following ones are not to be considered:
- Checksums of a ZIP/RAR/etc. archive (or set of archives) containing the file in question, even if the archive only contains one file
- The inherent block-level checksums used by the BitTorrent protocol
Version
Sometimes the group makes mistakes and wants to correct them by releasing a second (third, fourth...) version, usually called v2 (v3, v4, ...). This field is usually based on information in the file name (where no information normally means version 1).
IMPORTANT: Even if a group releases two files of the same episode it does not mean that the first has to be marked v1 and the second has to be marked v2. There are other reasons to do something like that other than correcting mistakes, such as: different source (DTV/DVD), different sub languages, dual audio vs single audio releases, high and low quality versions, etc.
Discretion should be applied when "other reasons" are the cause of multiple releases. For example, if the group's intent is to release as a final product only a dual audio release, but they release a single audio release as a "v0" or "draft" version, then we should consider this single audio "draft" within the version continuity of any subsequent dual audio releases. However, if the group's intent is to release as a final product both a single and dual audio release, do not combine the versioning continuity.
| Note | We generally do not consider resolution changes and source changes as version bumps. That being said, you are free to add a file comment, indicating a file is marked as v2 and intended to replace a certain file. For example: Labelled as v2 of https://anidb.net/f2030447 | 
Other information
Other information that is set manually and in some cases disputable.
Censored
Only used and available for files with pornographic content. In rare borderline anime entries not classified as 18 Restricted, both uncensored and censored versions may still be available; in this case, include "Censored" or "Uncensored" in the file description field instead.
Source
The original source of the content. Possible values are; unknown, camcorder, TV (ie, analog TV), DTV, HDTV, VHS, VCD, SVCD, LD, DVD, HKDVD, HD-DVD, Blu-ray, www.
Look-through considerations for Source field
When considering source, we look at from which release by the original publisher/producer this file originated. For example, if a fansub released a file that was a rip from a web streaming website, but the web streaming website itself ripped its video from a BDMV, then the fansub-released file has its source marked as BD; the original publisher/producer source is the BD release, and not an HDTV release (i.e. video capture of a Japanese TV channel), and not a web simulcast (i.e. video rip from a streaming site like Crunchyroll, Funimation, or Daisuki).
Source field for standalone subtitle files
- When standalone subtitle files are released together with video files and the subtitles come from the same source as the video, mark the subtitles as the same source as the video files.
- When standalone subtitle files are released together with video files and the subtitles come from a different source than the video, mark the subtitles as the source where the subtitles came from, and apply look-through rules.
- For example, [Dmon] releases BD source files, and attached to this release are standalone subtitle files ripped from [HorribleSubs] (and possibly edited by the fansub). Since the standalone subtitles are not from the BD, their actual source is documented. Applying the look-through rules; [HorribleSubs] rips from Crunchyroll, so the standalone subtitle files are set as www.
- If the source of the subtitle is an original fan translation, mark the subtitle file source the same source as the original fan translation's release.
 
- For standalone subtitle files released on their own without video files:
- If the subtitles are 100% fan translations, mark the source as unknown.
- If the subtitles were ripped from a web streaming service such as Crunchyroll, either a direct rip or an indirect rip by extracting from another fansub's release (i.e. taking the subtitle track from [HorribleSubs]' rip of Crunchyroll), mark the source as www.
- If the subtitles were ripped from a home media release (LD, VHS, VCD, DVD, BD, etc), either directly from the home media or indirectly by extracting from another fansub's release (i.e. taking the subtitle track from [Exiled-Destiny]'s rip of a BD), mark the source as the corresponding media type.
 
Quality field for standalone subtitle files
The quality field is intended to indicate encode quality for video and audio; as such it is much less meaningful for standalone subtitle files.  In consideration of this fact, the following general guidelines apply for the quality field as it relates to standalone subtitle files.
- When standalone subtitle files are released together with video files and the subtitles come from the same source as the video, mark the subtitles at the same quality level as the video files. If there are multiple concurrent video releases (i.e. 720p and 1080p) where each release has a different quality level, but the subtitles are meant to be used for both sets of videos, set the quality level of the subtitles to match the higher quality release's level.
- When standalone subtitle files are released together with video files and the subtitles come from a different source than the video, mark the subtitles at the same quality level as the originating source.
- For standalone subtitle files released on their own without video files:
- If the subtitles are 100% fan translations, mark the quality as unknown.
- If the subtitles were ripped from a web streaming service such as Crunchyroll, either a direct rip or an indirect rip by extracting from another fansub's release (i.e. taking the subtitle track from [HorribleSubs]' rip of Crunchyroll), mark the quality as unknown.
- If the subtitles were ripped from a home media release (LD, VHS, VCD, DVD, BD, etc), either directly from the home media or indirectly by extracting from another fansub's release (i.e. taking the subtitle track from [Exiled-Destiny]'s rip of a BD), mark the quality as unknown.
 
Quality for non-subtitle files
This is a very arbitrary field. It depends completely on the eye of the beholder. You should not put too much meaning into it, but rather use it as a general pointer of quality. Possible values are; eyecancer, very low, low, med, high, very high.
Also see: Votes:Animegroups
As a general guideline, refer to the below for a soft-rule convention that is generally followed.  The below table can be seen as a "default" or "starting point".  If a particular file has sub-par quality, mark down its qualid field relative to the default, as appropriate.  Also note, the historical context of the anime should be kept in mind.  For example. if an anime is released in the 70s, and the highest quality medium of release is LD, it may be appropriate to use the higher range of tehbump the quality field  up a level for the LD release, relative to the defaults below; the goal is to use the qualid field as a differentiator, to identify the LD release as the definitive release relative to other options that may be available (TV, VHS, etc).
Note: All items highlighted in red below are upscales by definition, as the specifications of the mentioned media do not reach into those resolution ranges.  Items in bold signify the more common qualid (if applicable), where more than 1 is specified.
Ultimately, use some discretion and common sense.
| Source | < 480p (or equivalent) | 480p or equivalents (440x486 (NTSC), 520x576 (PAL, SECAM), etc) | 720p | 1080p | 
|---|---|---|---|---|
| camcorder | very low, low | low | low, medium | medium, high, very high | 
| TV (ie, analog TV) | very low, low | low, medium | low, medium | medium | 
| VHS | very low, low | low, medium | low, medium | medium | 
| LD | low, medium | medium, high | high | high | 
| VCD | low, medium | medium, high | medium, high | medium, high | 
| SVCD | low, medium | medium, high | medium, high | medium, high | 
| HKDVD | low, medium | medium, high | medium, high | medium, high | 
| DTV | low, medium | medium, high | medium, high | medium, high | 
| DVD | low, medium | medium, high, very high | high, very high | high, very high | 
| www | low, medium | medium | high | very high | 
| HDTV | low, medium | medium | high | very high | 
| HD-DVD | medium | high | very high | very high | 
| Blu-ray | medium | high | very high | very high | 
| unknown | Use discretion | Use discretion | Use discretion | Use discretion | 
Description
Additional useful information about the file. See Content:Filecomments.
Statistics
This section provides some basic insight on how many users claim to have had this file and what kind of media they currently have it on. The following statistics are given:
- internal: The number of users who currently have the file on their hard disk.
- external: The number of users who have stored this file on an external storage, such as a NAS or a burned DVD.
- deleted: The number of users who formerly had this file but don't have it anymore.
- unknown: The number of users who haven't specified the location of this file on their mylist.
- total: The total number of users with this file on their mylist. Clicking on this link opens a detailed list of those users.
Do note that not all users share the storage status of their files publicly, or in some cases don't even allow to see you on the user list, and in this case the first four numbers may not sum up to the total number of users.
Registering new files
What is accepted?
Private / Personal files
The general rule is that file entries in AniDB are supposed to be useful for a larger group of users. Adding private files is therefore not allowed.
What is a private / personal file?
A private or personal file is a file that you create for yourself. It is a file that is not released publicly on the internet. For example, if you reencode a file for your own use, or you remux a bunch of files together to create a new file, these are private files as they are only available for your personal consumption.
As a general statement, if the files were not published on the internet, they likely should not be added to AniDB. "Published on the internet" includes, but is not limited towards:
- public trackers like Nyaa
- private trackers like BakaBT
- DDL links published on the fansub group's blog
- XDCC bots on IRC
Personal files are NOT to be added to AniDB.
Corruption
An entry for a corrupted file may still be useful for others as long as it's likely that more people have the exact same corrupted file as you have. (For example if one source sent the corrupted version to many people.)
If - and only if - this is the case, you can add the file normally to AniDB, selecting CRC does not match official source and/or Quality: Corrupted if there are visible/audible errors in the file.
| Warning | If the corruption is based on some hard- or software-problem on your side, or if you manually edited the file, you should not add them to AniDB as they serve no purpose for other users. | 
Don't worry - you can still list such files properly in your MyList. For example: You have a corrupted version of group X's release. Navigate to the corresponding episode where the regular, uncorrupted version of group X's release is listed and click on the "add to mylist" icon on the far right. On the next page, you can select the "Type: corrupted version/invalid CRC" so that your MyList entries properly indicate which version you have.
If there's no CRC-valid version of group X added for the release you have, consider adding the episodes' generic files to your list instead.
Remuxed files
- They have to be wide-spread (see above); private files are not allowed.
- Do not mark them with the original group. Use no group or make your own group, adding the original group's name the subtitles/video are taken from to the file's description.
- Adding such files is generally frown upon unless some value was actually added in the remuxing process.
How to add
| Important | Please consult Tutorial:How_to_Add_Files_for_Dummies! if this is your first time adding files. | 
Since files in AniDB are tied with an actual episode, the place where you add files is at the far right of each episode entry. Click the "add file" icon that looks like this:  , and it will take you to a new page where you can fill in the information about the file you want to add. For video files, the most important fields are: ED2K-link, CRC/CRC Status, Release group and Audio and Subtitle languages.
, and it will take you to a new page where you can fill in the information about the file you want to add. For video files, the most important fields are: ED2K-link, CRC/CRC Status, Release group and Audio and Subtitle languages.
Alternatively you can use the mass add file page which is linked at the bottom of every anime page (massf).
Furthermore, it's strongly suggested (we basically demand it) to use Avdump2 to parse the file before or after adding it to AniDB to get accurate metadata and to mark the file as verified (which causes some data of the file to stay locked and non-creq'able).
- The description is missing or severely incomplete.If you can, please help by explaining it.
Editing file entries
For editing see How to get started with creqing.
Removing file entries
If you added a bad entry you can also remove it as long as no one else has added it to his/her MyList. Otherwise, you have to issue a delete request.