Gene Information |
Locus tag | Athe_0517 |
Symbol | |
ID | 7408641 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 584660 |
End bp | 585898 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643714899 |
Product | protein of unknown function DUF195 |
Protein accession | YP_002572416 |
Protein GI | 222528534 |
COG category | [S] Function unknown |
COG ID | [COG1322] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0973861 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGAAGTAG TTTTGCTAAT TGTTGCTATT GTTCTTGTCA TATCAAACTT GATTTTGTTA ATAAGGCTTA AAAATAATAT AAATTCTTCT TTGGATACTC AAAATAAACT GTTAGAGATT GAAAAAGAAC TTGAACAAAT TCAAAATTCT ATCTCACAGC AATTTTCTCA GAATAAAAAT GAAATGCAAA ATATAATAAG CTCATTTGGC AGCATTTTAA TGACAAGATT TTCAGATCTA TCCAATCAGA TAATAAATTT TACATCATCA AGTCAGGAAA GGCTTGACAG TATCCGAAAA GAGATAGATA GTAAGCTTGA GAAAATACGA GAGACTGTTG ACAGCCAGCT ACAAAGCACA TTAGAGACAA AACTTTCGCA GTCTTTCAAG CTTGTATCAG AGCGTCTGGA GCTTGTCCAC AGAGGGCTTG GTGAGATGCA GGCCCTGGCC GGAAGTGTTG GAGACCTTAA AAAGATTTTG AGCAATGTAA AGGTTCGTGG AACACTTGGT GAGATTCAGC TTGGCAATAT CATAGACCAG ATTTTGGATC AATCACAGTA CGAAAGAAAT GTCAGGATAA AACCGCACAC TCAAGAGCAA GTTGAGTTTG CAATAAAGAT TCCTTCTAAA AATTCAAAAG ATAATGAATT TATATACCTT CCAATAGACT CCAAATTCCC CATAGAAAGT TATCAGCGGC TTATTGAGGC GCAGGAGAAA GCGGAGACAG AAGAAGTTGC AAGATTTTCG AAGGAGCTTG AAAATAGTAT AAGACAGAAT GCAAAGACTA TAAAGGAAAA GTACATAGAC CCGCCTAAAA CAACAGATTT TGCTATCATG TTTTTGCCCT CTGAAGGGCT TTATGCAGAG GTGCTGAAGA TACCCGGGCT GTTTGAGTCT GTGCAAAGGG AATACAAGGT AATTATTGCA GGACCTACAA CAGTTGTTGC AATGCTCAAC ACCATTTCGC TTGGATTTAA AACTTTTGCT ATTGAAAAGA GAACAAATGA GATCTGGGAG CTTTTGTCTG CCGTCAAGAC TGAGTTTTCA AGGTTTGCTG AGATTCTTGA AAAGGTTAAA AAGAAGCTTT CTGAAGCGCA GGATACAATT GACACTGCAA CAAGAAAGAC AAGAACTATA GAAAGAAAGC TTAAAAGTGT TGAGACCCTC TCTTCAGAAA AAGATATAGA TATGATTCTT TATGATGAGG AAGCTATTGA AGAAGGTTCA GGGAAATAA
|
Protein sequence | MEVVLLIVAI VLVISNLILL IRLKNNINSS LDTQNKLLEI EKELEQIQNS ISQQFSQNKN EMQNIISSFG SILMTRFSDL SNQIINFTSS SQERLDSIRK EIDSKLEKIR ETVDSQLQST LETKLSQSFK LVSERLELVH RGLGEMQALA GSVGDLKKIL SNVKVRGTLG EIQLGNIIDQ ILDQSQYERN VRIKPHTQEQ VEFAIKIPSK NSKDNEFIYL PIDSKFPIES YQRLIEAQEK AETEEVARFS KELENSIRQN AKTIKEKYID PPKTTDFAIM FLPSEGLYAE VLKIPGLFES VQREYKVIIA GPTTVVAMLN TISLGFKTFA IEKRTNEIWE LLSAVKTEFS RFAEILEKVK KKLSEAQDTI DTATRKTRTI ERKLKSVETL SSEKDIDMIL YDEEAIEEGS GK
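The numeric fields in this record can be cross-checked against one another. The sketch below is illustrative only and is not part of the source database. It assumes the Start bp and End bp coordinates are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon, which is consistent with "Is gene spliced: No" and the trailing TAA in the gene sequence. The function names are hypothetical. Note that the gene begins with TTG, which translation table 11 accepts as an alternative start codon read as methionine, matching the leading M of the protein sequence.

```python
# Values copied from the record above.
START_BP = 584660
END_BP = 585898
GENE_LENGTH_BP = 1239
PROTEIN_LENGTH_AA = 412


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C among A/C/G/T characters; spaces and other symbols are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Passing the full "Gene sequence" string to `gc_percent` gives a value to compare with the reported GC content; the function skips the spaces that group the bases in blocks of ten.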
|
| |