Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0373 |
Symbol | |
ID | 7409303 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 425493 |
End bp | 426755 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643714759 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_002572282 |
Protein GI | 222528400 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000000321341 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTCAAA TGAGCTTAGC AAAACAAGGG ATATTTACAA GAGAGATGGA GCTTGCCATA AAAAATGAGG AGATAGATAA AGAAGAGTTT TTGCAAAAGG TTGCAGAGGG CAAAATTGTA ATTCCTGCGA ATAAGAATAG AAAAAGAGAC AAATATTTTG CCATTGGCGA TGGAACATAT GTCAAGATAA ATGTTAATCT TGGTGTGTCA GAGGCATGTC CAAACTTTGA TTTAGAGCGC CAAAAGCTTG AACTTGCCAA AAAATTTGAT GTTGAATCTG TGATGGATTT ATCGAGCGGG CTTGATGCTT CAAACTTTAG AAAATACATT CTTCAAAACT ATGACTTTAT AGTAGGAACA GTTCCAGTTT ACCAGGTTGC ATCAAGGCAC GACGACATCA CAAAGATTGA CAGTAAAGAG TTTATAGAAG AGATTGAAAG GCAGGCAGAA GAAGGAGTTG ACTTTTTCAC AATCCATGCA GGAATTACAA GAAGGACTTT GGAGAGGTTT GAAAAAAATG AGCGTCTGCT CAAGATTGTC TCAAGAGGCG GAGCACTTCT TTATAAATGG ATGATGGCAA ACCGAAAAGA AAATCCTTTG TATGAGCATT TTGATGAGAT TTTGAAGATA TGCAAAAAAC ATGATGTTAC AATCAGCCTT GGCGATAGTC TAAGACCAGG TGCGGTGGCT GATGCAACAG ACGCGCTGCA GATAGAAGAA CTTATAAACC TTGGCGAGCT TACAAAAATG GCTTGGAAAG AAGATGTGCA GGTGATGATA GAAGGGCCAG GGCACATGAG AGCAAATGAG ATTGCAGCAA ACATGGTAAT TCAAAAGAGG CTTTGCCACG GCGCACCGTT TTATGTCTTG GGGCCTCTTA CGACAGACAT TGCAGCTGGC TATGACCATA TCTCAGGTGC GATGGGGGCG CTCATTGCAG CTTTAAATGG AGCAGATTTT CTGTGCTATG TGACACCTGC TGAACATTTG AGGCTTCCAT CTTTAGAGGA TGTCAAAGAA GGAATTGTTG CGTTCAAAAT TGCAGCGCAC AGTGCAAATA TAGCAAAAGG GTTCAAAAAG CCGCTTGAAA AGGATATTGA GATGTCAGTA GCAAGAAGAG ACCTTGACTG GGAAAAGATG ATAAGCCTTT CGGTTGACCC TGAGAAGGCA AGAGAGTACA GAAGCAGTTT CACATCTGAT ACATGTTCAA TGTGTGGAAG ACTCTGCGCT GTAAAAAATT CAAGGGATGA AGCTGTTTTA TAA
|
Protein sequence | MTQMSLAKQG IFTREMELAI KNEEIDKEEF LQKVAEGKIV IPANKNRKRD KYFAIGDGTY VKINVNLGVS EACPNFDLER QKLELAKKFD VESVMDLSSG LDASNFRKYI LQNYDFIVGT VPVYQVASRH DDITKIDSKE FIEEIERQAE EGVDFFTIHA GITRRTLERF EKNERLLKIV SRGGALLYKW MMANRKENPL YEHFDEILKI CKKHDVTISL GDSLRPGAVA DATDALQIEE LINLGELTKM AWKEDVQVMI EGPGHMRANE IAANMVIQKR LCHGAPFYVL GPLTTDIAAG YDHISGAMGA LIAALNGADF LCYVTPAEHL RLPSLEDVKE GIVAFKIAAH SANIAKGFKK PLEKDIEMSV ARRDLDWEKM ISLSVDPEKA REYRSSFTSD TCSMCGRLCA VKNSRDEAVL
|
| |