Gene Athe_0373 details


Gene Information

Locus tag: Athe_0373
Symbol:
ID: 7409303
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 425493
End bp: 426755
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 41%
IMG OID: 643714759
Product: thiamine biosynthesis protein ThiC
Protein accession: YP_002572282
Protein GI: 222528400
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
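
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon while the protein length does not. Both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(425493, 426755)
print(length)                     # 1263
print(protein_length_aa(length))  # 420
```

Both results match the Gene Length and Protein Length fields reported for Athe_0373.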


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000321341
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTCAAA TGAGCTTAGC AAAACAAGGG ATATTTACAA GAGAGATGGA GCTTGCCATA 
AAAAATGAGG AGATAGATAA AGAAGAGTTT TTGCAAAAGG TTGCAGAGGG CAAAATTGTA
ATTCCTGCGA ATAAGAATAG AAAAAGAGAC AAATATTTTG CCATTGGCGA TGGAACATAT
GTCAAGATAA ATGTTAATCT TGGTGTGTCA GAGGCATGTC CAAACTTTGA TTTAGAGCGC
CAAAAGCTTG AACTTGCCAA AAAATTTGAT GTTGAATCTG TGATGGATTT ATCGAGCGGG
CTTGATGCTT CAAACTTTAG AAAATACATT CTTCAAAACT ATGACTTTAT AGTAGGAACA
GTTCCAGTTT ACCAGGTTGC ATCAAGGCAC GACGACATCA CAAAGATTGA CAGTAAAGAG
TTTATAGAAG AGATTGAAAG GCAGGCAGAA GAAGGAGTTG ACTTTTTCAC AATCCATGCA
GGAATTACAA GAAGGACTTT GGAGAGGTTT GAAAAAAATG AGCGTCTGCT CAAGATTGTC
TCAAGAGGCG GAGCACTTCT TTATAAATGG ATGATGGCAA ACCGAAAAGA AAATCCTTTG
TATGAGCATT TTGATGAGAT TTTGAAGATA TGCAAAAAAC ATGATGTTAC AATCAGCCTT
GGCGATAGTC TAAGACCAGG TGCGGTGGCT GATGCAACAG ACGCGCTGCA GATAGAAGAA
CTTATAAACC TTGGCGAGCT TACAAAAATG GCTTGGAAAG AAGATGTGCA GGTGATGATA
GAAGGGCCAG GGCACATGAG AGCAAATGAG ATTGCAGCAA ACATGGTAAT TCAAAAGAGG
CTTTGCCACG GCGCACCGTT TTATGTCTTG GGGCCTCTTA CGACAGACAT TGCAGCTGGC
TATGACCATA TCTCAGGTGC GATGGGGGCG CTCATTGCAG CTTTAAATGG AGCAGATTTT
CTGTGCTATG TGACACCTGC TGAACATTTG AGGCTTCCAT CTTTAGAGGA TGTCAAAGAA
GGAATTGTTG CGTTCAAAAT TGCAGCGCAC AGTGCAAATA TAGCAAAAGG GTTCAAAAAG
CCGCTTGAAA AGGATATTGA GATGTCAGTA GCAAGAAGAG ACCTTGACTG GGAAAAGATG
ATAAGCCTTT CGGTTGACCC TGAGAAGGCA AGAGAGTACA GAAGCAGTTT CACATCTGAT
ACATGTTCAA TGTGTGGAAG ACTCTGCGCT GTAAAAAATT CAAGGGATGA AGCTGTTTTA
TAA
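
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction that could be compared with the GC content field of the record. The helper names are hypothetical, and only the first row of the listing is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence listing, copied as printed.
first_row = "ATGACTCAAA TGAGCTTAGC AAAACAAGGG ATATTTACAA GAGAGATGGA GCTTGCCATA"
seq = clean_sequence(first_row)
print(len(seq))                    # 60
print(round(gc_fraction(seq), 2))  # GC fraction of this row only, not of the whole gene
```

Passing the full 1263-base listing through the same two helpers would give the whole-gene value.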
 
Protein sequence
MTQMSLAKQG IFTREMELAI KNEEIDKEEF LQKVAEGKIV IPANKNRKRD KYFAIGDGTY 
VKINVNLGVS EACPNFDLER QKLELAKKFD VESVMDLSSG LDASNFRKYI LQNYDFIVGT
VPVYQVASRH DDITKIDSKE FIEEIERQAE EGVDFFTIHA GITRRTLERF EKNERLLKIV
SRGGALLYKW MMANRKENPL YEHFDEILKI CKKHDVTISL GDSLRPGAVA DATDALQIEE
LINLGELTKM AWKEDVQVMI EGPGHMRANE IAANMVIQKR LCHGAPFYVL GPLTTDIAAG
YDHISGAMGA LIAALNGADF LCYVTPAEHL RLPSLEDVKE GIVAFKIAAH SANIAKGFKK
PLEKDIEMSV ARRDLDWEKM ISLSVDPEKA REYRSSFTSD TCSMCGRLCA VKNSRDEAVL