Gene Athe_1946 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1946 
Symbol 
ID7407360 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2058469 
End bp2059626 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content37% 
IMG OID643716318 
ProductUDP-N-acetylglucosamine 2-epimerase 
Protein accessionYP_002573806 
Protein GI222529924 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0381] UDP-N-acetylglucosamine 2-epimerase 
TIGRFAM ID[TIGR00236] UDP-N-acetylglucosamine 2-epimerase 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAAAGA CACTCATTGT ATTTGGAACA AGACCTGAGG CAATAAAGAT GGCACCACTT 
GTAAAAGAAT TACAGGAAGA TTCTAATTTT GATGTTAAAG TATGTGTCAC AGCACAGCAC
AGACAGATGC TTGACCAGGT ACTTGAGATT TTTGATATAA AGCCTGGTTA TGACCTTGAC
ATAATGAAAT ACAACCAAAG CTTATTTTCT ATAACTTCTG ATGTGCTTTT GAGATTTGAA
AAGGTTTTGG AGAAAGAAAG ACCAGACATT GTGCTTGTCC ATGGCGACAC AACAACCACA
TTTGCATCAG CACTTGCAAG TTTTTATTTT AAAACAAAGG TAGGACATGT TGAGGCGGGT
TTGAGAACAT ATAATAAATA TTCACCTTTT CCAGAAGAGA TGAACAGAAA GCTTACGGCA
GCGCTGAGCG ACCTTCATTT TGCCCCCACA AAAAAGGCGA AGTTAAATTT GATGGCAGAA
GGAGTAAAAG AAGAAAGCAT CTTTGTGACT GGTAACACTG TGATTGATAC ACTTAAATTT
ACTGTGAAAG AAGACTATGT GTTCAAGGAA GATAGTCTAA ACAATATAAA TTTTTCAAAG
AGAGTAATTC TTCTCACTGC CCACAGGAGA GAAAATTTTG GAAAACCGCT TGAGAATATT
TTTGAGGCTG TCCTGAAGAT TGCAAATGAG TTTGACGATG TAGTTTTTGT ATATCCTGTA
CATCTAAATC CGAATGTCAA GAATGTTGCG TACAGGATTT TAAAAGACCA TCCAAGGATA
AAACTGATAA ATCCAATTGA CGTTGATGAT ATGCACAACC TCATTGCAAG AAGCTATTTG
GTTTTAACAG ACTCTGGTGG GCTTCAGGAA GAAGCACCGT CTTTGGGGAA ACCTGTTGTT
GTTCTGCGCG ACACAACAGA AAGACCAGAA GCTGTTTTAG CAAAGACAGT TGTAGTTGCA
GGGACACAGA AAGAAAGGAT AATTCAGATT GTTACAAAAC TCTTAACAGA TGAAGAAGAG
TATCTTAAGA TGGCAAAAGC TATAAACCCT TATGGTGATG GAAATGCCTC AAAAAGAATA
AAAGAAGCAC TTTTATTTTA TTTTGGGAAA AGCAGTAAAA AACCTGAGGA ATTTTGTGGA
AGCGATATGT TTGTGTAA
 
Protein sequence
MIKTLIVFGT RPEAIKMAPL VKELQEDSNF DVKVCVTAQH RQMLDQVLEI FDIKPGYDLD 
IMKYNQSLFS ITSDVLLRFE KVLEKERPDI VLVHGDTTTT FASALASFYF KTKVGHVEAG
LRTYNKYSPF PEEMNRKLTA ALSDLHFAPT KKAKLNLMAE GVKEESIFVT GNTVIDTLKF
TVKEDYVFKE DSLNNINFSK RVILLTAHRR ENFGKPLENI FEAVLKIANE FDDVVFVYPV
HLNPNVKNVA YRILKDHPRI KLINPIDVDD MHNLIARSYL VLTDSGGLQE EAPSLGKPVV
VLRDTTERPE AVLAKTVVVA GTQKERIIQI VTKLLTDEEE YLKMAKAINP YGDGNASKRI
KEALLFYFGK SSKKPEEFCG SDMFV