Gene Athe_0207 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0207 
Symbol 
ID7407198 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp255267 
End bp256511 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content43% 
IMG OID643714608 
ProductUDP-N-acetylglucosamine 1-carboxyvinyltransferase 
Protein accessionYP_002572131 
Protein GI222528249 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0766] UDP-N-acetylglucosamine enolpyruvyl transferase 
TIGRFAM ID[TIGR01072] UDP-N-acetylglucosamine 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGAAAT TTGTAATAGA AGGTGCAAAT AAGATTTCTG GCTCAATAAA GGTTCACAGC 
TCGAAAAATG CGATTTTACC ACTTTTGACA GCATCTTTGC TGTCGAATGA TATGGTTGAA
ATTCTGGATG TGCCGCTTTT GGAAGACATC AAGGTGCTTT TGCAGCTTCT TTCACATTGT
GGGTGCGATG TAAAAATTGA GGACGGCAAA CTGCAAATCA CTCCTGATAT CAAAAACGGT
GATATAAACG TAGATGGGGT AAACAAACTC AGAGCTTCAA TACTTCTTTT AGGAAGCCTT
CTTGCGCGTG GGAAAAAGGT TGTGCTGGGT ATGCCGGGCG GGTGTAACAT TGGCACACGA
CCCATTGACC TTCACATAAA AGGGCTTTCC CAGCTTGGTG CAGAGATTAA ACTCAATCAA
GGGTACATTG AAGCAAAGGT TAAAAAGCTA AAAGGTGCTA AGGTATATTT AGACTTTCCG
TCCGTGGGTG CAACAGAAAA TATCATGATA GCAGCTGTTT TAGCAGAGGG GGAGACCATA
ATAGAAAACG CTGCGACAGA GCCAGAGGTT GTGTGTCTTG CAAATTTTTT AAACTCAATG
GGCGCAAAAG TGATGGGTGC CGGCACTGAT ACCATCAGGG TTTTGGGTGT GAAAAAACTT
TTTGGCTGCT CATTTGTTCC AATTCCAGAC AGGATTGAAG CAGGGACGTA CATGGCAATG
GCTGCAATGT GTGGTGGGGA GCTTGAACTT ACAAATGTGA TTTGTGAGCA TATAAGGTCT
ATTGTTGCCA AGTTCAAAGA AAGTGGTGTG AAGGTCTATG AAGGGGAAAA TGCTGTGCGC
GTTGAAGCAC CCGAAAGGAT TTTGGCAACA GACATAAAAA CCCTGCCTTA CCCTGGTTTT
CCGACTGATA TGCAGGCACC CATGATGAGT ATGCTAAGTA TAGCAAAAGG CACAAGCGTT
ATCATTGAGA CAATATTTGA AAACAGATTT TTGCACGTTG GAGAGCTTGT CAAAATGGGT
GCGAATATAA AAGTGGAAGG AAGAGTAGCA GTGGTTGAAG GTGTTAAAAA GCTCAAAGGT
GCAAAGGTTG AAGCAAAAGA CTTGCGCGGT GGTGCTGCAC TTGTTTTGGC AGCGCTCTGT
GCAGAAGGTG TAAGCGAGAT TGAAGGTGCA TCTCACGTTG ACAGGGGGTA TTTTGAATTT
GAGAAAAACA TTACAAGTCT TGGGGGTAAG ATAAAGAGAG TTTAA
 
Protein sequence
MSKFVIEGAN KISGSIKVHS SKNAILPLLT ASLLSNDMVE ILDVPLLEDI KVLLQLLSHC 
GCDVKIEDGK LQITPDIKNG DINVDGVNKL RASILLLGSL LARGKKVVLG MPGGCNIGTR
PIDLHIKGLS QLGAEIKLNQ GYIEAKVKKL KGAKVYLDFP SVGATENIMI AAVLAEGETI
IENAATEPEV VCLANFLNSM GAKVMGAGTD TIRVLGVKKL FGCSFVPIPD RIEAGTYMAM
AAMCGGELEL TNVICEHIRS IVAKFKESGV KVYEGENAVR VEAPERILAT DIKTLPYPGF
PTDMQAPMMS MLSIAKGTSV IIETIFENRF LHVGELVKMG ANIKVEGRVA VVEGVKKLKG
AKVEAKDLRG GAALVLAALC AEGVSEIEGA SHVDRGYFEF EKNITSLGGK IKRV