Gene GWCH70_3331 details


Gene Information

Locus tag: GWCH70_3331
Symbol:
ID: 7979223
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 3355404
End bp: 3356690
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 48%
IMG OID: 644800098
Product: UDP-N-acetylglucosamine 1-carboxyvinyltransferase
Protein accession: YP_002951237
Protein GI: 239828613
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0766] UDP-N-acetylglucosamine enolpyruvyl transferase
TIGRFAM ID: [TIGR01072] UDP-N-acetylglucosamine 1-carboxyvinyltransferase
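
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, and the relationship can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; both are assumptions that happen to fit the reported numbers, not something the record states. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3355404
END_BP = 3356690
GENE_LENGTH_BP = 1287
PROTEIN_LENGTH_AA = 428


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))     # 1287
    print(coding_length_aa(GENE_LENGTH_BP))  # 428
```

Under these assumptions, 3356690 - 3355404 + 1 = 1287 bp, and 1287 / 3 = 429 codons, one of which is the stop codon, leaving 428 residues.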


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.683565
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAAAAA TCAAAATTAT CGGCGGTGAT CCGCTGCAGG GAACGATCAA GGTAAGCGGC 
GCAAAAAATA GCGCCGTTGC CCTCATCCCT GCTACGATTC TCGCTGATTC ACCGGTTACA
ATCGAAGGAT TGCCGGACAT TTCTGATGTG CGAATTTTAG GCGACTTAAT TAAAGAGATT
GGCGGAACGT TCCATTTCGA TGGCAAAAAA GCGGTCATCG ATCCGACCAA TATGGTACCG
ATGCCGCTGC CGAATGGAAA AGTAAAAAAA TTGCGTGCTT CGTATTATTT AATGGGAGCA
ATGCTTGGTC GTTTTAAAAA AGCGGTTGTC GGGCTGCCAG GAGGCTGCCA TCTAGGTCCG
CGCCCGATTG ACCAGCATAT TAAAGGCTTT GAGGCGCTAG GAGCGAAAGT AACGAACGAG
CAAGGTGCGA TTTATTTGCG CGCGGAGGAA TTGCGAGGTG CCCGTATTTT TTTAGATGTG
GTAAGCGTAG GGGCAACGAT TAACATCATG TTGGCCGCGG TGCGCGCCAA AGGCCGGACG
ATTATTGAAA ACGCTGCAAA AGAGCCGGAA ATTATTGATG TGGCGACATT GCTTTCCAAC
ATGGGAGCAA AAATTAAAGG CGCCGGAACC GATGTCATTC GCATCGACGG TGTTGAGAAA
TTATCAGGAT GTCGTCATTC GATTATTCCG GACCGCATTG AGGCTGGTAC ATATATGATT
GCTGCGGCAG CGATGGGGAA AGAAGTAGTC GTTGATAACG TTATTCCTCA GCATGTTGAA
TCATTGATCG CAAAATTGCG CGAAATGGGC GTGCATGTAG AAACGAGCGA CGATCAAATC
CTTGTTTCCA GTGCACCAAC TTTAAAAGCA GTGGACGTGA AAACGCTTGT TTATCCTGGT
TTTCCAACCG ACTTACAGCA GCCGTTTACA GCGCTTTTAA CAAAAGCGCA CGGGACAAGC
GTTGTCACGG ATACGATTTA TAGCGCCCGC TTTAAGCATG TCGATGAACT TCGCAGAATG
AATGCGAACA TAAAGGTGGA AGGTCGTTCC GCCATTATTA CCGGTCCGGT TCGGCTACAG
GGCGCAAAAG TAAAAGCGAG CGATTTGCGC GCAGGCGCAG CGCTTGTGGT TGCTGGTTTA
ATGGCACAAG GGCTTACGGA AATCACGGGA GTGGAGCACA TTGACCGCGG ATACAGCAAT
CTTGTCGAAA AGTTAAATAG CATAGGAGCA ACGATTTGGC GAGAAAAAAT GACGGACGAA
GAGATTGAAC AAGTCAAAAA TGCATAG
 
Protein sequence
MEKIKIIGGD PLQGTIKVSG AKNSAVALIP ATILADSPVT IEGLPDISDV RILGDLIKEI 
GGTFHFDGKK AVIDPTNMVP MPLPNGKVKK LRASYYLMGA MLGRFKKAVV GLPGGCHLGP
RPIDQHIKGF EALGAKVTNE QGAIYLRAEE LRGARIFLDV VSVGATINIM LAAVRAKGRT
IIENAAKEPE IIDVATLLSN MGAKIKGAGT DVIRIDGVEK LSGCRHSIIP DRIEAGTYMI
AAAAMGKEVV VDNVIPQHVE SLIAKLREMG VHVETSDDQI LVSSAPTLKA VDVKTLVYPG
FPTDLQQPFT ALLTKAHGTS VVTDTIYSAR FKHVDELRRM NANIKVEGRS AIITGPVRLQ
GAKVKASDLR AGAALVVAGL MAQGLTEITG VEHIDRGYSN LVEKLNSIGA TIWREKMTDE
EIEQVKNA
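
The protein sequence above corresponds to the gene sequence read in frame from its first base, and the GC content field is the fraction of G and C bases in the gene sequence. The following is a minimal sketch of both calculations, not the method used by the database. It assumes the sequence is pasted as text with the spacing shown in this record, and it relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in which codons may act as start codons). The helper names `clean`, `gc_content` and `translate` are made up for illustration.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the display layout."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    first_block = "ATGGAAAAAA TCAAAATTAT CGGCGGTGAT"
    print(translate(first_block))  # MEKIKIIGGD
```

Passing the full gene sequence to `translate` and `gc_content` allows the result to be compared with the Protein sequence and GC content fields of this record. The sketch does not handle alternative start codons such as GTG or TTG, which table 11 translates as methionine at the start position; this gene begins with ATG, so that case does not arise here.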