Gene GWCH70_2734 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_2734 
Symbol 
ID7976550 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp2772417 
End bp2773499 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content41% 
IMG OID644799531 
Productbifunctional 3-deoxy-7-phosphoheptulonate synthase/chorismate mutase 
Protein accessionYP_002950690 
Protein GI239828066 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase
[TIGR01801] chorismate mutase domain of gram positive AroA protein 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAATA AGAGATTAGA TGAGCTACGG GCAAAGGTGG ATGAGATTAA CTTACAAATT 
TTAAAATTAA TTAATGAACG AGGGAGACTT GTTCAAGAAA TTGGAAAAAT CAAGGAAACG
CAAGGAACAT ATCGTTATGA CCCAGTGCGT GAACGGAAAA TGCTTGATTT AATTTCTGAG
CACAACGATG GACCATTTGA AACATCGACA TTGCAGCATA TTTTTAAAGA AATTTTTAAA
GCTGGTCTTG AACTGCAAGA AGATGATCAT CGTAAAGCAT TGCTTGTATC GCGCAAGAAG
CATCCGGAAA ATACGATTGT TGATGTAAAA GGCGAAAAAA TTGGCGACGG CAACCAATAT
TTTGTGATGG GACCGTGTGC GGTCGAAAGT TATGAACAAG TTGCGGCTGT TGCAAAAGCG
GTGAAGAAAC AAGGATTAAA ACTTCTTCGC GGCGGTGCGT ACAAACCGAG AACATCGCCA
TATGATTTCC AAGGACTAGG CGTGGAAGGA TTAAAAATTT TAAAACGAAT TGCCGATGAG
TTTGACTTAG CTGTGATTAG TGAAATTGTC ACCCCTGCGG ATATTGAAAT AGCGCTAGAC
TATATTGATG TGATTCAAAT TGGTGCGCGC AACATGCAAA ACTTTGAGCT TTTAAAAGCG
GCAGGCCAAG TGAACAAGCC AATTTTGTTA AAACGCGGGC TAGCGGCAAC GATTGAAGAA
TTCATTAATG CGGCAGAGTA CATTATGTCG CAAGGAAACG GTCAAATTAT TCTTTGTGAA
CGCGGTATTC GCACATATGA GCGCGCGACA AGAAATACGT TGGATATTTC TGCGGTGCCA
ATTTTAAAGA AAGAAACACA CTTGCCTGTA TTGGTTGATG TTACTCATTC AACAGGCCGT
CGTGACTTAT TAATTCCTTG TGCGAAAGCA GCGTTAGCAA TTGGCGCGGA TGGAGTAATG
GCAGAGGTAC ATCCAGATCC AGCGGTTGCA TTATCGGATT CGGCACAACA AATGGATATT
GCTCAATTTA ATGAATTTAT GGAAGAAATA AGAGCGTTCC AGCGGCAAAT GGTAAAAGCA
TAA
 
Protein sequence
MSNKRLDELR AKVDEINLQI LKLINERGRL VQEIGKIKET QGTYRYDPVR ERKMLDLISE 
HNDGPFETST LQHIFKEIFK AGLELQEDDH RKALLVSRKK HPENTIVDVK GEKIGDGNQY
FVMGPCAVES YEQVAAVAKA VKKQGLKLLR GGAYKPRTSP YDFQGLGVEG LKILKRIADE
FDLAVISEIV TPADIEIALD YIDVIQIGAR NMQNFELLKA AGQVNKPILL KRGLAATIEE
FINAAEYIMS QGNGQIILCE RGIRTYERAT RNTLDISAVP ILKKETHLPV LVDVTHSTGR
RDLLIPCAKA ALAIGADGVM AEVHPDPAVA LSDSAQQMDI AQFNEFMEEI RAFQRQMVKA