Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_2734 |
Symbol | |
ID | 7976550 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2772417 |
End bp | 2773499 |
Gene Length | 1083 bp |
Protein Length | 360 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 644799531 |
Product | bifunctional 3-deoxy-7-phosphoheptulonate synthase/chorismate mutase |
Protein accession | YP_002950690 |
Protein GI | 239828066 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase [TIGR01801] chorismate mutase domain of gram positive AroA protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAATA AGAGATTAGA TGAGCTACGG GCAAAGGTGG ATGAGATTAA CTTACAAATT TTAAAATTAA TTAATGAACG AGGGAGACTT GTTCAAGAAA TTGGAAAAAT CAAGGAAACG CAAGGAACAT ATCGTTATGA CCCAGTGCGT GAACGGAAAA TGCTTGATTT AATTTCTGAG CACAACGATG GACCATTTGA AACATCGACA TTGCAGCATA TTTTTAAAGA AATTTTTAAA GCTGGTCTTG AACTGCAAGA AGATGATCAT CGTAAAGCAT TGCTTGTATC GCGCAAGAAG CATCCGGAAA ATACGATTGT TGATGTAAAA GGCGAAAAAA TTGGCGACGG CAACCAATAT TTTGTGATGG GACCGTGTGC GGTCGAAAGT TATGAACAAG TTGCGGCTGT TGCAAAAGCG GTGAAGAAAC AAGGATTAAA ACTTCTTCGC GGCGGTGCGT ACAAACCGAG AACATCGCCA TATGATTTCC AAGGACTAGG CGTGGAAGGA TTAAAAATTT TAAAACGAAT TGCCGATGAG TTTGACTTAG CTGTGATTAG TGAAATTGTC ACCCCTGCGG ATATTGAAAT AGCGCTAGAC TATATTGATG TGATTCAAAT TGGTGCGCGC AACATGCAAA ACTTTGAGCT TTTAAAAGCG GCAGGCCAAG TGAACAAGCC AATTTTGTTA AAACGCGGGC TAGCGGCAAC GATTGAAGAA TTCATTAATG CGGCAGAGTA CATTATGTCG CAAGGAAACG GTCAAATTAT TCTTTGTGAA CGCGGTATTC GCACATATGA GCGCGCGACA AGAAATACGT TGGATATTTC TGCGGTGCCA ATTTTAAAGA AAGAAACACA CTTGCCTGTA TTGGTTGATG TTACTCATTC AACAGGCCGT CGTGACTTAT TAATTCCTTG TGCGAAAGCA GCGTTAGCAA TTGGCGCGGA TGGAGTAATG GCAGAGGTAC ATCCAGATCC AGCGGTTGCA TTATCGGATT CGGCACAACA AATGGATATT GCTCAATTTA ATGAATTTAT GGAAGAAATA AGAGCGTTCC AGCGGCAAAT GGTAAAAGCA TAA
|
Protein sequence | MSNKRLDELR AKVDEINLQI LKLINERGRL VQEIGKIKET QGTYRYDPVR ERKMLDLISE HNDGPFETST LQHIFKEIFK AGLELQEDDH RKALLVSRKK HPENTIVDVK GEKIGDGNQY FVMGPCAVES YEQVAAVAKA VKKQGLKLLR GGAYKPRTSP YDFQGLGVEG LKILKRIADE FDLAVISEIV TPADIEIALD YIDVIQIGAR NMQNFELLKA AGQVNKPILL KRGLAATIEE FINAAEYIMS QGNGQIILCE RGIRTYERAT RNTLDISAVP ILKKETHLPV LVDVTHSTGR RDLLIPCAKA ALAIGADGVM AEVHPDPAVA LSDSAQQMDI AQFNEFMEEI RAFQRQMVKA
|
| |