Gene Information |
Locus tag | ECH74115_0856 |
Symbol | aroG |
ID | 6966646 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 872858 |
End bp | 873910 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643384881 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_002269381 |
Protein GI | 209400570 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
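The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that Gene Length includes a single terminal stop codon; neither convention is stated in the record, although both are consistent with the listed values. The helper names span_length and coding_codons are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 872858
END_BP = 873910
GENE_LENGTH_BP = 1053
PROTEIN_LENGTH_AA = 350


def span_length(start, end):
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons, assuming one stop codon ends the CDS."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = coding_codons(computed_length)

print("gene length matches:", computed_length == GENE_LENGTH_BP)
print("protein length matches:", computed_protein == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold: the interval spans 1053 bp, which is 351 codons, or 350 amino acids plus the stop codon.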
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0352016 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 0.842103 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTATC AGAACGACGA TTTACGCATC AAAGAAATCA AAGAGTTACT TCCTCCTGTC GCATTGCTGG AAAAATTCCC CGCTACTGAA AATGCCGCGA ATACGGTTGC CCATGCCCGA AAAGCGATCC ATAAGATCCT GAAAGGTAAT GATGATCGCC TGTTGGTGGT GATTGGCCCG TGCTCAATTC ATGATCCTGT CGCGGCTAAA GAGTATGCCA CTCGCTTGCT GGCGCTGCGT GAAGAGCTGA AAGATGAGCT GGAAATCGTA ATGCGCGTCT ATTTTGAAAA GCCGCGTACT ACGGTGGGCT GGAAAGGGCT GATTAACGAT CCGCATATGG ATAACAGCTT CCAGATCAAC GACGGTCTGC GTATAGCCCG TAAATTGCTG CTTGATATTA ACGACAGCGG TCTGCCAGCG GCGGGTGAAT TCCTCGATAT GATCACTCCT CAGTATCTCG CTGACCTGAT GAGCTGGGGC GCAATTGGTG CACGTACCAC GGAATCGCAG GTGCACCGCG AACTGGCGTC TGGTCTTTCT TGTCCGGTAG GTTTTAAAAA TGGCACAGAC GGTACGATTA AAGTGGCTAT CGATGCCATT AATGCCGCCG GTGCGCCGCA CTGCTTCCTG TCCGTAACTA AATGGGGGCA TTCGGCGATT GTGAATACCA GCGGTAACGG CGATTGCCAT ATCATTCTGC GCGGCGGTAA AGAGCCTAAC TACAGCGCGA AGCACGTTGC TGAAGTGAAA GAAGGGCTGA ACAAAGCAGG TCTGCCAGCT CAGGTGATGA TCGATTTCAG CCATGCTAAT TCGTCCAAAC AATTCAAAAA GCAGATGGAT GTTTGTGCTG ACGTTTGCCA GCAGATTGCC GGTGGCGAAA AGGCCATTAT CGGTGTGATG GTGGAAAGTC ATCTGGTGGA AGGCAATCAG AGCCTGGAGA GCGGGGAGCC GCTGGCCTAT GGTAAGAGCA TCACCGATGC CTGCATTGGC TGGGAAGATA CCGATGCTCT GTTACGTCAA CTGGCGAACG CAGTGAAAGC GCGTCGCGGG TAA
|
Protein sequence | MNYQNDDLRI KEIKELLPPV ALLEKFPATE NAANTVAHAR KAIHKILKGN DDRLLVVIGP CSIHDPVAAK EYATRLLALR EELKDELEIV MRVYFEKPRT TVGWKGLIND PHMDNSFQIN DGLRIARKLL LDINDSGLPA AGEFLDMITP QYLADLMSWG AIGARTTESQ VHRELASGLS CPVGFKNGTD GTIKVAIDAI NAAGAPHCFL SVTKWGHSAI VNTSGNGDCH IILRGGKEPN YSAKHVAEVK EGLNKAGLPA QVMIDFSHAN SSKQFKKQMD VCADVCQQIA GGEKAIIGVM VESHLVEGNQ SLESGEPLAY GKSITDACIG WEDTDALLRQ LANAVKARRG
|
| |
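The gene and protein sequences above can be compared by translating the nucleotide sequence codon by codon. The sketch below is an illustration only, not the method used to produce this record. It builds the standard codon-to-amino-acid mapping, which translation table 11 shares for internal codons (table 11 differs mainly in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here). The names CODON_TABLE, clean and translate are hypothetical, and only the first 30 and last 15 nucleotides of the gene are used to keep the example short.

```python
from itertools import product

# Standard codon assignments, listed in T, C, A, G order with the
# third base varying fastest. '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate an in-frame DNA string into one-letter amino acids."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 and last 15 nucleotides, copied from the Gene sequence row.
gene_start = "ATGAATTATC AGAACGACGA TTTACGCATC"
gene_end = "GCGCGTCGCGGG TAA"

print(translate(gene_start))  # compare with the first protein block
print(translate(gene_end))    # final residues followed by the stop codon
```

The first fragment translates to MNYQNDDLRI, which is the first block of the protein sequence, and the last fragment translates to ARRG followed by a stop, which matches the end of the protein. Passing the full gene sequence to translate gives the complete comparison.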