Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_0781 |
Symbol | aroG |
ID | 5586458 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 805097 |
End bp | 806149 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640924493 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_001461908 |
Protein GI | 157154975 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.00182809 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTATC AGAACGACGA TTTACGCATC AAAGAAATCA AAGAGTTACT TCCTCCTGTC ACATTGCTGG AAAAATTCCC CGCTACTGAA AATGCCGCGA ATACGGTTGC CCATGCCCGA AAAGCGATCC ATAAGATCCT GAAAGGTAAT GATGATCGCC TGTTGGTTGT GATTGGCCCA TGCTCAATTC ATGATCCTGT CGCGGCAAAA GAGTATGCCA CTCGCTTGCT GGCGCTGCGT GAAGAGCTGA AAGATGAGCT GGAAATCGTA ATGCGCGTCT ATTTTGAAAA GCCGCGTACC ACGGTGGGCT GGAAAGGGCT GATTAACGAT CCGCATATGG ATAATAGCTT CCAGATCAAC GACGGTCTGC GTATAGCCCG TAAATTGCTG CTTGATATTA ACGACAGCGG TCTGCCAGCG GCAGGTGAGT TTCTCGATAT GATCACCCCA CAATATCTCG CTGACCTGAT GAGCTGGGGC GCAATTGGCG CACGTACCAC CGAATCGCAG GTGCACCGCG AACTGGCATC AGGGCTTTCT TGTCCGGTCG GCTTCAAAAA TGGCACCGAC GGTACGATTA AAGTGGCTAT CGATGCCATT AATGCCGCCG GTGCGCCGCA CTGCTTCCTG TCCGTAACGA AATGGGGGCA TTCGGCGATT GTGAATACCA GCGGTAACGG CGATTGCCAT ATCATTCTGC GCGGCGGTAA AGAGCCTAAC TACAGCGCGA AGCACGTTGC TGAAGTGAAA GAAGGGCTGA ACAAAGCAGG CCTGCCAGCA CAGGTGATGA TCGATTTCAG CCATGCTAAC TCGTCCAAAC AATTCAAAAA GCAGATGGAT GTTTGTGCTG ACGTTTGCCA GCAGATTGCC GGTGGCGAAA AGGCCATTAT TGGCGTGATG GTGGAAAGCC ATCTGGTGGA AGGCAATCAG AGCCTCGAGA GCGGGGAGCC GCTGGCCTAC GGTAAGAGCA TCACCGATGC CTGCATCGGC TGGGAAGATA CCGATGCTCT GTTACGTCAA CTGGCGAATG CAGTAAAAGC GCGTCGCGGG TAA
|
Protein sequence | MNYQNDDLRI KEIKELLPPV TLLEKFPATE NAANTVAHAR KAIHKILKGN DDRLLVVIGP CSIHDPVAAK EYATRLLALR EELKDELEIV MRVYFEKPRT TVGWKGLIND PHMDNSFQIN DGLRIARKLL LDINDSGLPA AGEFLDMITP QYLADLMSWG AIGARTTESQ VHRELASGLS CPVGFKNGTD GTIKVAIDAI NAAGAPHCFL SVTKWGHSAI VNTSGNGDCH IILRGGKEPN YSAKHVAEVK EGLNKAGLPA QVMIDFSHAN SSKQFKKQMD VCADVCQQIA GGEKAIIGVM VESHLVEGNQ SLESGEPLAY GKSITDACIG WEDTDALLRQ LANAVKARRG
|
| |