Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0777 |
Symbol | aroG |
ID | 6142849 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 777652 |
End bp | 778704 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641615665 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_001742857 |
Protein GI | 170684235 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0582465 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTATC AGAACGACGA TTTACGCATC AAAGAAATCA AAGAGTTACT TCCTCCTGTC GCATTGCTGG AAAAATTCCC CGCTACTGAA AATGCCGCGA ATACGGTCGC CCATGCCCGA AAAGCGATCC ATAAGATCCT GAAAGGTAAT GATGATCGCC TGTTGGTGGT GATTGGCCCA TGCTCAATCC ATGATCCTGT CGCAGCAAAA GAGTATGCCA CTCGCTTGCT GGCGCTGCGT GAAGAGCTGA AAGATGAGCT GGAAATCGTA ATGCGCGTCT ATTTTGAAAA GCCGCGTACA ACGGTGGGCT GGAAAGGGCT GATTAACGAT CCGCATATGG ATAACAGCTT CCAGATCAAC GACGGTCTGC GTATTGCCCG CAAATTGCTG CTCGATATTA ACGACAGCGG TCTGCCAGCG GCGGGTGAAT TCCTGGATAT GATCACCCCA CAATATCTCG CTGACCTGAT GAGCTGGGGC GCAATTGGCG CACGTACTAC GGAATCGCAG GTGCACCGCG AACTGGCGTC TGGTCTTTCT TGCCCGGTAG GCTTCAAAAA TGGCACTGAT GGTACGATTA AAGTGGCTAT CGATGCTATT AATGCCGCCG GTGCGCCGCA CTGCTTCCTG TCCGTAACGA AATGGGGGCA TTCGGCGATT GTGAATACCA GCGGTAACGG CGATTGCCAT ATCATTCTGC GCGGCGGTAA AGAGCCTAAC TACAGCGCGA AGCACGTTGC TGAAGTGAAA GAAGGGCTGA ACAAAGCAGG CCTGCCAGCA CAGGTGATGA TCGATTTCAG CCATGCTAAC TCGTCAAAAC AATTCAAAAA GCAGATGGAT GTTTGTGCTG ACGTTTGCCA GCAGATTGCC GGTGGCGAAA AGGCTATTAT TGGCGTGATG GTGGAAAGCC ATCTGGTGGA AGGCAATCAG AGCCTGGAGA GCGGGGAACC GCTGGCTTAT GGCAAGAGCA TCACCGATGC CTGCATTGGC TGGGATGATA CCGATGCTCT GTTACGTCAA CTGGCGAATG CAGTAAAAGC GCGTCGCGGG TAA
|
Protein sequence | MNYQNDDLRI KEIKELLPPV ALLEKFPATE NAANTVAHAR KAIHKILKGN DDRLLVVIGP CSIHDPVAAK EYATRLLALR EELKDELEIV MRVYFEKPRT TVGWKGLIND PHMDNSFQIN DGLRIARKLL LDINDSGLPA AGEFLDMITP QYLADLMSWG AIGARTTESQ VHRELASGLS CPVGFKNGTD GTIKVAIDAI NAAGAPHCFL SVTKWGHSAI VNTSGNGDCH IILRGGKEPN YSAKHVAEVK EGLNKAGLPA QVMIDFSHAN SSKQFKKQMD VCADVCQQIA GGEKAIIGVM VESHLVEGNQ SLESGEPLAY GKSITDACIG WDDTDALLRQ LANAVKARRG
|
| |