Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_3859 |
Symbol | aroB |
ID | 5587826 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 3832076 |
End bp | 3833164 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640927482 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_001464843 |
Protein GI | 157158844 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00000000380736 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGAGGA TTGTCGTTAC TCTCGGGGAA CGTAGTTACC CAATTACCAT CGCATCTGGT TTGTTTAATG AACCAGCTTC ATTCTTACCG CTGAAATCGG GCGAGCAGGT CATGTTGGTC ACCAACGAAA CCCTGGCTCC TCTGTATCTC GATAAGGTCC GCGGCGTACT TGAACAGGCG GGTGTTAACG TCGATAGCGT TATCCTCCCT GACGGCGAGC AGTATAAAAG CCTGGCTGTA CTCGATACCG TCTTTACGGC GTTGTTACAA AAGCCGCATG GTCGCGATAC TACGCTGGTG GCGCTTGGCG GCGGCGTAGT GGGCGATCTG ACCGGCTTCG CGGCGGCGAG TTATCAGCGC GGTGTTCGTT TCATTCAAGT CCCGACGACG TTACTGTCGC AGGTCGATTC CTCCGTTGGC GGCAAAACTG CGGTCAACCA TCCCCTCGGT AAAAACATGA TTGGCGCGTT CTACCAGCCT GCTTCAGTGG TGGTGGATCT CGACTGTCTG AAAACGCTTC CCCCGCGTGA GTTAGCGTCG GGGCTGGCAG AAGTCATCAA ATACGGCATT ATTCTTGACG GTGCGTTTTT TAACTGGCTG GAAGAGAATC TGGATGCGTT GTTGCGTCTG GACGGTCCGG CAATGGCGTA CTGTATTCGC CGTTGTTGTG AACTGAAGGC AGAAGTTGTC GCCGCCGACG AGCGCGAAAC CGGGTTACGT GCTTTACTGA ATCTGGGACA CACCTTTGGT CATGCCATTG AAGCTGAAAT GGGGTATGGC AATTGGTTAC ATGGTGAAGC GGTCGCTGCG GGTATGGTGA TGGCGGCGCG GACGTCGGAA CGTCTCGGGC AGTTTAGTTC TGCCGAAACG CAGCGTATTA TAACCCTGCT CAAGCGGGCT GGGTTACCGG TCAATGGGCC GCGCGAAATG TCCGCGCAGG CGTATTTACC GCATATGCTG CGTGACAAGA AAGTCCTTGC GGGAGAGATA CGCTTAATTC TTCCGTTGGC AATTGGTAAG AGTGAAGTTC GCAGCGGCGT TTCGCACGAG CTTGTTCTTA ACGCCATTGC CGATTGTCAA TCAGCGTAA
|
Protein sequence | MERIVVTLGE RSYPITIASG LFNEPASFLP LKSGEQVMLV TNETLAPLYL DKVRGVLEQA GVNVDSVILP DGEQYKSLAV LDTVFTALLQ KPHGRDTTLV ALGGGVVGDL TGFAAASYQR GVRFIQVPTT LLSQVDSSVG GKTAVNHPLG KNMIGAFYQP ASVVVDLDCL KTLPPRELAS GLAEVIKYGI ILDGAFFNWL EENLDALLRL DGPAMAYCIR RCCELKAEVV AADERETGLR ALLNLGHTFG HAIEAEMGYG NWLHGEAVAA GMVMAARTSE RLGQFSSAET QRIITLLKRA GLPVNGPREM SAQAYLPHML RDKKVLAGEI RLILPLAIGK SEVRSGVSHE LVLNAIADCQ SA
|
| |