Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3470 |
Symbol | aroC |
ID | 6966953 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3214203 |
End bp | 3215288 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643387276 |
Product | chorismate synthase |
Protein accession | YP_002271739 |
Protein GI | 209400995 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGGAA ACACAATTGG ACAACTCTTT CGCGTAACCA CCTTCGGCGA ATCGCACGGG CTGGCGCTCG GCTGCATCGT CGATGGTGTT CCGCCAGGCA TTCCGCTGAC GGAAGCGGAC CTGCAACATG ACCTCGACCG TCGTCGCCCT GGGACATCGC GCTATACCAC CCAGCGTCGT GAGCCAGATC AGGTCAAAAT TCTCTCCGGT GTTTTTGAAG GTGTTACTAC CGGCACCAGC ATTGGTTTGT TGATCGAAAA TACCGATCAG CGTTCTCAGG ATTATAGTGC GATTAAGGAC GTTTTCCGTC CTGGCCATGC CGATTACACC TACGAACAAA AATACGGTCT GCGCGATTAT CGCGGCGGTG GACGTTCTTC CGCCCGTGAA ACCGCCATGC GCGTTGCAGC AGGGGCGATT GCCAAAAAAT ATCTCGCTGA GAAATTTGGC ATCGAAATTC GCGGCTGCCT GACCCAGATG GGCGACATTC CGCTGGAAAT CAAAGACTGG TCGCAGGTCG AGCAAAATCC ATTCTTCTGC CCGGACCCGG ACAAAATCGA CGCGTTAGAT GAACTGATGC GCGCGCTGAA AAAAGAGGGC GACTCTATTG GCGCGAAAGT CACCGTTGTT GCCAGTGGCG TTCCTGCCGG GCTTGGTGAG CCGGTCTTTG ATCGCCTGGA TGCCGACATC GCCCATGCGC TAATGAGCAT TAACGCGGTG AAAGGCGTGG AAATTGGTGA TGGTTTTGAC GTGGTGGCGC TGCGTGGCAG CCAGAACCGC GACGAAATCA CCAAAGACGG TTTCCAGAGT AACCATGCGG GCGGCATTCT CGGCGGAATT AGCAGCGGGC AGCAAATCAT TGCTCATATG GCACTGAAAC CGACCTCCAG TATTACCGTG CCGGGGCGCA CCATTAACCG CTTTGGCGAA GAAGTTGAGA TGATCACCAA AGGCCGTCAC GATCCCTGTG TCGGGATCCG CGCGGTGCCG ATAGCAGAAG CGATGCTGGC GATCGTTTTA ATGGATCACC TGTTACGGCA GCGGGCGCAA AATGCCGATG TGAAGACTGA TATTCCACGC TGGTAA
|
Protein sequence | MAGNTIGQLF RVTTFGESHG LALGCIVDGV PPGIPLTEAD LQHDLDRRRP GTSRYTTQRR EPDQVKILSG VFEGVTTGTS IGLLIENTDQ RSQDYSAIKD VFRPGHADYT YEQKYGLRDY RGGGRSSARE TAMRVAAGAI AKKYLAEKFG IEIRGCLTQM GDIPLEIKDW SQVEQNPFFC PDPDKIDALD ELMRALKKEG DSIGAKVTVV ASGVPAGLGE PVFDRLDADI AHALMSINAV KGVEIGDGFD VVALRGSQNR DEITKDGFQS NHAGGILGGI SSGQQIIAHM ALKPTSSITV PGRTINRFGE EVEMITKGRH DPCVGIRAVP IAEAMLAIVL MDHLLRQRAQ NADVKTDIPR W
|
| |