Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1896 |
Symbol | trpE |
ID | 6968646 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 1789418 |
End bp | 1790980 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643385830 |
Product | anthranilate synthase component I |
Protein accession | YP_002270319 |
Protein GI | 209397030 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00565] anthranilate synthase component I, proteobacterial subset |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.000481284 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAAACAC AAAAACCGAC TCTCGAACTG CTAACCTGCG AAGGCGCTTA TCGCGACAAT CCCACCGCGC TTTTTCACCA ATTGTGTGGG GATCGTCCGG CAACGCTGCT GCTGGAATCC GCAGATATCG ACAGCAAAGA TGATTTAAAA AGCCTGCTGC TGGTAGACAG TGCGCTGCGC ATTACAGCTT TAGGTGACAC TGTCACAATC CAGTCCCTTT CCGGCAACGG CGAAGCCCTG CTGACACTAC TGGATAACGC CCTGCCTGCG GGTGTGGAAA ATGAACAATT ACCAAACTGC CGTGTGCTGC GCTTCCCCCC TGTTGGTCCA CTGCTGGATG AAGACGCTCG CTTATGCTCC CTTTCGGTTT TTGACGCTTT CCGCTTATTG CAGAATCTGT TGAATGTACC GAAGGAAGAA CGAGAAGCCA TGTTCTTCGG CGGCCTGTTC TCTTATGACC TTGTGGCGGG GTTTGAAGAT TTACCGCAAC TGTCAGCGGA AAATAACTGC CCTGATTTCT GTTTTTATCT CGCTGAAACT CTGATGGTGA TTGACCATCA GAAAAAAAGC ACCCGCATTC AGGCCAGCCT GTTTGCTCCG AATGAAGAAG AAAAACAACG TCTCACTGCT CGCCTGAACG ATCTTCGCCA GCAACTGACC GAAACCGCGC CACCGCTGCC GGTGGTTTCC GTGCCGCATA TGCGTTGTGA ATGTAACCAG AGCGATGAAG AGTTCGGTGG CGTAGTGCGT TTGTTGCAAA AAGCGATTCG CGCCGGAGAA ATTTTCCAGG TGGTGCCATC TCGCCGTTTC TCTCTGCCCT GCCCGTCACC GCTGGCGGCC TATTACGTGC TGAAAAAGAG TAATCCCAGC CCGTACATGT TTTTTATGCA GGATAATGAT TTCACCCTGT TTGGCGCGTC GCCGGAAAGT TCGCTCAAGT ATGACGCCAC CAGCCGCCAG ATTGAGATCT ACCCGATTGC CGGAACACGT CCACGCGGTC GTCGCGCCGA TGGCTCGCTC GACAGGGACC TTGAAAGCCG TATTGAACTG GAAATGCGTA CCGATCATAA AGAGCTTTCT GAACATCTGA TGTTGGTGGA TCTCGCCCGT AATGACCTGG CACGCATTTG CACCCCCGGC AGCCGCTACG TCGCCGACCT CACCAAAGTT GACCGTTACT CTTACGTGAT GCACCTGGTC TCCCGCGTGG TCGGTGAGCT GCGCCACGAT CTCGACGCCC TGCACGCTTA CCGCGCCTGT ATGAATATGG GGACGTTAAG CGGTGCGCCG AAAGTACGCG CTATGCAGTT AATTGCCGAG GCTGAAGGTC GTCGCCGCGG CAGCTACGGC GGCGCGGTAG GTTATTTCAC CGCGCACGGC GATCTCGACA CCTGCATTGT GATCCGCTCG GCGCTGGTGG AAAACGGTAT CGCTACCGTG CAAGCCGGTG CTGGCGTAGT CCTTGATTCT GTTCCGCAGT CGGAAGCCGA CGAAACCCGT AATAAAGCCC GCGCTGTACT GCGCGCTATT GCCACCGCGC ATCATGCACA GGAGACGTTC TAA
|
Protein sequence | MQTQKPTLEL LTCEGAYRDN PTALFHQLCG DRPATLLLES ADIDSKDDLK SLLLVDSALR ITALGDTVTI QSLSGNGEAL LTLLDNALPA GVENEQLPNC RVLRFPPVGP LLDEDARLCS LSVFDAFRLL QNLLNVPKEE REAMFFGGLF SYDLVAGFED LPQLSAENNC PDFCFYLAET LMVIDHQKKS TRIQASLFAP NEEEKQRLTA RLNDLRQQLT ETAPPLPVVS VPHMRCECNQ SDEEFGGVVR LLQKAIRAGE IFQVVPSRRF SLPCPSPLAA YYVLKKSNPS PYMFFMQDND FTLFGASPES SLKYDATSRQ IEIYPIAGTR PRGRRADGSL DRDLESRIEL EMRTDHKELS EHLMLVDLAR NDLARICTPG SRYVADLTKV DRYSYVMHLV SRVVGELRHD LDALHAYRAC MNMGTLSGAP KVRAMQLIAE AEGRRRGSYG GAVGYFTAHG DLDTCIVIRS ALVENGIATV QAGAGVVLDS VPQSEADETR NKARAVLRAI ATAHHAQETF
|
| |