Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3839 |
Symbol | tyrA |
ID | 6970161 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3563176 |
End bp | 3564297 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643387622 |
Product | bifunctional chorismate mutase/prephenate dehydrogenase |
Protein accession | YP_002272071 |
Protein GI | 209399005 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0287] Prephenate dehydrogenase [COG1605] Chorismate mutase |
TIGRFAM ID | [TIGR01799] chorismate mutase domain of T-protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00182808 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 65 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTGCTG AATTGACCGC ATTACGCGAT CAAATTGATG AAGTCGATAA AGCGCTGCTG AATTTATTAG CGAAGCGTCT GGAACTGGTT GCTGAAGTGG GCGAGGTGAA AAGCCGCTTT GGACTGCCTA TTTATGTTCC GGAGCGAGAG GCATCTATGT TGGCCTCGCG TCGTGCAGAG GCGGAAGCTC TGGGTGTACC GCCAGATCTG ATTGAGGATG TTTTGCGTCG GGTGATGCGT GAATCTTACT CCAGTGAAAA CGACAAAGGA TTTAAAACGC TTTGTCCTGC GTTACGCCCG GTAGTTATCG TTGGCGGCGG CGGTCAGATG GGACGTCTGT TCGAGAAGAT GCTGACACTC TCGGGTTATC AGGTGCGGAT TCTGGAGCAA CATGACTGGG ATCGAGCGGC TGATATTGTT GCCGATGCCG GAATGGTGAT TGTTAGTGTG CCAATCCACG TTACTGAGCA AGTTATTGGC AAATTACCGC CTTTACCGAA AGATTGTATT CTGGTTGATC TGGCATCAGT GAAAAATGGA CCATTACAGG CCATGCTGGC GGCGCACGAT GGCCCGGTAC TGGGGTTACA CCCAATGTTC GGTCCGGACA GCGGTAGCCT GGCAAAGCAA GTTGTGGTCT GGTGTGATGG ACGTAAACCG GAAGCATACC AATGGTTTCT GGAGCAAATT CAGGTCTGGG GCGCTCGGTT GCATCGTATT AGCGCCGTCG AGCACGATCA GAATATGGCG TTTATTCAGG CACTGCGCCA CTTTGCTACT TTTGCTTACG GGCTGCACCT GGCAGAAGAA AATGTTCAGC TTGAGCAACT TCTGGCGCTC TCTTCGCCGA TTTACCGCCT TGAGCTGGCG ATGGTCGGGC GACTGTTCGC TCAGGATCCG CAGCTTTATG CCGACATTAT TATGTCGTCA GAGCGTAATC TGGCGTTAAT CAAACGTTAC TATAAGCGTT TCGGCGAGGC GATTGAGTTG CTGGAGCAGG GCGATAAGCA GGCGTTTATT GACAGTTTCC GCAAGGTGGA GCACTGGTTC GGCGATTACG CACAGCGTTT TCAGAGTGAA AGCCGCGTGT TATTGCGTCA GGCGAATGAC AATCGCCAGT AA
|
Protein sequence | MVAELTALRD QIDEVDKALL NLLAKRLELV AEVGEVKSRF GLPIYVPERE ASMLASRRAE AEALGVPPDL IEDVLRRVMR ESYSSENDKG FKTLCPALRP VVIVGGGGQM GRLFEKMLTL SGYQVRILEQ HDWDRAADIV ADAGMVIVSV PIHVTEQVIG KLPPLPKDCI LVDLASVKNG PLQAMLAAHD GPVLGLHPMF GPDSGSLAKQ VVVWCDGRKP EAYQWFLEQI QVWGARLHRI SAVEHDQNMA FIQALRHFAT FAYGLHLAEE NVQLEQLLAL SSPIYRLELA MVGRLFAQDP QLYADIIMSS ERNLALIKRY YKRFGEAIEL LEQGDKQAFI DSFRKVEHWF GDYAQRFQSE SRVLLRQAND NRQ
|
| |