Gene ECH74115_3839 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3839 
SymboltyrA 
ID6970161 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3563176 
End bp3564297 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content52% 
IMG OID643387622 
Productbifunctional chorismate mutase/prephenate dehydrogenase 
Protein accessionYP_002272071 
Protein GI209399005 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0287] Prephenate dehydrogenase
[COG1605] Chorismate mutase 
TIGRFAM ID[TIGR01799] chorismate mutase domain of T-protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00182808 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTGCTG AATTGACCGC ATTACGCGAT CAAATTGATG AAGTCGATAA AGCGCTGCTG 
AATTTATTAG CGAAGCGTCT GGAACTGGTT GCTGAAGTGG GCGAGGTGAA AAGCCGCTTT
GGACTGCCTA TTTATGTTCC GGAGCGAGAG GCATCTATGT TGGCCTCGCG TCGTGCAGAG
GCGGAAGCTC TGGGTGTACC GCCAGATCTG ATTGAGGATG TTTTGCGTCG GGTGATGCGT
GAATCTTACT CCAGTGAAAA CGACAAAGGA TTTAAAACGC TTTGTCCTGC GTTACGCCCG
GTAGTTATCG TTGGCGGCGG CGGTCAGATG GGACGTCTGT TCGAGAAGAT GCTGACACTC
TCGGGTTATC AGGTGCGGAT TCTGGAGCAA CATGACTGGG ATCGAGCGGC TGATATTGTT
GCCGATGCCG GAATGGTGAT TGTTAGTGTG CCAATCCACG TTACTGAGCA AGTTATTGGC
AAATTACCGC CTTTACCGAA AGATTGTATT CTGGTTGATC TGGCATCAGT GAAAAATGGA
CCATTACAGG CCATGCTGGC GGCGCACGAT GGCCCGGTAC TGGGGTTACA CCCAATGTTC
GGTCCGGACA GCGGTAGCCT GGCAAAGCAA GTTGTGGTCT GGTGTGATGG ACGTAAACCG
GAAGCATACC AATGGTTTCT GGAGCAAATT CAGGTCTGGG GCGCTCGGTT GCATCGTATT
AGCGCCGTCG AGCACGATCA GAATATGGCG TTTATTCAGG CACTGCGCCA CTTTGCTACT
TTTGCTTACG GGCTGCACCT GGCAGAAGAA AATGTTCAGC TTGAGCAACT TCTGGCGCTC
TCTTCGCCGA TTTACCGCCT TGAGCTGGCG ATGGTCGGGC GACTGTTCGC TCAGGATCCG
CAGCTTTATG CCGACATTAT TATGTCGTCA GAGCGTAATC TGGCGTTAAT CAAACGTTAC
TATAAGCGTT TCGGCGAGGC GATTGAGTTG CTGGAGCAGG GCGATAAGCA GGCGTTTATT
GACAGTTTCC GCAAGGTGGA GCACTGGTTC GGCGATTACG CACAGCGTTT TCAGAGTGAA
AGCCGCGTGT TATTGCGTCA GGCGAATGAC AATCGCCAGT AA
 
Protein sequence
MVAELTALRD QIDEVDKALL NLLAKRLELV AEVGEVKSRF GLPIYVPERE ASMLASRRAE 
AEALGVPPDL IEDVLRRVMR ESYSSENDKG FKTLCPALRP VVIVGGGGQM GRLFEKMLTL
SGYQVRILEQ HDWDRAADIV ADAGMVIVSV PIHVTEQVIG KLPPLPKDCI LVDLASVKNG
PLQAMLAAHD GPVLGLHPMF GPDSGSLAKQ VVVWCDGRKP EAYQWFLEQI QVWGARLHRI
SAVEHDQNMA FIQALRHFAT FAYGLHLAEE NVQLEQLLAL SSPIYRLELA MVGRLFAQDP
QLYADIIMSS ERNLALIKRY YKRFGEAIEL LEQGDKQAFI DSFRKVEHWF GDYAQRFQSE
SRVLLRQAND NRQ