Gene EcSMS35_2752 details

Gene Information

Locus tag: EcSMS35_2752
Symbol: tyrA
ID: 6145812
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2833917
End bp: 2835038
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 52%
IMG OID: 641617622
Product: bifunctional chorismate mutase/prephenate dehydrogenase
Protein accession: YP_001744783
Protein GI: 170679824
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0287] Prephenate dehydrogenase; [COG1605] Chorismate mutase
TIGRFAM ID: [TIGR01799] chorismate mutase domain of T-protein
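
The coordinate and length fields above are mutually consistent, which a little arithmetic can confirm. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. The record states neither convention, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2833917
end_bp = 2835038

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the script reproduces the listed gene length of 1122 bp and protein length of 373 aa.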


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000708785
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.0483395
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTGCTG AATTGACCGC ATTACGCGAT CAAATTGATG AAGTCGATAA AGCGCTGCTG 
AATTTATTAG CGAAGCGTCT GGAACTGGTT GCTGAAGTTG GCGAGGTGAA AAGCCGCTTT
GGACTGCCTA TTTATGTTCC GGAGCGCGAG GCATCTATGC TGGCCTCGCG GCGTGCAGAG
GCGGAAGCTC TGGGTGTACC GCCAGATCTG ATTGAGGATG TTTTGCGTCG GGTGATGCGT
GAATCTTACT CCAGTGAAAA CGACAAAGGA TTTAAAACGC TTTGTCCGTC ACTGCGTCCG
GTGGTTATCG TCGGCGGTGG CGGTCAGATG GGACGTCTGT TCGAGAAGAT GCTGACACTA
TCGGGTTATC AGGTGCGGAT TCTGGAGCAA CATGACTGGG ATCGCGCGGC TGATATTGTT
GCCGATGCCG GAATGGTGAT TGTTAGTGTG CCAATCCACG TTACTGAGCA AGTTATCGGC
AAATTACCGC CTTTACCGAA AGATTGTATT CTGGTCGATC TGGCATCGGT GAAAAATGGA
CCATTACAGG CCATGCTGGC GGCGCATGAT GGTCCGGTGC TGGGGTTACA CCCGATGTTC
GGCCCGGACA GCGGTAGCCT GGCAAAGCAA GTTGTGGTCT GGTGTGATGG ACGTAAGCCG
GAAGCATACC AATGGTTTCT GGAGCAAATT CAGGTCTGGG GCGCTCGGCT GCATCGTATT
AGCGCTGTCG AGCACGATCA GAATATGGCG TTTATTCAGG CTCTGCGCCA CTTTGCTACT
TTTGCTTATG GGCTGCACCT GGCAGAAGAA AATGTTCAGC TTGAGCAACT TCTGGCGCTT
TCTTCGCCGA TTTACCGCCT TGAGCTGGCG ATGGTTGGGC GACTGTTTGC CCAGGATCCG
CAACTGTATG CCGACATTAT TATGTCGTCA GAGCGTAATC TGGCGTTAAT CAAACGTTAC
TATAAGCGTT TCGGCGAGGC GATTGAGTTG CTGGAGCAGG GCGATAAGCA GGCGTTTATT
GACAGTTTCC GCAAGGTGGA GCACTGGTTC GGCGATTACG CACAGCGTTT TCAGAGTGAA
AGCCGCGTGT TATTGCGTCA GGCGAATGAC AATCGCCAGT AA
 
Protein sequence
MVAELTALRD QIDEVDKALL NLLAKRLELV AEVGEVKSRF GLPIYVPERE ASMLASRRAE 
AEALGVPPDL IEDVLRRVMR ESYSSENDKG FKTLCPSLRP VVIVGGGGQM GRLFEKMLTL
SGYQVRILEQ HDWDRAADIV ADAGMVIVSV PIHVTEQVIG KLPPLPKDCI LVDLASVKNG
PLQAMLAAHD GPVLGLHPMF GPDSGSLAKQ VVVWCDGRKP EAYQWFLEQI QVWGARLHRI
SAVEHDQNMA FIQALRHFAT FAYGLHLAEE NVQLEQLLAL SSPIYRLELA MVGRLFAQDP
QLYADIIMSS ERNLALIKRY YKRFGEAIEL LEQGDKQAFI DSFRKVEHWF GDYAQRFQSE
SRVLLRQAND NRQ
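
The following is a minimal sketch of how the protein sequence relates to the gene sequence under translation table 11, and of how a GC percentage such as the one listed in the record could be computed. The function names `clean`, `translate`, and `gc_percent` are illustrative and not part of any database tooling. The codon table is written out by hand using the amino acid assignments of the standard code, which table 11 shares; the two tables differ in their permitted start codons, which this sketch ignores.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering
# (third base varies fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence above.
first_block = "ATGGTTGCTG AATTGACCGC ATTACGCGAT"
print(translate(first_block))  # MVAELTALRD
```

The printed residues match the first ten residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_percent` would allow the listed protein sequence and GC content to be checked in the same way.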