Gene ECH74115_3838 details


Gene Information

Locus tag: ECH74115_3838
Symbol: pheA
ID: 6970980
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3561973
End bp: 3563133
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 49%
IMG OID: 643387621
Product: bifunctional chorismate mutase/prephenate dehydratase
Protein accession: YP_002272070
Protein GI: 209399059
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0077] Prephenate dehydratase
        [COG1605] Chorismate mutase
TIGRFAM ID: [TIGR01797] chorismate mutase domain of proteobacterial P-protein, clade 1


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000574807
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 61
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACATCGG AAAACCCGTT ACTGGCGCTG CGAGAGAAAA TCAGCGCGCT GGATGAAAAA 
TTATTAGCAT TACTGGCAGA GCGGCGCGAA CTGGCCGTCG AGGTGGGAAA AGCCAAACTG
CTCTCGCATC GCCCGGTACG TGATATTGAT CGTGAACGCG ATTTACTGGA AAGATTAATT
ACGCTCGGTA AAGCGCACCA TCTGGACGCC CATTACATTA CTCGCCTGTT CCAGCTCATC
ATTGAAGATT CCGTATTAAC TCAGCAGGCT TTGCTCCAAC AACATCTCAA TAAAATTAAT
CCGCACTCAG CACGCATCGC TTTTCTCGGC CCCAAAGGTT CTTATTCCCA TCTTGCGGCG
CGCCAGTATG CTGCCCGTCA CTTTGAGCAA TTCATTGAAA GTGGCTGCGC CAAATTTGCC
GATATTTTTA ATCAGGTGGA AACCGGCCAG GCCGACTATG CCGTCGTACC GATTGAAAAT
ACCAGCTCCG GTGCCATAAA TGACGTTTAC GATCTGCTGC AACATACCAG CTTGTCGATT
GTTGGCGAGA TGACGTTAAC TATCGACCAT TGTTTGTTAG TCTCCGGCAC GACTGATTTA
TCCACCATCA ATACGGTCTA CAGCCATCCG CAGCCATTCC AGCAATGCAG CAAATTCCTT
AATCGTTATC CGCACTGGAA GATTGAATAT ACCGAAAGTA CGTCTGCGGC AATGGAAAAG
GTTGCACAGG CAAAATCACC GCATGTTGCT GCGTTAGGAA GCGAAGCTGG CGGCACTTTG
TACGGTTTGC AGGTACTGGA GCGGATTGAA GCGAATCAGC GACAAAACTT CACCCGATTT
GTGGTGTTGG CGCGTAAAGC CATTAACGTG TCTGATCAGG TTCCGGCGAA AACGACGTTG
TTAATGGCGA CCGGGCAACA AGCCGGTGCG CTGGTTGAAG CGTTGCTGGT ACTGCGCAAC
CACAATCTGA TTATGACCCG TCTGGAATCA CGCCCGATTC ACGGTAATCC ATGGGAAGAG
ATGTTTTATC TGGATATTCA GGCCAATCTT GAATCAGCGG AAATGCAAAA AGCATTGAAA
GAGTTAGGGG AAATCACCCG TTCAATGAAG GTATTGGGCT GTTACCCTAG TGAGAACGTA
GTGCCTGTTG ATCCAACCTG A
 
Protein sequence
MTSENPLLAL REKISALDEK LLALLAERRE LAVEVGKAKL LSHRPVRDID RERDLLERLI 
TLGKAHHLDA HYITRLFQLI IEDSVLTQQA LLQQHLNKIN PHSARIAFLG PKGSYSHLAA
RQYAARHFEQ FIESGCAKFA DIFNQVETGQ ADYAVVPIEN TSSGAINDVY DLLQHTSLSI
VGEMTLTIDH CLLVSGTTDL STINTVYSHP QPFQQCSKFL NRYPHWKIEY TESTSAAMEK
VAQAKSPHVA ALGSEAGGTL YGLQVLERIE ANQRQNFTRF VVLARKAINV SDQVPAKTTL
LMATGQQAGA LVEALLVLRN HNLIMTRLES RPIHGNPWEE MFYLDIQANL ESAEMQKALK
ELGEITRSMK VLGCYPSENV VPVDPT
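
Consistency check (illustrative)

The record does not say how its length fields were derived. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the stop codon; under those assumptions the listed 1161 bp and 386 aa agree with the coordinates. The helper names (gene_length, protein_length, translate, gc_fraction) are hypothetical and not part of any database API. The codon table uses the amino acid assignments of translation table 11, which match the standard genetic code (the two tables differ only in their permitted start codons).

```python
from itertools import product

# Codon table built in the usual T, C, A, G ordering.
# Amino acid assignments of translation table 11 (same as the standard code).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Length in aa, assuming the gene length includes one stop codon."""
    return gene_len_bp // 3 - 1


def clean(dna):
    """Remove the spaces and line breaks used to group the sequence in tens."""
    return "".join(dna.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    length = gene_length(3561973, 3563133)
    print(length, protein_length(length))
    # First 30 bases of the gene sequence as listed in the record.
    print(translate("ATGACATCGG AAAACCCGTT ACTGGCGCTG"))
```

Passing the full gene sequence to translate and gc_fraction would allow a comparison against the listed protein sequence and the GC content field.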