Gene EcSMS35_2751 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2751 
SymbolpheA 
ID6146584 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2832714 
End bp2833874 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content49% 
IMG OID641617621 
Productbifunctional chorismate mutase/prephenate dehydratase 
Protein accessionYP_001744782 
Protein GI170681627 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0077] Prephenate dehydratase
[COG1605] Chorismate mutase 
TIGRFAM ID[TIGR01797] chorismate mutase domain of proteobacterial P-protein, clade 1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000608431 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.0711319 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATCGG AAAACCCGTT ACTGGCGCTG CGAGAGAAAA TCAGCGCGCT GGATGAAAAA 
TTATTAGCAT TACTGGCAGA GCGACGCGAA CTGGCCGTCG AGGTGGGAAA AGCCAAACTA
CTCTCGCATC GCCCGGTACG TGATATTGAT CGTGAACGCG ATTTGCTGGA AAGATTAATT
ACGCTCGGTA AAGCGCACCA TCTGGACGCC CATTACATTA CTCGCCTGTT CCAGCTCATC
ATTGAAGATT CCGTATTAAC TCAGCAGGCT TTGCTCCAGC AACATCTCAA TAAAATTAAT
CCGCACTCAG CACGCATCGC TTTTCTCGGC CCCAAAGGCT CCTATTCACA TCTTGCCGCT
CGTCAGTACG CTGCGCGTCA CTTTGAGCAA TTCATTGAAA GTGGCTGCGC CAAATTTGCC
GATATTTTTA ATCAGGTGGA AACCGGCCAG GCCGACTATG CCGTCGTACC GATTGAAAAT
ACCAGCTCCG GTGCCATAAA CGACGTTTAC GATCTGCTGC AACATACCAG TTTGTCGATT
GTTGGCGAGA TGACGTTAAC TATCGATCAT TGTTTGTTGG TCTCCGGCAC TACTGATTTA
TCCACCATTA ATACGGTCTA CAGCCATCCG CAGCCATTCC AGCAATGCAG CAAATTCCTT
AATCGTTATC CGCACTGGAA GATTGAATAT ACCGAAAGTA CGTCTGCGGC AATGGAAAAG
GTTGCACAGG CAAAATCACC GCATGTTGCT GCGTTGGGAA GCGAAGCTGG CGGCACTTTG
TACGGTTTGC AGGTACTGGA GCGTATTGAA GCGAATCAGC GACAAAACTT CACCCGATTT
GTGGTGTTGG CACGTAAAGC CATTAACGTT TCTGACCAGG TTCCGGCGAA AACGACGTTG
TTAATGGCGA CCGGACAACA AGCTGGTGCA CTGGTTGAAG CGTTGCTGGT ACTGCGCAAC
CACAGTCTAA TTATGACCCG TCTGGAATCA CGTCCGATTC ACGGTAATCC GTGGGAAGAG
ATGTTTTATC TGGATATTCA GGCCAATCTT GAATCAGCGG AAATGCAAAA AGCATTGAAA
GAGTTAGGGG AAATCACCCG TTCGATGAAG GTATTGGGCT GTTACCCAAG TGAGAACGTA
GTGCCTGTTG ATCCAACCTG A
 
Protein sequence
MTSENPLLAL REKISALDEK LLALLAERRE LAVEVGKAKL LSHRPVRDID RERDLLERLI 
TLGKAHHLDA HYITRLFQLI IEDSVLTQQA LLQQHLNKIN PHSARIAFLG PKGSYSHLAA
RQYAARHFEQ FIESGCAKFA DIFNQVETGQ ADYAVVPIEN TSSGAINDVY DLLQHTSLSI
VGEMTLTIDH CLLVSGTTDL STINTVYSHP QPFQQCSKFL NRYPHWKIEY TESTSAAMEK
VAQAKSPHVA ALGSEAGGTL YGLQVLERIE ANQRQNFTRF VVLARKAINV SDQVPAKTTL
LMATGQQAGA LVEALLVLRN HSLIMTRLES RPIHGNPWEE MFYLDIQANL ESAEMQKALK
ELGEITRSMK VLGCYPSENV VPVDPT