Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2751 |
Symbol | pheA |
ID | 6146584 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2832714 |
End bp | 2833874 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641617621 |
Product | bifunctional chorismate mutase/prephenate dehydratase |
Protein accession | YP_001744782 |
Protein GI | 170681627 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0077] Prephenate dehydratase [COG1605] Chorismate mutase |
TIGRFAM ID | [TIGR01797] chorismate mutase domain of proteobacterial P-protein, clade 1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000000608431 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.0711319 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACATCGG AAAACCCGTT ACTGGCGCTG CGAGAGAAAA TCAGCGCGCT GGATGAAAAA TTATTAGCAT TACTGGCAGA GCGACGCGAA CTGGCCGTCG AGGTGGGAAA AGCCAAACTA CTCTCGCATC GCCCGGTACG TGATATTGAT CGTGAACGCG ATTTGCTGGA AAGATTAATT ACGCTCGGTA AAGCGCACCA TCTGGACGCC CATTACATTA CTCGCCTGTT CCAGCTCATC ATTGAAGATT CCGTATTAAC TCAGCAGGCT TTGCTCCAGC AACATCTCAA TAAAATTAAT CCGCACTCAG CACGCATCGC TTTTCTCGGC CCCAAAGGCT CCTATTCACA TCTTGCCGCT CGTCAGTACG CTGCGCGTCA CTTTGAGCAA TTCATTGAAA GTGGCTGCGC CAAATTTGCC GATATTTTTA ATCAGGTGGA AACCGGCCAG GCCGACTATG CCGTCGTACC GATTGAAAAT ACCAGCTCCG GTGCCATAAA CGACGTTTAC GATCTGCTGC AACATACCAG TTTGTCGATT GTTGGCGAGA TGACGTTAAC TATCGATCAT TGTTTGTTGG TCTCCGGCAC TACTGATTTA TCCACCATTA ATACGGTCTA CAGCCATCCG CAGCCATTCC AGCAATGCAG CAAATTCCTT AATCGTTATC CGCACTGGAA GATTGAATAT ACCGAAAGTA CGTCTGCGGC AATGGAAAAG GTTGCACAGG CAAAATCACC GCATGTTGCT GCGTTGGGAA GCGAAGCTGG CGGCACTTTG TACGGTTTGC AGGTACTGGA GCGTATTGAA GCGAATCAGC GACAAAACTT CACCCGATTT GTGGTGTTGG CACGTAAAGC CATTAACGTT TCTGACCAGG TTCCGGCGAA AACGACGTTG TTAATGGCGA CCGGACAACA AGCTGGTGCA CTGGTTGAAG CGTTGCTGGT ACTGCGCAAC CACAGTCTAA TTATGACCCG TCTGGAATCA CGTCCGATTC ACGGTAATCC GTGGGAAGAG ATGTTTTATC TGGATATTCA GGCCAATCTT GAATCAGCGG AAATGCAAAA AGCATTGAAA GAGTTAGGGG AAATCACCCG TTCGATGAAG GTATTGGGCT GTTACCCAAG TGAGAACGTA GTGCCTGTTG ATCCAACCTG A
|
Protein sequence | MTSENPLLAL REKISALDEK LLALLAERRE LAVEVGKAKL LSHRPVRDID RERDLLERLI TLGKAHHLDA HYITRLFQLI IEDSVLTQQA LLQQHLNKIN PHSARIAFLG PKGSYSHLAA RQYAARHFEQ FIESGCAKFA DIFNQVETGQ ADYAVVPIEN TSSGAINDVY DLLQHTSLSI VGEMTLTIDH CLLVSGTTDL STINTVYSHP QPFQQCSKFL NRYPHWKIEY TESTSAAMEK VAQAKSPHVA ALGSEAGGTL YGLQVLERIE ANQRQNFTRF VVLARKAINV SDQVPAKTTL LMATGQQAGA LVEALLVLRN HSLIMTRLES RPIHGNPWEE MFYLDIQANL ESAEMQKALK ELGEITRSMK VLGCYPSENV VPVDPT
|
| |