Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A2756 |
Symbol | pheA |
ID | 5593279 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 2777358 |
End bp | 2778518 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640921872 |
Product | bifunctional chorismate mutase/prephenate dehydratase |
Protein accession | YP_001459391 |
Protein GI | 157162073 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0077] Prephenate dehydratase [COG1605] Chorismate mutase |
TIGRFAM ID | [TIGR01797] chorismate mutase domain of proteobacterial P-protein, clade 1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 8.8068e-16 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACATCGG AAAACCCGTT ACTGGCGCTG CGAGAGAAAA TCAGCGCGCT GGATGAAAAA TTATTAGCAT TACTCGCAGA GCGGCGCGAA CTGGCCGTCG AGGTGGGAAA AGCCAAACTG CTCTCGCATC GCCCGGTACG TGATATTGAT CGTGAACGCG ATTTACTGGA AAGATTAATT ACGCTCGGTA AAGCGCACCA TCTGGACGCC CATTACATTA CTCGCCTGTT CCAGCTCATC ATTGAAGATT CCGTATTAAC TCAGCAGGCT TTGCTCCAAC AACATCTCAA TAAAATTAAT CCGCACTCAG CACGCATCGC TTTTCTCGGC CCCAAAGGCT CCTATTCACA TCTTGCCGCT CGTCAGTACG CTGCCCGTCA CTTTGAGCAA TTCATTGAAA GTGGCTGCGC CAAATTTGCC GATATTTTTA ATCAGGTGGA AACCGGCCAG GCCGACTATG CCGTCGTACC GATTGAAAAT ACCAGCTCCG GTGCCATAAA CGACGTTTAC GATCTGCTGC AACATACCAG CTTGTCGATT GTTGGCGAGA TGACGTTAAC TATCGACCAT TGTTTGTTGG TCTCCGGCAC TACTGATTTA TCCACCATCA ATACGGTCTA CAGCCATCCG CAGCCATTCC AGCAATGCAG CAAATTCCTT AATCGTTATC CGCACTGGAA GATTGAATAT ACCGAAAGTA CGTCTGCGGC AATGGAAAAG GTTGCACAGG CAAAATCACC GCATGTTGCT GCGTTGGGAA GCGAAGCTGG CGGCACTTTG TACGGTTTGC AGGTACTGGA GCGTATTGAA GCAAATCAGC GACAAAACTT CACCCGATTT GTGGTGTTGG CGCGTAAAGC CATTAACGTG TCTGATCAGG TTCCGGCGAA AACCACGTTG TTAATGGCGA CCGGGCAACA AGCCGGTGCG CTGGTTGAAG CGTTGCTGGT ACTGCGCAAC CACAATCTGA TTATGACCCG TCTGGAATCA CGCCCGATTC ACGGTAATCC ATGGGAAGAG ATGTTCTATC TGGATATTCA GGCCAATCTT GAATCAGCGG AAATGCAAAA AGCATTGAAA GAGTTAGGGG AAATTACCCG TTCAATGAAG GTATTGGGCT GTTACCCAAG TGAGAACGTA GTGCCTGTTG ATCCAACCTG A
|
Protein sequence | MTSENPLLAL REKISALDEK LLALLAERRE LAVEVGKAKL LSHRPVRDID RERDLLERLI TLGKAHHLDA HYITRLFQLI IEDSVLTQQA LLQQHLNKIN PHSARIAFLG PKGSYSHLAA RQYAARHFEQ FIESGCAKFA DIFNQVETGQ ADYAVVPIEN TSSGAINDVY DLLQHTSLSI VGEMTLTIDH CLLVSGTTDL STINTVYSHP QPFQQCSKFL NRYPHWKIEY TESTSAAMEK VAQAKSPHVA ALGSEAGGTL YGLQVLERIE ANQRQNFTRF VVLARKAINV SDQVPAKTTL LMATGQQAGA LVEALLVLRN HNLIMTRLES RPIHGNPWEE MFYLDIQANL ESAEMQKALK ELGEITRSMK VLGCYPSENV VPVDPT
|
| |