Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A2994 |
Symbol | pheA |
ID | 6872523 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 2887205 |
End bp | 2888365 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642786030 |
Product | bifunctional chorismate mutase/prephenate dehydratase |
Protein accession | YP_002216676 |
Protein GI | 198242157 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0077] Prephenate dehydratase [COG1605] Chorismate mutase |
TIGRFAM ID | [TIGR01797] chorismate mutase domain of proteobacterial P-protein, clade 1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 76 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACATCGG AAAACCCATT ACTGGCGCTG CGAGATAAAA TCAGCGCTTT AGACGAAGAG TTACTGGCCT TACTGGCAAA ACGACGCGCG CTGGCGATTG AAGTGGGACA AGCAAAACTA CTGTCGCATC GTCCGGTTCG GGATATCGAT CGTGAACGCG CGCTGCTGGA CAGACTCATC CATCTCGGTA AAGCCCACCA TCTCGACGCA CACTACATTA CCCGTCTGTT CCAGCTTATC ATTGAAGACT CCGTGCTTAC TCAGCAGGCG CTGCTGCAAC AACATCTGAA TAATACTCAC CCTCATTCGG CACGTATTGC GTTTCTTGGG CCGAAAGGCT CCTATTCTCA TCTCGCGGCG CGCCAGTATG CTGCACGCCA TTTTGAGCAA TTTATTGAGA GCGGCTGCGC AAAATTCGCC GATATTTTTC ATCAGGTCGA AACCGGCCAG GCCGATTACG CCGTGGTTCC GATAGAGAAC ACCAGCTCCG GCGCTATCAA CGATGTGTAC GACTTATTGC AACACACCAG TCTGTCGATT GTCGGTGAGA TGACTGTCAC TATCGATCAC TGCGTGCTGG TTTCCGGCGC TACAGATCTG AATACCATCG AAACGGTGTA CAGCCATCCG CAGCCGTTTC AGCAGTGCAG TAAATTTTTG AGCCGCTATC CGCACTGGAA AATCGACTAT ACCGAGAGTA CGTCGGCAGC GATGGAAAAA GTCGCGCAGG CAAACTCTCC GCGCGTCGCG GCGCTCGGCA GCGAGGCAGG CGGCATGTTG CACGGTTTAC AGGTGCTGGA ACGCATTGCC GCAAACCAGA CGCAGAATAT CACCCGCTTT CTGGTACTGG CGCGCAAAGC CATCAACGTT TCCGATCAGG TTCCGGCAAA AACCACTCTG TTAATCGCCA CCGGGCAGCA AGCTGGCGCG CTGGTCGAAG CGCTGCTGGT GCTGCGTAAC CACAATCTCA TCATGACGAA ACTGGAGTCG CGCCCCATTC ACGACAATCC GTGGGAAGAG ATGTTTTATC TCGATATTCA GGCGAACCTG GAGTCGCAGG TAATGCAAAG CGCGCTAAAA GAGCTGGGCG AGATCACGCG CTCAATGAAA GTGCTTGGCT GCTATCCCAG CGAAAACGTC GTGCCGGTAG AACCTGCCTG A
|
Protein sequence | MTSENPLLAL RDKISALDEE LLALLAKRRA LAIEVGQAKL LSHRPVRDID RERALLDRLI HLGKAHHLDA HYITRLFQLI IEDSVLTQQA LLQQHLNNTH PHSARIAFLG PKGSYSHLAA RQYAARHFEQ FIESGCAKFA DIFHQVETGQ ADYAVVPIEN TSSGAINDVY DLLQHTSLSI VGEMTVTIDH CVLVSGATDL NTIETVYSHP QPFQQCSKFL SRYPHWKIDY TESTSAAMEK VAQANSPRVA ALGSEAGGML HGLQVLERIA ANQTQNITRF LVLARKAINV SDQVPAKTTL LIATGQQAGA LVEALLVLRN HNLIMTKLES RPIHDNPWEE MFYLDIQANL ESQVMQSALK ELGEITRSMK VLGCYPSENV VPVEPA
|
| |