Gene SeD_A2994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A2994 
SymbolpheA 
ID6872523 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp2887205 
End bp2888365 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content54% 
IMG OID642786030 
Productbifunctional chorismate mutase/prephenate dehydratase 
Protein accessionYP_002216676 
Protein GI198242157 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0077] Prephenate dehydratase
[COG1605] Chorismate mutase 
TIGRFAM ID[TIGR01797] chorismate mutase domain of proteobacterial P-protein, clade 1 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones76 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATCGG AAAACCCATT ACTGGCGCTG CGAGATAAAA TCAGCGCTTT AGACGAAGAG 
TTACTGGCCT TACTGGCAAA ACGACGCGCG CTGGCGATTG AAGTGGGACA AGCAAAACTA
CTGTCGCATC GTCCGGTTCG GGATATCGAT CGTGAACGCG CGCTGCTGGA CAGACTCATC
CATCTCGGTA AAGCCCACCA TCTCGACGCA CACTACATTA CCCGTCTGTT CCAGCTTATC
ATTGAAGACT CCGTGCTTAC TCAGCAGGCG CTGCTGCAAC AACATCTGAA TAATACTCAC
CCTCATTCGG CACGTATTGC GTTTCTTGGG CCGAAAGGCT CCTATTCTCA TCTCGCGGCG
CGCCAGTATG CTGCACGCCA TTTTGAGCAA TTTATTGAGA GCGGCTGCGC AAAATTCGCC
GATATTTTTC ATCAGGTCGA AACCGGCCAG GCCGATTACG CCGTGGTTCC GATAGAGAAC
ACCAGCTCCG GCGCTATCAA CGATGTGTAC GACTTATTGC AACACACCAG TCTGTCGATT
GTCGGTGAGA TGACTGTCAC TATCGATCAC TGCGTGCTGG TTTCCGGCGC TACAGATCTG
AATACCATCG AAACGGTGTA CAGCCATCCG CAGCCGTTTC AGCAGTGCAG TAAATTTTTG
AGCCGCTATC CGCACTGGAA AATCGACTAT ACCGAGAGTA CGTCGGCAGC GATGGAAAAA
GTCGCGCAGG CAAACTCTCC GCGCGTCGCG GCGCTCGGCA GCGAGGCAGG CGGCATGTTG
CACGGTTTAC AGGTGCTGGA ACGCATTGCC GCAAACCAGA CGCAGAATAT CACCCGCTTT
CTGGTACTGG CGCGCAAAGC CATCAACGTT TCCGATCAGG TTCCGGCAAA AACCACTCTG
TTAATCGCCA CCGGGCAGCA AGCTGGCGCG CTGGTCGAAG CGCTGCTGGT GCTGCGTAAC
CACAATCTCA TCATGACGAA ACTGGAGTCG CGCCCCATTC ACGACAATCC GTGGGAAGAG
ATGTTTTATC TCGATATTCA GGCGAACCTG GAGTCGCAGG TAATGCAAAG CGCGCTAAAA
GAGCTGGGCG AGATCACGCG CTCAATGAAA GTGCTTGGCT GCTATCCCAG CGAAAACGTC
GTGCCGGTAG AACCTGCCTG A
 
Protein sequence
MTSENPLLAL RDKISALDEE LLALLAKRRA LAIEVGQAKL LSHRPVRDID RERALLDRLI 
HLGKAHHLDA HYITRLFQLI IEDSVLTQQA LLQQHLNNTH PHSARIAFLG PKGSYSHLAA
RQYAARHFEQ FIESGCAKFA DIFHQVETGQ ADYAVVPIEN TSSGAINDVY DLLQHTSLSI
VGEMTVTIDH CVLVSGATDL NTIETVYSHP QPFQQCSKFL SRYPHWKIDY TESTSAAMEK
VAQANSPRVA ALGSEAGGML HGLQVLERIA ANQTQNITRF LVLARKAINV SDQVPAKTTL
LIATGQQAGA LVEALLVLRN HNLIMTKLES RPIHDNPWEE MFYLDIQANL ESQVMQSALK
ELGEITRSMK VLGCYPSENV VPVEPA