Gene SbBS512_E2986 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E2986 
SymbolpheA 
ID6272064 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp2788177 
End bp2789337 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content49% 
IMG OID641726923 
Productbifunctional chorismate mutase/prephenate dehydratase 
Protein accessionYP_001881388 
Protein GI187730271 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0077] Prephenate dehydratase
[COG1605] Chorismate mutase 
TIGRFAM ID[TIGR01797] chorismate mutase domain of proteobacterial P-protein, clade 1 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0000154451 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACATCGG AAAATCCATT ACTGGCGCTG CGAGAGAAAA TCAGCGCGCT GGATGAAAAA 
TTATTAGCAT TACTGGCAGA GCGGCGCGAA CTGGCCGTCG AGGTGGGAAA AGCCAAACTG
CTCTCGCATC GCCCGGTACG AGATATTGAT CGTGAACGCG ATTTACTGGA AAGATTAATT
ACGCTCGGTA AAGCGCACCA TCTGGACGCC CATTACATTA CTCGCCTGTT CCAGCTCATC
ATTGAAGATT CCGTATTAAC TCAGCAGGCT TTGCTCCAGC AACATCTCAA TAAAATTAAT
CCGCACTCAG CACGCATCGC TTTTCTCGGC CCCAAAGGTT CTTATTCCCA TCTTGCGGCG
CGCCAGTATG CTGCCCGTCA CTTTGAGCAA TTCATTGAAA GTGGCTGCGC CAAATTTGCC
GATATTTTTA ATCAGGTGGA AACCGGCCAG GCCGACTATG CCGTCGTACC GATTGAAAAT
ACCAGCTCCG GTGCCATAAA CGACGTTTAC GATCTGCTGC AACATACCAG CTTGTCGATT
GTTGGCGAGA TGACGTTAAC TATCGACCAT TGTTTGTTGG TCTCCGGCAC GACTGATTTA
TCCACCATTA ATACGGTCTA CAGCCATCCG CAGCCATTCC AGCAATGCAG CAAATTCCTT
AATCGTTATC CTCACTGGAA GATTGAATAT ACCGAAAGTA CGTCTGCGGC AATGGAAAAG
GTTGCACAGG CAAAATCACC GCATGTTGCT GCGTTGGGAA GCGAAGCTGG CGGCACTTTG
TACGGTTTGC AGGTACTGGA GCGTATTGAA GCGAATCAGC AACAAAACTT CACCCGATTT
GTGGTGTTGG CACGTAAAGC CATTAACGTG TCTGACCAGG TTCCGGCGAA AACGACGTTG
TTAATGGCGA CCGGGCAACA AGCCGGTGCG CTGGTTGAAG CGTTGCTGGT GCTGCGCAAC
CACAATCTGA TTATGACCCG TCTGGAATCA CGCCCGATTC ACGGTAATCC ATGGGAAGAG
ATGTTTTATC TGGATATTCA GGCCAATCTT GAATCAGCGG AAATGCAAAA AGCATTGAAA
GAGTTAGGGG AAATTACCCG TTCAATGAAG GTATTGGGCT GTTACCCAAG TGAGAACGTA
GTGCCTGTTG ATCCAACCTG A
 
Protein sequence
MTSENPLLAL REKISALDEK LLALLAERRE LAVEVGKAKL LSHRPVRDID RERDLLERLI 
TLGKAHHLDA HYITRLFQLI IEDSVLTQQA LLQQHLNKIN PHSARIAFLG PKGSYSHLAA
RQYAARHFEQ FIESGCAKFA DIFNQVETGQ ADYAVVPIEN TSSGAINDVY DLLQHTSLSI
VGEMTLTIDH CLLVSGTTDL STINTVYSHP QPFQQCSKFL NRYPHWKIEY TESTSAAMEK
VAQAKSPHVA ALGSEAGGTL YGLQVLERIE ANQQQNFTRF VVLARKAINV SDQVPAKTTL
LMATGQQAGA LVEALLVLRN HNLIMTRLES RPIHGNPWEE MFYLDIQANL ESAEMQKALK
ELGEITRSMK VLGCYPSENV VPVDPT