Gene SbBS512_E3352 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E3352 
Symbolepd 
ID6271921 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp3121980 
End bp3122999 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content51% 
IMG OID641727246 
Producterythrose 4-phosphate dehydrogenase 
Protein accessionYP_001881696 
Protein GI187730160 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0057] Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase 
TIGRFAM ID[TIGR01532] D-erythrose-4-phosphate dehydrogenase
[TIGR01534] glyceraldehyde-3-phosphate dehydrogenase, type I 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value0.652674 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGTAC GCGTAGCGAT AAATGGCTTC GGTCGCATCG GGCGTAATGT GGTTCGTGCT 
TTGTATGAAT CCGGACGCCG GGCGGAAATT ACCGTGGTGG CAATCAACGA ACTGGCGGAT
GCTGCGGGCA TGGCGCATTT GTTGAAATAT GACACCAGCC ATGGCCGTTT TGCATGGGAA
GTACGACAGG AACGCGATCA ACTTTTTGTT GGTGATGACG CCATCCGCGT ATTGCATGAA
CGTTCACTGC AATCGCTCCC CTGGCGTGAA CTTGGCGTTG ATGTAGTCCT CGACTGCACC
GGCGTATATG GCTCCCGCGA GCATGGCGAA GCGCATATTG CCGCCGGGGC CAAAAAAGTG
CTCTTTTCAC ATCCTGGCAG TAACGATCTC GACGCGACCG TTGTTTACGG CGTCAATCAG
GATCAACTTC GTGCGGAACA CCGCATCGTT TCTAACGCTT CCTGTACCAC GAATTGCATA
ATTCCCGTCA TCAAATTGTT AGATGATGCG TACGGTATTG AGTCCGGCAC TGTGACCACA
ATTCACTCCG CCATGCACGA TCAACAGGTT ATTGATGCAT ACCATCCTGA CCTGCGTCGC
ACCCGGGCAG CCAGCCAGTC GATCATTCCG GTCGATACTA AACTGGCCGC CGGTATCACA
CGATTTTTTC CGCAATTTAA CGATCGCTTT GAAGCGATTG CGGTACGTGT GCCAACCATA
AATGTGACGG CAATCGATTT AAGCGTGACG GTGAAGAAAC CTGTAAAAGC CAATGAAGTC
AACCTGTTGC TGCAAAAAGC AGCACAAGGT GCATTTCATG GTATAGTTGA CTATACGGAA
TTGCCGTTGG TCTCTGTAGA TTTTAACCAC GATCCGCACA GTGCCATTGT CGATGGCACC
CAAACCCGGG TCAGTGGCGC ACACCTGATC AAAACGTTGG TCTGGTGCGA TAACGAATGG
GGCTTTGCTA ACCGAATGCT CGACACGACG TTAGCTATGG CTACTGTTGC TTTCAGGTAA
 
Protein sequence
MTVRVAINGF GRIGRNVVRA LYESGRRAEI TVVAINELAD AAGMAHLLKY DTSHGRFAWE 
VRQERDQLFV GDDAIRVLHE RSLQSLPWRE LGVDVVLDCT GVYGSREHGE AHIAAGAKKV
LFSHPGSNDL DATVVYGVNQ DQLRAEHRIV SNASCTTNCI IPVIKLLDDA YGIESGTVTT
IHSAMHDQQV IDAYHPDLRR TRAASQSIIP VDTKLAAGIT RFFPQFNDRF EAIAVRVPTI
NVTAIDLSVT VKKPVKANEV NLLLQKAAQG AFHGIVDYTE LPLVSVDFNH DPHSAIVDGT
QTRVSGAHLI KTLVWCDNEW GFANRMLDTT LAMATVAFR