Gene SbBS512_E3495 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E3495 
SymbolddtA 
ID6269637 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp3246962 
End bp3247873 
Gene Length912 bp 
Protein Length303 aa 
Translation table11 
GC content54% 
IMG OID641727378 
Producttartrate dehydratase subunit alpha 
Protein accessionYP_001881825 
Protein GI187733439 
COG category[C] Energy production and conversion 
COG ID[COG1951] Tartrate dehydratase alpha subunit/Fumarate hydratase class I, N-terminal domain 
TIGRFAM ID[TIGR00722] hydro-lyases, Fe-S type, tartrate/fumarate subfamily, alpha region 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAGCG AAAGTAATAA GCAACAGGCA GTGAATAAGT TGACAGAGAT TGTCGCTAAC 
TTTACCGCCA TGATTTCTAC CCGAATGCCT GATGACGTGG TGGATAAACT AAAACAGCTA
AAGGATGCCG AAACGTCGTC GATGGGGAAA ATTATCTACC ATACGATGTT CGACAACATG
CAAAAAGCGA TTGACCTGAA TCGTCCTGCC TGTCAGGACA CCGGGGAGAT TATGTTCTTC
GTTAAAGTCG GTTCCCGCTT CCCACTGCTT GGCGAGCTGC AAAGCATACT CAAACAAGCC
GTGGAAGAGG CAACCGTCAA AGCGCCACTA CGTCACAATG CGGTAGAAAT TTTTGACGAA
GTAAACACCG GCAAAAATAC CGGTAGCGGC GTACCGTGGG TCACCTGGGA CATCATCCCC
GACAATGACG ATGCGGAAAT CGAAGTTTAC ATGGCAGGTG GCGGCTGCAC GCTACCTGGC
CGCTCGAAAG TGTTAATGCC GTCAGAAGGC TACGAAGGCG TGGTGAAATT CGTCTTCGAA
AATATCTCCA CCCTCGCCGT AAACGCCTGT CCACCGGTAC TGGTGGGCGT GGGCATCGCC
ACCTCGGTGG AAACCGCCGC CGTACTCTCG CGTAAAGCCA TTTTGCGCCC GATTGGCAGC
CGCCATCCCA ATCCAAAAGC GGCAGAACTG GAGCTACGCC AGGAAGAAGG ACTCAACCGT
CTGGGGATTG GTCCACAAGG GCTGACTGGC AACAGTTCAG TGATGGGCGT ACATATCGAA
TCTGCCGCCC GCCATCCGTC AACCATCGGC GTTGCTGTCT CTACCGGCTG CTGGGCGCAT
CGTCGCGGCA CACTGCTGGT TCATGCCGAT CTCACCTTTG AAAATCTGTC TCACACCCGG
AGCGCGTTAT GA
 
Protein sequence
MMSESNKQQA VNKLTEIVAN FTAMISTRMP DDVVDKLKQL KDAETSSMGK IIYHTMFDNM 
QKAIDLNRPA CQDTGEIMFF VKVGSRFPLL GELQSILKQA VEEATVKAPL RHNAVEIFDE
VNTGKNTGSG VPWVTWDIIP DNDDAEIEVY MAGGGCTLPG RSKVLMPSEG YEGVVKFVFE
NISTLAVNAC PPVLVGVGIA TSVETAAVLS RKAILRPIGS RHPNPKAAEL ELRQEEGLNR
LGIGPQGLTG NSSVMGVHIE SAARHPSTIG VAVSTGCWAH RRGTLLVHAD LTFENLSHTR
SAL