Gene SbBS512_E2216 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E2216 
SymbolnagZ 
ID6269103 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp2016595 
End bp2017620 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content53% 
IMG OID641726237 
Productbeta-hexosaminidase 
Protein accessionYP_001880722 
Protein GI187730351 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value0.350801 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGGTCCAG TAATGTTGGA TGTCGAAGGT TACGAACTGG ACGCGGAAGA GCGTGAAATA 
CTGGCGCATC CGCTGGTGGG AGGGCTGATT CTCTTTACGC GTAACTATCA TGATCCTGCC
CAGTTACGTG AACTGGTGCG CCAGATCCGC GCAGCATCGC GCAATCATCT GGTGGTGGCG
GTAGATCAGG AAGGTGGACG CGTGCAGCGT TTTCGCGAAG GTTTTACCCG CTTACCGGCA
GCACAATCCT TTGCTGCGCT GTTGGGAATG GAAGAGGGCG GCAAACTGGC GCAAGAGGCG
GGTTGGCTGA TGGCCAGCGA AATGATCGCT ATGGATATTG ATATCAGCTT TGCGCCAGTG
CTGGATGTAG GACATATCAG CGCGGCGATT GGCGAGCGTT CTTATCATGC CGACCCAGAA
AAAGCCCTGG CAATCGCCAG TCGGTTTATT GATGGGATGC ATGAAGCCGG AATGAAAACG
ACCGGGAAAC ACTTCCCAGG ACACGGTGCA GTAACTGCAG ATTCACACAA AGAGACGCCG
TGCGACCCAC GCCCGCAAGC GGAAATTCGT GCCAAAGATA TGTCGGTTTT CAGCACGTTA
ATCCGCGAAA ATAAACTCGA CGCCATTATG CCTGCGCATG TGATCTACAG TGATGTTGAT
CCGCGTCCGG CGAGCGGTTC TTCCTACTGG CTGAAAACCG TTTTGCGTCA GGAACTGTGT
TTTGACGGTG TAATTTTCTC TGACGATTTA TCGATGGAAG GTGCCGCGAT TATGGGCAGT
TATGCCGAAC GCGGGCAGGC ATCACTGGAC GCAGGTTGCG ATATGATCCT GGTCTGCAAT
AATCGTAAAG GGGCCGTCAG CGTGTTAGAT AATCTGTCAC CGATCAAGGC AGAACGTGTT
ACACGTTTGT ATCATAAAGG TTCATTTTCG CGACAGGAAC TGATGGACTC GGCTCGCTGG
AAGGCGAGCA GCACCCGTCT GAATCAGTTA CATGAACGCT GGCAGGAAGA GAAAGCAGGT
CACTAA
 
Protein sequence
MGPVMLDVEG YELDAEEREI LAHPLVGGLI LFTRNYHDPA QLRELVRQIR AASRNHLVVA 
VDQEGGRVQR FREGFTRLPA AQSFAALLGM EEGGKLAQEA GWLMASEMIA MDIDISFAPV
LDVGHISAAI GERSYHADPE KALAIASRFI DGMHEAGMKT TGKHFPGHGA VTADSHKETP
CDPRPQAEIR AKDMSVFSTL IRENKLDAIM PAHVIYSDVD PRPASGSSYW LKTVLRQELC
FDGVIFSDDL SMEGAAIMGS YAERGQASLD AGCDMILVCN NRKGAVSVLD NLSPIKAERV
TRLYHKGSFS RQELMDSARW KASSTRLNQL HERWQEEKAG H