Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E2216 |
Symbol | nagZ |
ID | 6269103 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 2016595 |
End bp | 2017620 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641726237 |
Product | beta-hexosaminidase |
Protein accession | YP_001880722 |
Protein GI | 187730351 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 0.350801 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGGTCCAG TAATGTTGGA TGTCGAAGGT TACGAACTGG ACGCGGAAGA GCGTGAAATA CTGGCGCATC CGCTGGTGGG AGGGCTGATT CTCTTTACGC GTAACTATCA TGATCCTGCC CAGTTACGTG AACTGGTGCG CCAGATCCGC GCAGCATCGC GCAATCATCT GGTGGTGGCG GTAGATCAGG AAGGTGGACG CGTGCAGCGT TTTCGCGAAG GTTTTACCCG CTTACCGGCA GCACAATCCT TTGCTGCGCT GTTGGGAATG GAAGAGGGCG GCAAACTGGC GCAAGAGGCG GGTTGGCTGA TGGCCAGCGA AATGATCGCT ATGGATATTG ATATCAGCTT TGCGCCAGTG CTGGATGTAG GACATATCAG CGCGGCGATT GGCGAGCGTT CTTATCATGC CGACCCAGAA AAAGCCCTGG CAATCGCCAG TCGGTTTATT GATGGGATGC ATGAAGCCGG AATGAAAACG ACCGGGAAAC ACTTCCCAGG ACACGGTGCA GTAACTGCAG ATTCACACAA AGAGACGCCG TGCGACCCAC GCCCGCAAGC GGAAATTCGT GCCAAAGATA TGTCGGTTTT CAGCACGTTA ATCCGCGAAA ATAAACTCGA CGCCATTATG CCTGCGCATG TGATCTACAG TGATGTTGAT CCGCGTCCGG CGAGCGGTTC TTCCTACTGG CTGAAAACCG TTTTGCGTCA GGAACTGTGT TTTGACGGTG TAATTTTCTC TGACGATTTA TCGATGGAAG GTGCCGCGAT TATGGGCAGT TATGCCGAAC GCGGGCAGGC ATCACTGGAC GCAGGTTGCG ATATGATCCT GGTCTGCAAT AATCGTAAAG GGGCCGTCAG CGTGTTAGAT AATCTGTCAC CGATCAAGGC AGAACGTGTT ACACGTTTGT ATCATAAAGG TTCATTTTCG CGACAGGAAC TGATGGACTC GGCTCGCTGG AAGGCGAGCA GCACCCGTCT GAATCAGTTA CATGAACGCT GGCAGGAAGA GAAAGCAGGT CACTAA
|
Protein sequence | MGPVMLDVEG YELDAEEREI LAHPLVGGLI LFTRNYHDPA QLRELVRQIR AASRNHLVVA VDQEGGRVQR FREGFTRLPA AQSFAALLGM EEGGKLAQEA GWLMASEMIA MDIDISFAPV LDVGHISAAI GERSYHADPE KALAIASRFI DGMHEAGMKT TGKHFPGHGA VTADSHKETP CDPRPQAEIR AKDMSVFSTL IRENKLDAIM PAHVIYSDVD PRPASGSSYW LKTVLRQELC FDGVIFSDDL SMEGAAIMGS YAERGQASLD AGCDMILVCN NRKGAVSVLD NLSPIKAERV TRLYHKGSFS RQELMDSARW KASSTRLNQL HERWQEEKAG H
|
| |