Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E3937 |
Symbol | bcsZ |
ID | 6271662 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 3669983 |
End bp | 3671080 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641727787 |
Product | endo-1,4-D-glucanase |
Protein accession | YP_001882220 |
Protein GI | 187733064 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3405] Endoglucanase Y |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.0936967 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCGTAGTG GAATCGTGAC GATGCTGTTG TTGGCTGCCT TTAGTGTTCA GGCAGCCTGT ACCTGGCCTG CCTGGGAGCA GTTTAAAAAG GATTACATCA GTCAGGAAGG GCGCGTCATC GACCCCAGCG ACGCGCGCAA AATCACCACC TCCGAAGGGC AAAGTTACGG TATGTTCTTT GCCCTGGCGG CTAACGACCG TGCAGCTTTC GATAATATTC TCGACTGGAC GCAGAACAAT CTCGCTCAGG GTTCTTTAAA AGAACGTTTG CCCGCCTGGC TGTGGGGCAA GAAAGAGAAC AGTAAGTGGG AAGTGCTGGA CAGCAATTCG GCCTCCGATG GTGATGTCTG GATGGCCTGG TCGTTGCTGG AGGCGGGGCG TTTGTGGAAA GAGCAGCGTT ATACCGACAT CGGCAGCGCG TTGCTAAAAC GTATCGCGCG GGAGGAAGTG GTGACGGTGC CTGGGCTGGG TTCCATGTTG TTACCGGGCA AAGTGGGTTT TGCTGAGGAT AACAGCTGGC GTTTTAACCC CAGCTACCTG CCGCCGACGC TGGCGCAGTA TTTCACCCGC TTTGGCGCGC CGTGGACTAC GCTGCGCGAA ACCAATCAAC GTTTATTGCT GGAAACCGCC CCGAAAGGCT TTTCGCCAGA CTGGGTGCGT TATGAGAAAG ACAAAGGCTG GCAGCTAAAA GCCGAAAAAA CATTGATCAG CAGCTACGAC GCCATCCGCG TTTACATGTG GGTAGGCATG ATGCCTGACA GCGATCCGCA GAAAGCGCGG ATGCTCAACC GGTTTAAACC GATGGCGACA TTCACTGAGA AAAACGGTTA TCCGCCGGAA AAAGTGGATG TGGCTACGGG GAAAGCGCAG GGTAAAGGAC CAGTCGGTTT TTCTGCCGCC ATGCTGCCCT TTTTACAAAA CCGCGATGCG CAGGTCGTTC AGCGCCAGCG CGTGGCCGAT AACTTTCCCG GCAGCGATGC CTATTACAAC TATGTGCTGA CCCTGTTTGG ACAAGGCTGG GATCAACACC GTTTCCGCTT CTCGACAAAA GGTGAGTTAT TACCTGACTG GGGCCAGGAA TGCGCAAATT CACACTAA
|
Protein sequence | MRSGIVTMLL LAAFSVQAAC TWPAWEQFKK DYISQEGRVI DPSDARKITT SEGQSYGMFF ALAANDRAAF DNILDWTQNN LAQGSLKERL PAWLWGKKEN SKWEVLDSNS ASDGDVWMAW SLLEAGRLWK EQRYTDIGSA LLKRIAREEV VTVPGLGSML LPGKVGFAED NSWRFNPSYL PPTLAQYFTR FGAPWTTLRE TNQRLLLETA PKGFSPDWVR YEKDKGWQLK AEKTLISSYD AIRVYMWVGM MPDSDPQKAR MLNRFKPMAT FTEKNGYPPE KVDVATGKAQ GKGPVGFSAA MLPFLQNRDA QVVQRQRVAD NFPGSDAYYN YVLTLFGQGW DQHRFRFSTK GELLPDWGQE CANSH
|
| |