Gene Information |
Locus tag | BCZK1629 |
Symbol | |
ID | 3024747 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus E33L |
Kingdom | Bacteria |
Replicon accession | NC_006274 |
Strand | + |
Start bp | 1714734 |
End bp | 1716146 |
Gene Length | 1413 bp |
Protein Length | 470 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 637545857 |
Product | S-layer protein |
Protein accession | YP_083223 |
Protein GI | 52143605 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4193] Beta-N-acetylglucosaminidase |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00040233 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAG TTATTTCTAA TGTGTTAGCA GTGACAGTCG CACTTCAAGT AGTGATGGCT CCAGCAACTT CTTTTGCATC TACAAAAGAA TTTCCAGACG TTCCGAAAAA TCATTGGGCA CTTGAAGCGA TTAATGATTT AACGTCAAAA GGGGTTATTG CAGGTTACGA TAATGGTAAA TTTGGCTTTG GAGATGTTGT AACTCGTGAG CAAGTAGCAG CATTAATGTA TCGCGCACTA AAACCAGAGG TGAAAAGCGA TTATAAAAAT CCATACTCTG ATATTAGTGC AGGAACGACG ATGTTCCAAA AAGAAATCTT GGCATTAACA GATATGGGAA TTTTCGTAGG TGATGGTAAA GGAACGTTTA GACCGAAAGA ATCATTAACA CGTGCGGAGA TGTCTGTTAT TTTGCAAAAA GCATTTAAAT TAGAAGTAAA AGCACCACAT ACATTTGATG ATATAGATGC AACATATTGG TGGGCAAAGG AAGCAATTAG TGCATTGCAA TCTAATGGTG TAGCGGCAGG GAATGGACTT GGGGGATTTG ATCCATCAGG TGTATTAACG CGTGAAGGCT ATGCACAATT ATTATATAAA GCGATGCAAA TAAAAAAGGA TGTTCCTGTT GAACAACCAT CATATATGAA TCTAGATGTG ACATTGCCAT CTAACATAAC GGCACAGGAG ATTGATGGAT TTATTAAAGA ATGGCACCCT GACAGTCCGC TTATTGGAAC TGGACAAGAT TTTATTCAAG CACAAAATGA GTATGGTGTG AGCGCATTAT ACTTAGCTGC ACATGCAATT TTAGAATCTG GTTACGGTAA ATCAGAAATT GCATATCGTA AACATAATTT ATTTGGTTTA AGAGCATATG ATCGCGATCC ATTTGCATAC GCAAAATATT TACCATCATA CAAGGACAGT ATTTCTTACA ATGCTGATTA TGTAAGAAAG AATTACTTAG AAAAAGGTGC TGATCATTTT AATGGTTACA CATTGCCTGC TATGAATATT AAGTATGCAA CAGATAAAGA ATGGGCTGGC AAAATCGCTA ATCTTATGGA GCGTATTAAA CCGTTTAACA AAAAAGATTA TGAAAATGTA AAACGATTAC CAAAGAATCC TAATACATTG AATGTAGAGG CATTAGGAAA AGAAATTCCA TATAAAGATT ATGCAAAAGA TGCAACAGCT ACTGTTCAAT TAGTAGGCTC TTACTATCAA GTACCATATC CATTTGGCTA TACAATTAAG AGTGTACCAA ATATTACGCA AAATGAAGTT GGAAAATTAG AAAGTGGCAA GAAAGTAAAT GTATATCGTG AAGATCCAAA CGGCTGGGTA GAATTTTCAT TTGAAAACGC TCAAGAAAAA TATTGGACAT TGAAGAAGAA CTTAAAAATA TAA
|
Protein sequence | MKKVISNVLA VTVALQVVMA PATSFASTKE FPDVPKNHWA LEAINDLTSK GVIAGYDNGK FGFGDVVTRE QVAALMYRAL KPEVKSDYKN PYSDISAGTT MFQKEILALT DMGIFVGDGK GTFRPKESLT RAEMSVILQK AFKLEVKAPH TFDDIDATYW WAKEAISALQ SNGVAAGNGL GGFDPSGVLT REGYAQLLYK AMQIKKDVPV EQPSYMNLDV TLPSNITAQE IDGFIKEWHP DSPLIGTGQD FIQAQNEYGV SALYLAAHAI LESGYGKSEI AYRKHNLFGL RAYDRDPFAY AKYLPSYKDS ISYNADYVRK NYLEKGADHF NGYTLPAMNI KYATDKEWAG KIANLMERIK PFNKKDYENV KRLPKNPNTL NVEALGKEIP YKDYAKDATA TVQLVGSYYQ VPYPFGYTIK SVPNITQNEV GKLESGKKVN VYREDPNGWV EFSFENAQEK YWTLKKNLKI
|
| |
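The length fields in this record follow from the coordinates: the gene spans Start bp to End bp inclusive, and the protein length is the codon count minus the stop codon. The sketch below checks that arithmetic and shows one way to compute GC content from a spaced sequence string. The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers introduced for illustration, not part of any database API, and the sketch assumes an unspliced CDS that ends in a stop codon, as this record indicates.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS that includes its stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


def gc_percent(seq: str) -> float:
    """Percent G+C, ignoring spaces and any non-ACGT characters."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the record above.
length = gene_length(1714734, 1716146)
print(length)                  # 1413
print(protein_length(length))  # 470
```

Passing the full Gene sequence field to `gc_percent` gives a value that can be compared with the reported GC content; the spaces between 10-base blocks are skipped by the filter.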