Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcer98_1431 |
Symbol | |
ID | 5346293 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cytotoxicus NVH 391-98 |
Kingdom | Bacteria |
Replicon accession | NC_009674 |
Strand | + |
Start bp | 1529430 |
End bp | 1530806 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640839019 |
Product | S-layer domain-containing protein |
Protein accession | YP_001374745 |
Protein GI | 152975228 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4193] Beta- N-acetylglucosaminidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAG TAATTTCTAA CGTTTTAGCA ATGACAGCTG CACTACAAGT TATAATGGTT CCAACAACCT CATTTGCGGC AGAAAAAGGA TTTTCAGATG TACCAAAGAG TCATTGGGCA TATGAGGCAA TCAATGATTT AGCAAATCGA AACATCATTG CAGGTTATGA CAACGGTAAA TTTGGATTGG GAGACAGCGT AACGCGTGAA CAAGTAGCTG CACTTATTTA TCGTGCATTG AAACCAGAAG CGAAAGCAGA ATATAAAAAT CCATATCATG ATGTAAGTGC AAGTACAACA ATGTTTCCAA ATGAGATTTT AGCTTTAACG GAGATGGGAA TCTTTACAGG GGATGAGAAC AAAAACTTTA GACCAAAGGA TTCATTAACT CGCGCTGAAA TGGCAATGAT TTTACAAAGA GCTTATCATT TAAAAGTAAA GGCGAATCAT ACATTTCATG ATGTGGATCC GAATTCTTGG GCGAAAGATG CAATTAGTGC GTTACAGTCT AATGGAATGG CAGAGGGAGA TGGAACGGGT GCATTTTATC CGTCAAAAAC TGTCACACGT GAGGAATATG CACAATTTTT ATTTAATGCG GAACAATCTT ATTTAAATTT AGATTTAACA TTAGCTTCCA ATGTAACAGC AGAAGAAATT GATAATTTTC TTAAGAAATC GCGTTCTGAT AGTCCGTTAA TTGGCCATGG ACAAGACTTT ATTGCAGCAC AAAATGAACA TGGTGTAAAT GCTCTTTACT TAGCAGCACA TGCAATTTTA GAATCTGGAT ATGGAAGATC TGAGATTGCA TATCGTAAAC ATAATTTATT TGGACTACGT GCATATGATC GCGATCCATT CTATCATGCA AAATATTTAC CAACATACCG TGATAGTATT TCGTACAATG CTAACTACGT GAGAGAACGT TATTTAGAGA AAGGTGCAAT CTATTATAAT GGTCCAACAT TAGTTGGTAT GAATGTGAAA TATGCATCCG ATCCAGAATG GGCTGGAAAA ATTGCTGGTT TAATGGAGCG TATTAAGCCG TTTGATCGAA ATGATTATAA AAATGCTAAC AGATTACCGA AAAACCCACA TACTTTAGAC GTTGAAGCGT TAGGAAATGA AATTCCATAT AAGGATTTAG GAAATAGAGA AATTGCTGTT CAGGCATCTG GAAAATATTA TAAAGTGCCG TATCCATACG ATTTGAAAAT TAAGAGTATC CCAGACATTA CACAAAATGA AATGGGAACA TTAACGAATG GATCAAAAGT AACAGTTCAT CGTGAAGATC CAAATGGATG GGTAGAGTTT TCGATGAAAG ATAAGCAAGA AAAATATTGG ACATTGAAAA GTAATTTAAA AATGTAA
|
Protein sequence | MKKVISNVLA MTAALQVIMV PTTSFAAEKG FSDVPKSHWA YEAINDLANR NIIAGYDNGK FGLGDSVTRE QVAALIYRAL KPEAKAEYKN PYHDVSASTT MFPNEILALT EMGIFTGDEN KNFRPKDSLT RAEMAMILQR AYHLKVKANH TFHDVDPNSW AKDAISALQS NGMAEGDGTG AFYPSKTVTR EEYAQFLFNA EQSYLNLDLT LASNVTAEEI DNFLKKSRSD SPLIGHGQDF IAAQNEHGVN ALYLAAHAIL ESGYGRSEIA YRKHNLFGLR AYDRDPFYHA KYLPTYRDSI SYNANYVRER YLEKGAIYYN GPTLVGMNVK YASDPEWAGK IAGLMERIKP FDRNDYKNAN RLPKNPHTLD VEALGNEIPY KDLGNREIAV QASGKYYKVP YPYDLKIKSI PDITQNEMGT LTNGSKVTVH REDPNGWVEF SMKDKQEKYW TLKSNLKM
|
| |