Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCG9842_B1684 |
Symbol | |
ID | 7185797 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus G9842 |
Kingdom | Bacteria |
Replicon accession | NC_011772 |
Strand | - |
Start bp | 3453962 |
End bp | 3457129 |
Gene Length | 3168 bp |
Protein Length | 1055 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 643551357 |
Product | collagen adhesion protein |
Protein accession | YP_002447027 |
Protein GI | 218898616 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG4932] Predicted outer membrane protein |
TIGRFAM ID | [TIGR01167] LPXTG-motif cell wall anchor domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0235076 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.000000000000016254 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAACTGGA GTTATAGTAA TAGTTTAGGA AAGCACATTC GAACTGAAAT GATTAAAAAT TCAAGTGGTC AAATTGCTTA TTGCTTAACT CTTGGACTAA AATCACCAAA TGGTGAAGAT CTTCCTGAAA TGGGGAAAAC AGATAATGTA GTATATAGGC TCCTATTAAA CGGTTTCCCG CAAAAAAGTG TTCAGCAGTT GGGAGTAGCT AATCAGAATG AGGCACACTA CGCAACTCAG CTTGCAGTTT GGAATGCATT AGGTCAACTT GATGTAAATG AATTAAAACA TGAGAATAAA AATGTTGAAA AAGCAGCTAA GGCTATTATT AGTAATGCTA ATAATAGTGA GGAGACTCAA GATGTTTTTA TGAATGTGAT TCCTGCTGAA AAGCAAAAAG CAGAATTAAA GGGTGAATTC TTCGAAACAA ATCTATATTC AGTACAAACA AATGCTAAGG GTGGTTCTTA TAAAGTAGTA GCAAAAAATG CACCTAATGG AGTAAAAATT GTTAGTGAAA ATGGTGAAGT GAAAGATCAA CTTTCAGTTG GAGAGAAATT CCGTATCCAA ATTCCTAAAA ATACTAAAAC AGGTGAATTT AATCTAAGTG TTGCTGCAAA CTTAACAAAA GTTCAAGCAA TTGCTTACCG TGGTACGGAT ACTGTTCAGA ATGCTACAGT ACTATTAGAA AGAAATGAAG AAAAGCTTAG TAGTGATCTT GCAGTAAATT GGGAAGCAGC TGGTTCTTTA AAAATTAAAA AGGTTGGAGA AAATGGAGAA GTTTTAGCTG ATGCAGTATT TGAGGTCTTT AATGCAAATA ATGAATCAGT TGGAAAAATC ACGACTGGTG CTGATGGTAT AGCAGAATTA AACAACTTAC CAATTGGTAC TTACACATTA AAAGAAATTA AAGCACCAAC AGGCTATGTT TCAGATGATA AACCACAAAC TATTGAAGTT AAGACAGGAG AAACAGGAGC TGTCCAAATA GTAAATAACA AAGTAAAAGG TAACATCGAA ATTAAAAAGC TTAGTGATTC AGGAAAGGTT TTACCAAATG TTGAATTCAC AGTCTTCACT GAAGATGGTA AAGAAGTGAA AAAAGTAGTA ACAAAAGAAA ATGGAATTGC AAACGTTGAA GGTCTTACGT ATGGTAAATA TTATTTCTTA GAGACACAAA CACCTAATGG ATATATTGGA AATAAAACAA AGTATCCTTT TGAAATTAAA GAGCACAATA AAACACTTAC TTTTACAGTG GAAAATACAG AAGTAAAAGG TAGTGTAAAA TTACTTAAAG TGGATAATGA AGATGTTAGT AAAAAATTAG AAGGTGCAGA ATTCGAGCTA AAAGATGCAA GTGGAAAAGT AATTGGTGAA TATAAAACAG ATAAAAATGG TGAGATTAAC GTCAAAGATT TAGCGTATGG TAAATATTCA TTTGTAGAAA AGGCTTCTCC AAATGGTTAT GTTCTTGTTA AAGAACCAAT TATGTTCGAA ATAAAGGAAC ATGGAAAGAT TATTGAGTTA TTGGCAGTTA ATCATCTTAT CAAAGGTGAT CTAGAAATTA CAAAGGTAGA TGTAGCTGAT GGAAATAACA AGCTTCCAAA TGCAGAATTT ACGATTTATA ACGAAGCAGG AAAAGAAGTA GTAAAAGGAA AAACAGACGA TAAAGGAATC GCTAAATTTG AAAAGTTACC ATTTGGAAAA TACACATATA AAGAAACTGT TGCACCAAAA GGTTATGTAT TAAATGAAGA AACGTTCTCT TTTGAAATTA AAGAGAATGG TCAAATCATT AAACACATTG TCAAAGATGA AAAGATTCCT TCAATAAAAA CAACAGCTAC TGATAAAACA GATGGTACAA AAGAAATGCA CACGTCTAAG TCTGTAACAA TCCAAGATAA AGTGGAATAC AAAGATCTAC AAGTAGGTAA AGAGTACACT GTAAAAGGTA AATTAATGAA TAAAGAGACT AATAAACCAT TAGTTGTAAA TGGTAAAGAA GTAACTGCTG AAACGAAGTT TACACCAACA GAAGCAAATG GTTTTATTAC ACTAGATTTT ACTTTTGATG CAACTGGTTT AGAAGAAAAA GAAGTAGTAG TATTCGAAGA GTTACTGAAA GACGGAAAAG TTGTTACGAC ACATGCTGAT ATCAATGATA AAGGTCAAAC AGTTAAGTTC GTTAAGCCAT CAGTAAAAAC GACAGCTACA AACAAAGCTG ATGGTGGAAA AGAGATTCAT TCAAAGGATT CTATCACTAT TCAAGACAAA GTGGAGTATA CTAACTTAGT TGTAGGTAAA GAGTACACTG TAAAAGGTAA ACTTATGAAC AAAGCTATTA ACAAACCATT ATTAATTGAT GGAAAAGAAG TAACAGCTGA AACGAAGTTT ACTGCAAAAG AAAAGAACGG TTTTGTAACA TTAGACTTTA CTTTTGTTGG TGCTGAACAG CAAGGAAGAG AAGTAGTCGT GTTTGAAGAC TTATTACATG AAGGTCAAGT AATTGCAACT CATGCTGACA TTAATGATGT AGGTCAAACA GTTCGGTTTG TAGAACCTTC TATTAAAACG ACAGCTACAA ATAAAGCTGA TGGTTCTAAA GAGTTAGACG CTTCTAAATC TGTAACAATC CAAGATAAAG TGGAATACAA AGACTTAATC GTGGGTAAAG AGTACGTTGT AAAAGGTAAA CTAATGGATA AAGCAACAAA CAAGCCATTA TTAGTTGATG GTAAAGAGGT AACAGTAGAA TCTAAGTTTA CTGCTAAAGA GAAGAATGGT TCTATCATAC TAGATTTCAC ATGTAATGCT TCTGCATTAC AAGGTAAAGA AGTAGTAGTA TTTGAAGAAC TGTATCAAGA TAATATATTA ATAGCTATTC ACGCTGAAAT TGAAGATAAG GGACAAACAG TGAAATTTAA AGAGGTAAAG CCGGAACAAC CTAAACCAGA ACAGCCGAAT TCGGATGAAA ATACTCCAAC ACCAGAGCAA CCTAACGAGC AAGTAAAAGA ACAACCACAA CCTAAGAAAG AAATCCAATC TAAAATTGGA TGGTTACCAC AAACAGGTAC TAATCTTACA AGTTGGATCT CTATGGCTGC AGGAGCATTA CTGTTAATTG TTGGTGGAGT GATTTTCTTA AAACGTAAAA ATGCATAG
|
Protein sequence | MNWSYSNSLG KHIRTEMIKN SSGQIAYCLT LGLKSPNGED LPEMGKTDNV VYRLLLNGFP QKSVQQLGVA NQNEAHYATQ LAVWNALGQL DVNELKHENK NVEKAAKAII SNANNSEETQ DVFMNVIPAE KQKAELKGEF FETNLYSVQT NAKGGSYKVV AKNAPNGVKI VSENGEVKDQ LSVGEKFRIQ IPKNTKTGEF NLSVAANLTK VQAIAYRGTD TVQNATVLLE RNEEKLSSDL AVNWEAAGSL KIKKVGENGE VLADAVFEVF NANNESVGKI TTGADGIAEL NNLPIGTYTL KEIKAPTGYV SDDKPQTIEV KTGETGAVQI VNNKVKGNIE IKKLSDSGKV LPNVEFTVFT EDGKEVKKVV TKENGIANVE GLTYGKYYFL ETQTPNGYIG NKTKYPFEIK EHNKTLTFTV ENTEVKGSVK LLKVDNEDVS KKLEGAEFEL KDASGKVIGE YKTDKNGEIN VKDLAYGKYS FVEKASPNGY VLVKEPIMFE IKEHGKIIEL LAVNHLIKGD LEITKVDVAD GNNKLPNAEF TIYNEAGKEV VKGKTDDKGI AKFEKLPFGK YTYKETVAPK GYVLNEETFS FEIKENGQII KHIVKDEKIP SIKTTATDKT DGTKEMHTSK SVTIQDKVEY KDLQVGKEYT VKGKLMNKET NKPLVVNGKE VTAETKFTPT EANGFITLDF TFDATGLEEK EVVVFEELLK DGKVVTTHAD INDKGQTVKF VKPSVKTTAT NKADGGKEIH SKDSITIQDK VEYTNLVVGK EYTVKGKLMN KAINKPLLID GKEVTAETKF TAKEKNGFVT LDFTFVGAEQ QGREVVVFED LLHEGQVIAT HADINDVGQT VRFVEPSIKT TATNKADGSK ELDASKSVTI QDKVEYKDLI VGKEYVVKGK LMDKATNKPL LVDGKEVTVE SKFTAKEKNG SIILDFTCNA SALQGKEVVV FEELYQDNIL IAIHAEIEDK GQTVKFKEVK PEQPKPEQPN SDENTPTPEQ PNEQVKEQPQ PKKEIQSKIG WLPQTGTNLT SWISMAAGAL LLIVGGVIFL KRKNA
|
| |