Gene BCG9842_B1684 details

Gene Information

Locus tag: BCG9842_B1684
Symbol:
ID: 7185797
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cereus G9842
Kingdom: Bacteria
Replicon accession: NC_011772
Strand:
Start bp: 3453962
End bp: 3457129
Gene Length: 3168 bp
Protein Length: 1055 aa
Translation table: 11
GC content: 33%
IMG OID: 643551357
Product: collagen adhesion protein
Protein accession: YP_002447027
Protein GI: 218898616
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG4932] Predicted outer membrane protein
TIGRFAM ID: [TIGR01167] LPXTG-motif cell wall anchor domain
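
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted as a residue; neither convention is stated in the record itself, and the variable names are illustrative.

```python
# Coordinates copied from the record above.
start_bp = 3453962
end_bp = 3457129

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 3168 1055
```

Under these assumptions the computed values agree with the listed gene length (3168 bp) and protein length (1055 aa).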


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0235076
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000000000016254
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAACTGGA GTTATAGTAA TAGTTTAGGA AAGCACATTC GAACTGAAAT GATTAAAAAT 
TCAAGTGGTC AAATTGCTTA TTGCTTAACT CTTGGACTAA AATCACCAAA TGGTGAAGAT
CTTCCTGAAA TGGGGAAAAC AGATAATGTA GTATATAGGC TCCTATTAAA CGGTTTCCCG
CAAAAAAGTG TTCAGCAGTT GGGAGTAGCT AATCAGAATG AGGCACACTA CGCAACTCAG
CTTGCAGTTT GGAATGCATT AGGTCAACTT GATGTAAATG AATTAAAACA TGAGAATAAA
AATGTTGAAA AAGCAGCTAA GGCTATTATT AGTAATGCTA ATAATAGTGA GGAGACTCAA
GATGTTTTTA TGAATGTGAT TCCTGCTGAA AAGCAAAAAG CAGAATTAAA GGGTGAATTC
TTCGAAACAA ATCTATATTC AGTACAAACA AATGCTAAGG GTGGTTCTTA TAAAGTAGTA
GCAAAAAATG CACCTAATGG AGTAAAAATT GTTAGTGAAA ATGGTGAAGT GAAAGATCAA
CTTTCAGTTG GAGAGAAATT CCGTATCCAA ATTCCTAAAA ATACTAAAAC AGGTGAATTT
AATCTAAGTG TTGCTGCAAA CTTAACAAAA GTTCAAGCAA TTGCTTACCG TGGTACGGAT
ACTGTTCAGA ATGCTACAGT ACTATTAGAA AGAAATGAAG AAAAGCTTAG TAGTGATCTT
GCAGTAAATT GGGAAGCAGC TGGTTCTTTA AAAATTAAAA AGGTTGGAGA AAATGGAGAA
GTTTTAGCTG ATGCAGTATT TGAGGTCTTT AATGCAAATA ATGAATCAGT TGGAAAAATC
ACGACTGGTG CTGATGGTAT AGCAGAATTA AACAACTTAC CAATTGGTAC TTACACATTA
AAAGAAATTA AAGCACCAAC AGGCTATGTT TCAGATGATA AACCACAAAC TATTGAAGTT
AAGACAGGAG AAACAGGAGC TGTCCAAATA GTAAATAACA AAGTAAAAGG TAACATCGAA
ATTAAAAAGC TTAGTGATTC AGGAAAGGTT TTACCAAATG TTGAATTCAC AGTCTTCACT
GAAGATGGTA AAGAAGTGAA AAAAGTAGTA ACAAAAGAAA ATGGAATTGC AAACGTTGAA
GGTCTTACGT ATGGTAAATA TTATTTCTTA GAGACACAAA CACCTAATGG ATATATTGGA
AATAAAACAA AGTATCCTTT TGAAATTAAA GAGCACAATA AAACACTTAC TTTTACAGTG
GAAAATACAG AAGTAAAAGG TAGTGTAAAA TTACTTAAAG TGGATAATGA AGATGTTAGT
AAAAAATTAG AAGGTGCAGA ATTCGAGCTA AAAGATGCAA GTGGAAAAGT AATTGGTGAA
TATAAAACAG ATAAAAATGG TGAGATTAAC GTCAAAGATT TAGCGTATGG TAAATATTCA
TTTGTAGAAA AGGCTTCTCC AAATGGTTAT GTTCTTGTTA AAGAACCAAT TATGTTCGAA
ATAAAGGAAC ATGGAAAGAT TATTGAGTTA TTGGCAGTTA ATCATCTTAT CAAAGGTGAT
CTAGAAATTA CAAAGGTAGA TGTAGCTGAT GGAAATAACA AGCTTCCAAA TGCAGAATTT
ACGATTTATA ACGAAGCAGG AAAAGAAGTA GTAAAAGGAA AAACAGACGA TAAAGGAATC
GCTAAATTTG AAAAGTTACC ATTTGGAAAA TACACATATA AAGAAACTGT TGCACCAAAA
GGTTATGTAT TAAATGAAGA AACGTTCTCT TTTGAAATTA AAGAGAATGG TCAAATCATT
AAACACATTG TCAAAGATGA AAAGATTCCT TCAATAAAAA CAACAGCTAC TGATAAAACA
GATGGTACAA AAGAAATGCA CACGTCTAAG TCTGTAACAA TCCAAGATAA AGTGGAATAC
AAAGATCTAC AAGTAGGTAA AGAGTACACT GTAAAAGGTA AATTAATGAA TAAAGAGACT
AATAAACCAT TAGTTGTAAA TGGTAAAGAA GTAACTGCTG AAACGAAGTT TACACCAACA
GAAGCAAATG GTTTTATTAC ACTAGATTTT ACTTTTGATG CAACTGGTTT AGAAGAAAAA
GAAGTAGTAG TATTCGAAGA GTTACTGAAA GACGGAAAAG TTGTTACGAC ACATGCTGAT
ATCAATGATA AAGGTCAAAC AGTTAAGTTC GTTAAGCCAT CAGTAAAAAC GACAGCTACA
AACAAAGCTG ATGGTGGAAA AGAGATTCAT TCAAAGGATT CTATCACTAT TCAAGACAAA
GTGGAGTATA CTAACTTAGT TGTAGGTAAA GAGTACACTG TAAAAGGTAA ACTTATGAAC
AAAGCTATTA ACAAACCATT ATTAATTGAT GGAAAAGAAG TAACAGCTGA AACGAAGTTT
ACTGCAAAAG AAAAGAACGG TTTTGTAACA TTAGACTTTA CTTTTGTTGG TGCTGAACAG
CAAGGAAGAG AAGTAGTCGT GTTTGAAGAC TTATTACATG AAGGTCAAGT AATTGCAACT
CATGCTGACA TTAATGATGT AGGTCAAACA GTTCGGTTTG TAGAACCTTC TATTAAAACG
ACAGCTACAA ATAAAGCTGA TGGTTCTAAA GAGTTAGACG CTTCTAAATC TGTAACAATC
CAAGATAAAG TGGAATACAA AGACTTAATC GTGGGTAAAG AGTACGTTGT AAAAGGTAAA
CTAATGGATA AAGCAACAAA CAAGCCATTA TTAGTTGATG GTAAAGAGGT AACAGTAGAA
TCTAAGTTTA CTGCTAAAGA GAAGAATGGT TCTATCATAC TAGATTTCAC ATGTAATGCT
TCTGCATTAC AAGGTAAAGA AGTAGTAGTA TTTGAAGAAC TGTATCAAGA TAATATATTA
ATAGCTATTC ACGCTGAAAT TGAAGATAAG GGACAAACAG TGAAATTTAA AGAGGTAAAG
CCGGAACAAC CTAAACCAGA ACAGCCGAAT TCGGATGAAA ATACTCCAAC ACCAGAGCAA
CCTAACGAGC AAGTAAAAGA ACAACCACAA CCTAAGAAAG AAATCCAATC TAAAATTGGA
TGGTTACCAC AAACAGGTAC TAATCTTACA AGTTGGATCT CTATGGCTGC AGGAGCATTA
CTGTTAATTG TTGGTGGAGT GATTTTCTTA AAACGTAAAA ATGCATAG
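
The gene sequence above can be related to the protein sequence of this record by codon translation. The following is a minimal sketch, not the database's own pipeline: it builds the codon table by hand (translation table 11 assigns the same amino acids to codons as the standard code and differs only in permitted start codons), and the helper name `translate` is an assumption made for illustration. Only the first 60 bases are used to keep the example short.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence, copied from the record.
first_60_bases = "ATGAACTGGA GTTATAGTAA TAGTTTAGGA AAGCACATTC GAACTGAAAT GATTAAAAAT"
print(translate(first_60_bases))  # MNWSYSNSLGKHIRTEMIKN
```

The output matches the first 20 residues of the protein sequence listed in this record.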
 
Protein sequence
MNWSYSNSLG KHIRTEMIKN SSGQIAYCLT LGLKSPNGED LPEMGKTDNV VYRLLLNGFP 
QKSVQQLGVA NQNEAHYATQ LAVWNALGQL DVNELKHENK NVEKAAKAII SNANNSEETQ
DVFMNVIPAE KQKAELKGEF FETNLYSVQT NAKGGSYKVV AKNAPNGVKI VSENGEVKDQ
LSVGEKFRIQ IPKNTKTGEF NLSVAANLTK VQAIAYRGTD TVQNATVLLE RNEEKLSSDL
AVNWEAAGSL KIKKVGENGE VLADAVFEVF NANNESVGKI TTGADGIAEL NNLPIGTYTL
KEIKAPTGYV SDDKPQTIEV KTGETGAVQI VNNKVKGNIE IKKLSDSGKV LPNVEFTVFT
EDGKEVKKVV TKENGIANVE GLTYGKYYFL ETQTPNGYIG NKTKYPFEIK EHNKTLTFTV
ENTEVKGSVK LLKVDNEDVS KKLEGAEFEL KDASGKVIGE YKTDKNGEIN VKDLAYGKYS
FVEKASPNGY VLVKEPIMFE IKEHGKIIEL LAVNHLIKGD LEITKVDVAD GNNKLPNAEF
TIYNEAGKEV VKGKTDDKGI AKFEKLPFGK YTYKETVAPK GYVLNEETFS FEIKENGQII
KHIVKDEKIP SIKTTATDKT DGTKEMHTSK SVTIQDKVEY KDLQVGKEYT VKGKLMNKET
NKPLVVNGKE VTAETKFTPT EANGFITLDF TFDATGLEEK EVVVFEELLK DGKVVTTHAD
INDKGQTVKF VKPSVKTTAT NKADGGKEIH SKDSITIQDK VEYTNLVVGK EYTVKGKLMN
KAINKPLLID GKEVTAETKF TAKEKNGFVT LDFTFVGAEQ QGREVVVFED LLHEGQVIAT
HADINDVGQT VRFVEPSIKT TATNKADGSK ELDASKSVTI QDKVEYKDLI VGKEYVVKGK
LMDKATNKPL LVDGKEVTVE SKFTAKEKNG SIILDFTCNA SALQGKEVVV FEELYQDNIL
IAIHAEIEDK GQTVKFKEVK PEQPKPEQPN SDENTPTPEQ PNEQVKEQPQ PKKEIQSKIG
WLPQTGTNLT SWISMAAGAL LLIVGGVIFL KRKNA
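
The TIGRFAM annotation of this record (LPXTG-motif cell wall anchor domain) can be illustrated on the C-terminal end of the protein sequence. The following is a minimal sketch that searches the last line of the sequence with a plain regular expression for Leu-Pro-X-Thr-Gly; it is a simple pattern match, not the TIGRFAM model itself, and the variable names are illustrative.

```python
import re

# Last line of the protein sequence, copied from the record (spaces removed).
c_terminal_tail = "WLPQTGTNLT SWISMAAGAL LLIVGGVIFL KRKNA".replace(" ", "")

# LPXTG: Leu, Pro, any residue, Thr, Gly.
match = re.search(r"LP.TG", c_terminal_tail)

# Print the matched motif and its 0-based offset within this last line.
print(match.group(), match.start())  # LPQTG 1
```

The match is followed by a stretch of mostly hydrophobic residues and a short positively charged tail (KRK), which is the usual layout of an LPXTG sorting signal.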