Gene BCG9842_B1865 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCG9842_B1865 
Symbol 
ID7182685 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus G9842 
KingdomBacteria 
Replicon accessionNC_011772 
Strand
Start bp3265944 
End bp3268460 
Gene Length2517 bp 
Protein Length838 aa 
Translation table11 
GC content49% 
IMG OID643551177 
Productcollagen triple helix repeat domain protein 
Protein accessionYP_002446847 
Protein GI218898436 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.224261 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones75 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGTCATC ATAAAAATTG CAAAAAATTC GGAGTGGCAC TGCCTCTTCC GCTTATAGGT 
GCAACTGGTC CAACAGGTAA TAGTGGCCCT GTAGGTCCAA CTGGAGGGCA TGAAGGTCCT
GTTGGTCCAA CAGGGATGAC TGGACCGACG GGAGCAACTG GTCCCCAAGG ACCCCAAGGA
CCTCAAGGTA TCAGGGGAAT ACAAGGTTCG CGAGGGACAA CAGGAGCGCA GGGAATACAA
GGAGCTGTTG GTGATACCGG AGAGCAAGGT ATAACTGGTC CAACTGGCCT CCAAGGTGCT
CAAGGATTAA TCGGAAACCA AGGTCCAATT GGTGATATTG GGGTACAAGG ATTAGAAGGA
ACACAGGGAG CTACGGGTCC TGCTGGTAGT CAAGGAATAC AGGGCATACA AGGAATACAA
GGGGAAGTAG GTGAGCGAGG ACAAACAGGG GCACAAGGAA TACAAGGAGA GAGAGGGGTA
ACTGGAGTTC CAGGGGTAAC TGGTTCCCAA GGTCCTCAAG GTGTTCAAGG GATACAAGGA
GAGCAAGGGG CTACTGGTAT ACAAGGAGAA GACGGCGCTC AAGGAATACA AGGAATTACT
GGAGAACAAG GACACCAAGG TGATCAAGGG GCACAAGGTG TAGTAGGACC AACTGGTTCT
ACTGGAATAC AAGGTCCTCA AGGTATAAGC GGAATAAAGG GAATAACCGG TGTAACAGGC
CCTCAAGGTC CACAAGGAGT TCAAGGAATA CAAGGAGTAA GTGGATCCAC AGGTTTTCAA
GGAGCAAAAG GAGTACGAGG GATAACAGGA GCAACTGGTC CTACTGGTAC ACAAGGCTCA
GAAGGACCAC CGGGTGGTCC GACGGGAACA ACCGGTCCAA TTGGTCCATC TAGTGGAGTG
ACTGGTCCAT CAGGTCCACC CGGCCCGCCA GGAGGTCCAA CAGGTCCAAC GGGTGCGACT
GGTTCAACAG GTGTAAGCGG AGGGATAGGA CCAATTGGAG CGCAAGGAGT GCAAGGTATA
ACAGGTCCAA CAGGTCCTCA AGGTGTAAGG GGAGTCCAAG GTTCCCAAGG TGTAGTTGGA
GCAGTTGGAG TCCAAGGGGC ACAAGGTCCT CAAGGTAACC CAGGTATAAC GGGTCCAACG
GGTGCAGAAG GTTCTCAAGG AATACAAGGT ACGCGTGGAG TAACTGGTCC GACAGGTGCA
GAAGGCCCTA AAGGTATTCA AGGTATTCAA GGTGTAATAG GCCCAACAGG TGCACAAGGA
ATGGTGGGAA TACAAGGGAT AGCCGGTCCA GCTGGTGTTA CAGGAGCGGA AGGAGTTCAA
GGTGAACAAG GAATCCGAGG AGCAACTGGT CCAGCTGGAG CGCAAGGTAT ACAAGGAGTA
CAAGGAATTC AAGGGGAAAC AGGAGCAGCT GGAGCACAAG GTCCTCAAGG AGTTCAAGGG
ATACAAGGGA CTACGGGATT GACAGGTGCA CAAGGTGCAC AAGGTCTTCA AGGAGTGCAA
GGAATAATCG GTCCGACTGG AGCTATTGGT TTGCAAGGTC CTCAAGGATT ACAAGGAATA
ACGGGTCCAA CAGGAGTTCA AGGAGTGCAA GGAATACAAG GAGTGCAAGG AATAATCGGT
CCAACAGGAG CACAAGGAAT TAGAGGTCCC CAAGGAAATA CAGGGGAGGT CGGTATAACT
GGCCCTCAAG GAGTCCAAGG GGCACAAGGT AGTCAAGGAC CCCAAGGCCC ACAAGGGAAT
ATTGGAATTA CTGGTGCAAC GGGTGAAACT GGAGCTACTG GAGCAACCGG GTCAACTGGA
CCACAAGGTG TGCAAGGTGT TCAAGGGATT ACGGGAATAC AAGGAATAAC AGGAATGACA
GGCGATATAG GTGCAACGGG TCCGCAAGGA ATACAAGGTA TACAAGGAAT TCAAGGTTTG
CAAGGCCCAA TTGGAGCAAG TGGGGTGACA GGTGCTAGTG GTAGTATAGG CCCAGTTGGA
GCGCAAGGTA GTCAGGGGCA AAACGGAGCG GTTGGACCAA CAGGCGCAAC TGGTAATGTA
GGATCAATAG GTTTCTCGGG GATAGCAGGA GCGACTGGAG CGACTGGTTT ACCTAGTGGG
GGCGGTTATT TCTTTTCCAC TGCAACGAGT ACAATTGCAG CGAATGCGCT AATACCAATT
AATTCTGGTT CTACAATTTT TGGAGCAGGA GTTAGTTTAA CAAATGCGAC AACTATAACG
TTAAGTACGC CAGGGATATA TTTAATAAGT TATTATTTTC AAGGGGATGC AATTTTGGGG
AATGAAACGA TTTCGGTAAG GCTTGTTTTA AACGGAACGC AAGTCGCAGG GAGTTTTATT
CTTTATGTTA CAAAAGGTAA TTTTATATTA GAACCAGCGA TTTCAAATAC GATGGTAATT
GAAGTTACTT CTCCAAATTC CACTTTGTCA TTACAAAATG GTCCATTAGC TATTGGGCAT
GTAACGACAT TAGCGGGAAT AATAACAGCT AGCTTAAACA TATTACAAAT AGTTTGA
 
Protein sequence
MRHHKNCKKF GVALPLPLIG ATGPTGNSGP VGPTGGHEGP VGPTGMTGPT GATGPQGPQG 
PQGIRGIQGS RGTTGAQGIQ GAVGDTGEQG ITGPTGLQGA QGLIGNQGPI GDIGVQGLEG
TQGATGPAGS QGIQGIQGIQ GEVGERGQTG AQGIQGERGV TGVPGVTGSQ GPQGVQGIQG
EQGATGIQGE DGAQGIQGIT GEQGHQGDQG AQGVVGPTGS TGIQGPQGIS GIKGITGVTG
PQGPQGVQGI QGVSGSTGFQ GAKGVRGITG ATGPTGTQGS EGPPGGPTGT TGPIGPSSGV
TGPSGPPGPP GGPTGPTGAT GSTGVSGGIG PIGAQGVQGI TGPTGPQGVR GVQGSQGVVG
AVGVQGAQGP QGNPGITGPT GAEGSQGIQG TRGVTGPTGA EGPKGIQGIQ GVIGPTGAQG
MVGIQGIAGP AGVTGAEGVQ GEQGIRGATG PAGAQGIQGV QGIQGETGAA GAQGPQGVQG
IQGTTGLTGA QGAQGLQGVQ GIIGPTGAIG LQGPQGLQGI TGPTGVQGVQ GIQGVQGIIG
PTGAQGIRGP QGNTGEVGIT GPQGVQGAQG SQGPQGPQGN IGITGATGET GATGATGSTG
PQGVQGVQGI TGIQGITGMT GDIGATGPQG IQGIQGIQGL QGPIGASGVT GASGSIGPVG
AQGSQGQNGA VGPTGATGNV GSIGFSGIAG ATGATGLPSG GGYFFSTATS TIAANALIPI
NSGSTIFGAG VSLTNATTIT LSTPGIYLIS YYFQGDAILG NETISVRLVL NGTQVAGSFI
LYVTKGNFIL EPAISNTMVI EVTSPNSTLS LQNGPLAIGH VTTLAGIITA SLNILQIV