Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCG9842_B1865 |
Symbol | |
ID | 7182685 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus G9842 |
Kingdom | Bacteria |
Replicon accession | NC_011772 |
Strand | - |
Start bp | 3265944 |
End bp | 3268460 |
Gene Length | 2517 bp |
Protein Length | 838 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 643551177 |
Product | collagen triple helix repeat domain protein |
Protein accession | YP_002446847 |
Protein GI | 218898436 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.224261 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 75 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGTCATC ATAAAAATTG CAAAAAATTC GGAGTGGCAC TGCCTCTTCC GCTTATAGGT GCAACTGGTC CAACAGGTAA TAGTGGCCCT GTAGGTCCAA CTGGAGGGCA TGAAGGTCCT GTTGGTCCAA CAGGGATGAC TGGACCGACG GGAGCAACTG GTCCCCAAGG ACCCCAAGGA CCTCAAGGTA TCAGGGGAAT ACAAGGTTCG CGAGGGACAA CAGGAGCGCA GGGAATACAA GGAGCTGTTG GTGATACCGG AGAGCAAGGT ATAACTGGTC CAACTGGCCT CCAAGGTGCT CAAGGATTAA TCGGAAACCA AGGTCCAATT GGTGATATTG GGGTACAAGG ATTAGAAGGA ACACAGGGAG CTACGGGTCC TGCTGGTAGT CAAGGAATAC AGGGCATACA AGGAATACAA GGGGAAGTAG GTGAGCGAGG ACAAACAGGG GCACAAGGAA TACAAGGAGA GAGAGGGGTA ACTGGAGTTC CAGGGGTAAC TGGTTCCCAA GGTCCTCAAG GTGTTCAAGG GATACAAGGA GAGCAAGGGG CTACTGGTAT ACAAGGAGAA GACGGCGCTC AAGGAATACA AGGAATTACT GGAGAACAAG GACACCAAGG TGATCAAGGG GCACAAGGTG TAGTAGGACC AACTGGTTCT ACTGGAATAC AAGGTCCTCA AGGTATAAGC GGAATAAAGG GAATAACCGG TGTAACAGGC CCTCAAGGTC CACAAGGAGT TCAAGGAATA CAAGGAGTAA GTGGATCCAC AGGTTTTCAA GGAGCAAAAG GAGTACGAGG GATAACAGGA GCAACTGGTC CTACTGGTAC ACAAGGCTCA GAAGGACCAC CGGGTGGTCC GACGGGAACA ACCGGTCCAA TTGGTCCATC TAGTGGAGTG ACTGGTCCAT CAGGTCCACC CGGCCCGCCA GGAGGTCCAA CAGGTCCAAC GGGTGCGACT GGTTCAACAG GTGTAAGCGG AGGGATAGGA CCAATTGGAG CGCAAGGAGT GCAAGGTATA ACAGGTCCAA CAGGTCCTCA AGGTGTAAGG GGAGTCCAAG GTTCCCAAGG TGTAGTTGGA GCAGTTGGAG TCCAAGGGGC ACAAGGTCCT CAAGGTAACC CAGGTATAAC GGGTCCAACG GGTGCAGAAG GTTCTCAAGG AATACAAGGT ACGCGTGGAG TAACTGGTCC GACAGGTGCA GAAGGCCCTA AAGGTATTCA AGGTATTCAA GGTGTAATAG GCCCAACAGG TGCACAAGGA ATGGTGGGAA TACAAGGGAT AGCCGGTCCA GCTGGTGTTA CAGGAGCGGA AGGAGTTCAA GGTGAACAAG GAATCCGAGG AGCAACTGGT CCAGCTGGAG CGCAAGGTAT ACAAGGAGTA CAAGGAATTC AAGGGGAAAC AGGAGCAGCT GGAGCACAAG GTCCTCAAGG AGTTCAAGGG ATACAAGGGA CTACGGGATT GACAGGTGCA CAAGGTGCAC AAGGTCTTCA AGGAGTGCAA GGAATAATCG GTCCGACTGG AGCTATTGGT TTGCAAGGTC CTCAAGGATT ACAAGGAATA ACGGGTCCAA CAGGAGTTCA AGGAGTGCAA GGAATACAAG GAGTGCAAGG AATAATCGGT CCAACAGGAG CACAAGGAAT TAGAGGTCCC CAAGGAAATA CAGGGGAGGT CGGTATAACT GGCCCTCAAG GAGTCCAAGG GGCACAAGGT AGTCAAGGAC CCCAAGGCCC ACAAGGGAAT ATTGGAATTA CTGGTGCAAC GGGTGAAACT GGAGCTACTG GAGCAACCGG GTCAACTGGA CCACAAGGTG TGCAAGGTGT TCAAGGGATT ACGGGAATAC AAGGAATAAC AGGAATGACA GGCGATATAG GTGCAACGGG TCCGCAAGGA ATACAAGGTA TACAAGGAAT TCAAGGTTTG CAAGGCCCAA TTGGAGCAAG TGGGGTGACA GGTGCTAGTG GTAGTATAGG CCCAGTTGGA GCGCAAGGTA GTCAGGGGCA AAACGGAGCG GTTGGACCAA CAGGCGCAAC TGGTAATGTA GGATCAATAG GTTTCTCGGG GATAGCAGGA GCGACTGGAG CGACTGGTTT ACCTAGTGGG GGCGGTTATT TCTTTTCCAC TGCAACGAGT ACAATTGCAG CGAATGCGCT AATACCAATT AATTCTGGTT CTACAATTTT TGGAGCAGGA GTTAGTTTAA CAAATGCGAC AACTATAACG TTAAGTACGC CAGGGATATA TTTAATAAGT TATTATTTTC AAGGGGATGC AATTTTGGGG AATGAAACGA TTTCGGTAAG GCTTGTTTTA AACGGAACGC AAGTCGCAGG GAGTTTTATT CTTTATGTTA CAAAAGGTAA TTTTATATTA GAACCAGCGA TTTCAAATAC GATGGTAATT GAAGTTACTT CTCCAAATTC CACTTTGTCA TTACAAAATG GTCCATTAGC TATTGGGCAT GTAACGACAT TAGCGGGAAT AATAACAGCT AGCTTAAACA TATTACAAAT AGTTTGA
|
Protein sequence | MRHHKNCKKF GVALPLPLIG ATGPTGNSGP VGPTGGHEGP VGPTGMTGPT GATGPQGPQG PQGIRGIQGS RGTTGAQGIQ GAVGDTGEQG ITGPTGLQGA QGLIGNQGPI GDIGVQGLEG TQGATGPAGS QGIQGIQGIQ GEVGERGQTG AQGIQGERGV TGVPGVTGSQ GPQGVQGIQG EQGATGIQGE DGAQGIQGIT GEQGHQGDQG AQGVVGPTGS TGIQGPQGIS GIKGITGVTG PQGPQGVQGI QGVSGSTGFQ GAKGVRGITG ATGPTGTQGS EGPPGGPTGT TGPIGPSSGV TGPSGPPGPP GGPTGPTGAT GSTGVSGGIG PIGAQGVQGI TGPTGPQGVR GVQGSQGVVG AVGVQGAQGP QGNPGITGPT GAEGSQGIQG TRGVTGPTGA EGPKGIQGIQ GVIGPTGAQG MVGIQGIAGP AGVTGAEGVQ GEQGIRGATG PAGAQGIQGV QGIQGETGAA GAQGPQGVQG IQGTTGLTGA QGAQGLQGVQ GIIGPTGAIG LQGPQGLQGI TGPTGVQGVQ GIQGVQGIIG PTGAQGIRGP QGNTGEVGIT GPQGVQGAQG SQGPQGPQGN IGITGATGET GATGATGSTG PQGVQGVQGI TGIQGITGMT GDIGATGPQG IQGIQGIQGL QGPIGASGVT GASGSIGPVG AQGSQGQNGA VGPTGATGNV GSIGFSGIAG ATGATGLPSG GGYFFSTATS TIAANALIPI NSGSTIFGAG VSLTNATTIT LSTPGIYLIS YYFQGDAILG NETISVRLVL NGTQVAGSFI LYVTKGNFIL EPAISNTMVI EVTSPNSTLS LQNGPLAIGH VTTLAGIITA SLNILQIV
|
| |