Gene Information |
Locus tag | BCG9842_B0398 |
Symbol | |
ID | 7185229 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus G9842 |
Kingdom | Bacteria |
Replicon accession | NC_011772 |
Strand | - |
Start bp | 4647385 |
End bp | 4650240 |
Gene Length | 2856 bp |
Protein Length | 951 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643552627 |
Product | collagen triple helix repeat domain protein |
Protein accession | YP_002448294 |
Protein GI | 218899883 |
COG category | |
COG ID | |
TIGRFAM ID | |
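The length fields in this record follow from the coordinates. The sketch below re-derives them, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TAA). The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above.
START_BP = 4647385
END_BP = 4650240


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon occupies one codon but adds no residue.
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    length_bp = gene_length(START_BP, END_BP)
    print(length_bp, "bp")
    print(protein_length(length_bp), "aa")
```

Because the gene lies on the minus strand, the listed sequence is the reverse complement of the replicon between these coordinates; the length arithmetic is the same for either strand.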
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.178342 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 143 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGAAGAATC GTGATAATAA AGGGAAACAA CAATCTAATT TTAGAATTCC ACCAGAACTT ATTGGTCCTA CTTTTCCTCC TGTTCCAACT GGATTTACGG GTATAGGGAT TACTGGTCCA ACAGGTCCAC AAGGTCCGAC TGGACCGCAA GGACCGAGAG GATTTCAAGG TCCAATGGGA GAGATGGGTC CGACTGGACC TCAAGGTGTG CAAGGGATTC AAGGTCCAGT TGGACCAATA GGTGCAACTG GACCAGAAGG GCAGCAGGGA GCACAAGGAT TGAGAGGACC ACAAGGAGAA ACTGGAGCGA CAGGACCTCA AGGTGTGCAA GGGTTACAAG GTCCGATTGG TCCAACGGGA GCGACTGGGG CACAAGGTAT ACAAGGGATA AAGGGATTGC AAGGGCCAAT TGGAGCGACA GGACCTGAGG GACCTCAAGG AATTCAAGGC GTCCAAGGGT TACCGGGTGC AACTGGTCCA CAAGGAATAC AAGGAGCACA AGGGATACAA GGAACACAAG GACCGAGTGG AAATACAGGT GCAACCGGAG CAACGGGTCA GGGGCTAACA GGTCCGACTG GAATAACAGG CCCAACTGGG ATAACTGGAC CATCTGGAGG ACCTCCTGGC CCGACGGGGC CAACTGGTGC GACAGGTCCG GGTGGCGGAC CGAGTGGGAG TACAGGTGCG ACTGGAGCAA CGGGGGATAC TGGGGCTACA GGAAGTACAG GTGTAACAGG AGCAACGGGA ACTACAGGTC CGACTGGAAG TACGGGAGCA CAGGGCTTGC AAGGAATACA AGGTATTCAA GGGTCAATTG GCCCAACAGG TCCAGAAGGA CCGCAGGGGA TTCAAGGTAT TCCTGGTCCG ACAGGAATAA CTGGTGAACA AGGAATCCAA GGGGTTCAGG GTATTCAAGG GGTAACGGGA GCAACAGGGG ATCAAGGTCC ACAAGGTATA CAGGGGGCTA TAGGGCCTCA AGGGGTCACA GGAGCAACAG GAGATCAAGG TCCACAAGGA ATACAAGGAG TACCAGGGCC ATCAGGAGCA ACGGGACCAC AGGGAGTTCA AGGGATACAA GGTCCGATGG GTGATATAGG ACCAACAGGT CCAGAAGGCC CAGAGGGACT TCAGGGCCCG CAAGGAATAC AAGGAGTGCC AGGACCAGTT GGAGCAACGG GTCCAGAGGG TCCTCAGGGG ATACAAGGTA TTCAAGGACC GGTAGGAACA ACAGGCCCAC AAGGTCCACA AGGAATACAG GGAATACAAG GTGTGCAAGG GATAACGGGA GCAACTGGAG TACAAGGAGC AACTGGAATT CAAGGGATAC AAGGGGAAAT AGGAGCAACG GGTCCAGAGG GTCCCCAAGG AGTGCAAGGT GCTCAAGGGG AGATTGGTCC AACAGGTCCG ATGGGTCCCC AAGGTGTGCA AGGAGTACAA GGAATTCAAG GAGCGACGGG CGCACAAGGA GTGCAAGGTC CACAGGGAAT TCAAGGAATC CAAGGTCCGA CGGGGGCAAC AGGAGATACA GGAGCAACAG GTGCGACAGG GGAAGGAACT ACAGGCCCAA CTGGAATAAC GGGAGCAACA GGGGTAACGG GACCTTCTGG AGGACCAGCA GGACCGTCCG GCCCAACAGG GCCATCAGGT CCGACGGGAG TAACTGGTCC ATCGGGTGGA CCACCTGGCC CGACAGGAGC AACTGGTGCG ACAGGAGTAA CAGGGGATAC TGGTGCGACA GGCTCAACTG GAGTGACAGG AGCGACAGGA GAAACGGGAG CAACCGGAGT GACAGGGTTA 
CAAGGTCCTC AAGGAATACA AGGTGTGCAA GGAGATATTG GTCCAACTGG TCCGCAAGGG ATACAAGGTC CGCAAGGGAT ACAAGGAGTA ACTGGTGCGA CAGGAGATCA AGGACCACAG GGAATCCAAG GCCCGCAAGG AATCCAAGGT CCAACCGGTC CCCAAGGAAT TCAAGGGGAA CAAGGCCCTC AAGGGATTCA AGGAGCAACC GGAGCCACGG GAGCACAAGG CCCACAGGGG ATTCAAGGAA TTCAAGGGAT CCAAGGTCCG ACCGGTCCTC AAGGCCCAAC AGGAATACAA GGGGTACAAG GAGAGATAGG TCCAACCGGT CCTCAAGGTG TGCAAGGATT GCAAGGTCCC CAAGGTCCTA CAGGGGACAC AGGGGCAACG GGAGCGCAAG GTCCCCAAGG AGTTCAAGGG ATACAAGGCC CAACGGGAGC TACAGGAGCA ACAGGAGCGA CAGGTCCACA AGGAATTCAA GGCCCGCAAG GAATCCAAGG CCCAACAGGG GCTACAGGAG CAACAGGTTC CCAAGGACCA ACTGGAAATA CAGGTCCAAC AGGTTCACAA GGAATACAAG GTCCAACTGG TCCAACAGGA GCTGGAGCAA CCGGAGCAAC AGGAGCGACC GGGGCGACTG GAGTCAGTAC AACTGCAACA TATGCATTTG CGAATAATAC ATCAGGAAGT ATTATTTCTG TTTTGTTAGG TGGCACGAAT ATTCCGTTAC CAAACAATCA AAATATTGGA CCAGGAATAA CCGTTAGTGG TGGAAATACT GTATTTACAG TTGCGAATGC AGGAAACTAT TATATAGCCT ATACAATTAA TTTAACGGCA GGATTACTTG TAAGTTCCCG TATAACTGTA AATGGCAGTC CGCTTGCGGG AACGATAAAC TCTCCGGCAG TGGCTGCAGG TTCATTTAGT GCAACAATAA TCGCTAACTT GCCTGCTGGA GCTGCAGTTA GTTTACAATT ATTTGGAGTA ATTGCGTTGG CTACATTATC TACGGCAACG CCAGGGGCTA CTCTAACGAT TATTAGATTA AGTTAA
Protein sequence | MKNRDNKGKQ QSNFRIPPEL IGPTFPPVPT GFTGIGITGP TGPQGPTGPQ GPRGFQGPMG EMGPTGPQGV QGIQGPVGPI GATGPEGQQG AQGLRGPQGE TGATGPQGVQ GLQGPIGPTG ATGAQGIQGI KGLQGPIGAT GPEGPQGIQG VQGLPGATGP QGIQGAQGIQ GTQGPSGNTG ATGATGQGLT GPTGITGPTG ITGPSGGPPG PTGPTGATGP GGGPSGSTGA TGATGDTGAT GSTGVTGATG TTGPTGSTGA QGLQGIQGIQ GSIGPTGPEG PQGIQGIPGP TGITGEQGIQ GVQGIQGVTG ATGDQGPQGI QGAIGPQGVT GATGDQGPQG IQGVPGPSGA TGPQGVQGIQ GPMGDIGPTG PEGPEGLQGP QGIQGVPGPV GATGPEGPQG IQGIQGPVGT TGPQGPQGIQ GIQGVQGITG ATGVQGATGI QGIQGEIGAT GPEGPQGVQG AQGEIGPTGP MGPQGVQGVQ GIQGATGAQG VQGPQGIQGI QGPTGATGDT GATGATGEGT TGPTGITGAT GVTGPSGGPA GPSGPTGPSG PTGVTGPSGG PPGPTGATGA TGVTGDTGAT GSTGVTGATG ETGATGVTGL QGPQGIQGVQ GDIGPTGPQG IQGPQGIQGV TGATGDQGPQ GIQGPQGIQG PTGPQGIQGE QGPQGIQGAT GATGAQGPQG IQGIQGIQGP TGPQGPTGIQ GVQGEIGPTG PQGVQGLQGP QGPTGDTGAT GAQGPQGVQG IQGPTGATGA TGATGPQGIQ GPQGIQGPTG ATGATGSQGP TGNTGPTGSQ GIQGPTGPTG AGATGATGAT GATGVSTTAT YAFANNTSGS IISVLLGGTN IPLPNNQNIG PGITVSGGNT VFTVANAGNY YIAYTINLTA GLLVSSRITV NGSPLAGTIN SPAVAAGSFS ATIIANLPAG AAVSLQLFGV IALATLSTAT PGATLTIIRL S
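The protein sequence can be related to the gene sequence through translation table 11, listed in this record. The following is a minimal sketch, not the database's own pipeline: it builds the codon table from the standard NCBI layout (table 11 shares its amino acid assignments with the standard code) and reads an alternative initiation codon such as GTG as methionine. The start codon set and the function name `translate_cds` are assumptions made for illustration.

```python
from itertools import product

# Standard NCBI codon layout: bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Initiation codons assumed for table 11; each is read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(sequence: str) -> str:
    """Translate a coding sequence, ignoring the spaces used to group bases."""
    dna = "".join(sequence.split()).upper()
    if len(dna) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for index in range(0, len(dna), 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is translated as Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base groups of the gene sequence in this record.
    print(translate_cds("GTGAAGAATC GTGATAATAA AGGGAAACAA"))
```

Applied to the first 30 bases of the gene sequence, the sketch reproduces the first ten residues of the protein sequence, including the GTG start read as M.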