Gene Information |
Locus tag | BCE_4868 |
Symbol | |
ID | 2747166 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus ATCC 10987 |
Kingdom | Bacteria |
Replicon accession | NC_003909 |
Strand | - |
Start bp | 4495891 |
End bp | 4499856 |
Gene Length | 3966 bp |
Protein Length | 1321 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637281667 |
Product | triple helix repeat-containing collagen |
Protein accession | NP_981161 |
Protein GI | 42783914 |
COG category | |
COG ID | |
TIGRFAM ID | |
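
The coordinate and length fields above agree with one another, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record for BCE_4868.
length = gene_length_bp(4495891, 4499856)
print(length, protein_length_aa(length))  # prints: 3966 1321
```

Both results match the "Gene Length" and "Protein Length" fields listed in the record.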
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGAGC GCGATAAACA AAATTCATTA AACTCTAATT TCAGAATCTC ACCAAATCTT ATTGGACCTA CCTTTCCTCC TGTTCCAACG GGATTTACTG GTATTGGGAT TACTGGTCCA ACCGGTCCAC AAGGCCCGAC AGGACCTCAA GGGCCAAGGG GATTACAAGG TCCAATGGGA GAGATGGGCC CAACAGGACC TCAAGGAGTA CAAGGAATTC AAGGACCCGT TGGGTCAATA GGTGCAACGG GACCAGAAGG ACAGCAGGGG CCACAAGGAT TAAGGGGACC ACAGGGAGAA ACTGGAGCGA CAGGACCTCA AGGAGTACAA GGATTGCAAG GTCCAGCTGG TCCAACTGGA GCGACAGGAG CACAAGGTAT ACAAGGTATA CAGGGATTAC AAGGGCCCAT TGGAGCTACC GGGCCAGAGG GACCACAAGG AATCCAAGGA GTGCAAGGAT TACCTGGGGC AACTGGTCCA CAAGGAATAC AAGGGGCACA AGGAATGCAA GGGCTACAAG GGCCGAGTGG AAATACAGGG GCAACCGGAG CGACAGGTCA GGGGATAACA GGTCCGACTG GAGTAACAGG TCCGACAGGG ATCACTGGTC CGTCAGGAGG ACCTCCTGGT CCGACGGGGC CAACTGGTGC GACAGGTCCG GGCGGTGGAC CGAGTGGAAG TACAGGAGCA ACAGGAGCAA CGGGAAATAC TGGGGCTACA GGAAGTACGG GGGTAACAGG AAGTACGGGG GTAACAGGAG CGACGGGAAG TACAGGTCCG ACTGGAAGCA CGGGAGCACA GGGCTTGCAA GGAATACAAG GGATTCAAGG GCCAATTGGG CCAACCGGTC CAGAAGGTCC GCAGGGTATT CAAGGGATTC CTGGTCCGAC GGGAGTAACC GGTGAACAAG GAATACAAGG AGTTCAGGGT ATTCAAGGAG CAACGGGTGC AACAGGGGAT CAAGGTCCAC AAGGTATACA GGGGGCTATA GGGCCACAAG GGGCAACAGG GGCCACAGGA GATCAAGGTC CACAAGGAAT ACAAGGAGTA CCAGGGCCAT CAGGAGCAAC AGGCCCACAG GGAGTTCAAG GGCTACAAGG TCCGATGGGT GATATAGGGC CAACAGGTCC AGAAGGCCCA GAGGGACTTC AGGGCCCGCA AGGAATACAA GGTGTGCCAG GGCCAGTTGG AGCGACGGGT CCAGAGGGTC CTCAGGGGAT ACAAGGCATT CAAGGACCTG TAGGAGCAAC TGGCCCACAA GGTCCACAAG GAATTCAGGG AATACAAGGT GTGCAGGGGA TAACGGGAGC AACGGGAGTA CAAGGAGCAA CTGGAATTCA AGGGATACAA GGGGAAATAG GAGCAACAGG TCCAGAGGGG CCCCAAGGAG TGCAAGGTGC TCAAGGAGGG ATTGGTCCAA CCGGTCCGAT GGGGCCCCAA GGAGTGCAAG GAGTACAAGG AATACAAGGA GCGACGGGCG CACAAGGAGT GCAAGGTCCA CAAGGAATAC AAGGAATACA AGGAATACAA GGAATACAAG GTCCGACTGG GGCAACAGGA GATACGGGAG CAACAGGTGC GACAGGGGAA GGCACTACAG GCCCAACAGG AGTAACCGGT CCAACAGGCC CATCTGGAGG ACCTGCCGGA CCGACCGGCC CAACGGGGCC ATCAGGTCCG GCGGGAGTAA CAGGTCCATC TGGTGGACCA CCTGGCCCAA CAGGAGCAAC TGGGGCGACA GGAGTAACAG GAGATACTGG GGCAACAGGC TCAACTGGAG TGACAGGAGC GACGGGAGAA 
ACGGGAGCAA CCGGAGTGAC GGGTTTACAA GGTCCGCAAG GAATCCAAGG AGTGCAAGGA GAGATAGGTC CGACGGGTCC CCAAGGTGTT CAAGGTCCGC AAGGAATTCA AGGAGTAACG GGGGCCACAG GAGATCAAGG TCCGCAAGGG GTTCAAGGCC CACAAGGCGA CATAGGTCCA ACCGGCCCAC AAGGAATTCA AGGCCCACAA GGTTCTCAAG GAATCCAAGG AGCGACAGGG GGAACAGGAG CACAAGGCCC ACAGGGAATC CAAGGTCCGC AAGGTGACGT AGGTCCGACT GGGCCACAAG GTCCAACTGG AATTCAAGGG ATACAAGGAG AGATAGGTCC AACCGGTCCA GAAGGCCCAG AGGGACTTCA GGGTCCGCAA GGAATACAAG GTGTTCAAGG ACCAGTTGGA GCAACGGGTC CAGAGGGTCC TCAGGGGATA CAAGGCATTC AAGGAGTGCA AGGAGCAACA GGCTCACAAG GTCCACAAGG AATTCAGGGA ATCCAAGGTG TGCAAGGGAT AACGGGAGCA ACTGGAGCAC AAGGAGCAAC TGGAATTCAA GGGATACAAG GGGAAATAGG AGCAACAGGT CCAGAGGGCC CACAAGGAGT GCAAGGAGTA CAAGGAGAGA TAGGTCCAAC CGGTCCAATG GGGCCCCAAG GAGTGCAAGG AGTGCAAGGA ATTCAAGGAG CGACGGGCGC ACAAGGAGTG CAAGGTCCAC AAGGAATTCA AGGAATACAA GGTCCGACGG GGGCAACAGG AGAAACGGGA GCAACAGGAG CGACAGGGGA AGGCACTACA GGCCCAACAG GAGTAACCGG TCCAACAGGG GTAACAGGCC CATCTGGAGG ACCTGCCGGA CCGACCGGCC CAACGGGGCC ATCAGGTCCG GCGGGAGTAA CCGGTCCATC TGGTGGACCA CCTGGCCCGA CAGGAGCAAC AGGAGCAACA GGAGCGACAG GAGTAACAGG AGATACCGGG GCGACAGGCT CAACTGGAGT GACAGGAGCG ACAGGAGAAA CGGGAGCAAC CGGAGTGACG GGTTTACAGG GACCGCAAGG AATACAAGGT GTTCAAGGAG AGATAGGTCC AACCGGTCCA CAGGGTATTC AAGGTCCCCA AGGAATCCAA GGAGTAACGG GGGCCACAGG AGCACAAGGT CCACAAGGAA TTCAAGGCCC ACAAGGCGAC ATAGGTCCAA CTGGCCCCCA AGGAATTCAA GGTCCACAAG GCCCTCAAGG AATCCAAGGA GCGACGGGGG CCACAGGAGC ACAAGGTCCA CAGGGAATCC AAGGTCCGCA AGGAGAGATA GGTCCGACTG GCCCACAAGG CCCACAAGGA ATTCAAGGCC CGCAAGGAAT ACAAGGTCCA ACGGGAGCTA CAGGAGCAAC CGGAGCGACA GGTCTCCAAG GAATTCAAGG CCCACAAGGA ATTCAAGGCC CGCAAGGAAT ACAAGGTCCA ACGGGAGCTA CAGGAGCAAC CGGAGCGACA GGTCTCCAAG GAATTCAAGG CCCACAAGGA ATTCAAGGCC CGCAAGGAAT ACAAGGTCCA ACGGGAGCTA CAGGAGCAAC CGGAGCGACA GGTCTCCAAG GAATTCAAGG CCCACAAGGA ATTCAAGGCC CGCAAGGAAT ACAAGGTCCA ACGGGAGCTA CAGGGGCAAC CGGAGCAACA GGTTCACAAG GTCCAACTGG AGATACAGGT CCAACCGGAG CTGGAGCCAC TGGAGCGACT GGGGCGACTG GAGTTAGTAC AACTGCAACG TATGCATTTG CGAATAATAC ATCTGGAACC GCTATTTCCG 
TTTTATTAGG TGGCACGAAT ATTCCGTTAC CAAACAATCA AAATATTGGA CCGGGAATAA CTGTTAGTGG TGGGAATACT GTATTTACAG TTGCAAGTGC AGGGAATTAT TATATAGCTT ATACAATTAA TCTAACGGCA GGATTACTTG TAAGTTCTCG TATAACTGTA AATGGCAGTC CGCTTGCGGG AACGATAAAC GCTCCGACAG TTGCTACTGG TTCATTTAGT GCAACGATCA TTGCTAATTT GCCTGCTGGA GCTGCTATTA GTCTGCAGTT ATTTGGATTA GTTGCAATCG CTACATTATC TACGACAACG CCAGGAGCTA CTTTAACTAT TATTAGATTA AGTTAA
|
Protein sequence | MKERDKQNSL NSNFRISPNL IGPTFPPVPT GFTGIGITGP TGPQGPTGPQ GPRGLQGPMG EMGPTGPQGV QGIQGPVGSI GATGPEGQQG PQGLRGPQGE TGATGPQGVQ GLQGPAGPTG ATGAQGIQGI QGLQGPIGAT GPEGPQGIQG VQGLPGATGP QGIQGAQGMQ GLQGPSGNTG ATGATGQGIT GPTGVTGPTG ITGPSGGPPG PTGPTGATGP GGGPSGSTGA TGATGNTGAT GSTGVTGSTG VTGATGSTGP TGSTGAQGLQ GIQGIQGPIG PTGPEGPQGI QGIPGPTGVT GEQGIQGVQG IQGATGATGD QGPQGIQGAI GPQGATGATG DQGPQGIQGV PGPSGATGPQ GVQGLQGPMG DIGPTGPEGP EGLQGPQGIQ GVPGPVGATG PEGPQGIQGI QGPVGATGPQ GPQGIQGIQG VQGITGATGV QGATGIQGIQ GEIGATGPEG PQGVQGAQGG IGPTGPMGPQ GVQGVQGIQG ATGAQGVQGP QGIQGIQGIQ GIQGPTGATG DTGATGATGE GTTGPTGVTG PTGPSGGPAG PTGPTGPSGP AGVTGPSGGP PGPTGATGAT GVTGDTGATG STGVTGATGE TGATGVTGLQ GPQGIQGVQG EIGPTGPQGV QGPQGIQGVT GATGDQGPQG VQGPQGDIGP TGPQGIQGPQ GSQGIQGATG GTGAQGPQGI QGPQGDVGPT GPQGPTGIQG IQGEIGPTGP EGPEGLQGPQ GIQGVQGPVG ATGPEGPQGI QGIQGVQGAT GSQGPQGIQG IQGVQGITGA TGAQGATGIQ GIQGEIGATG PEGPQGVQGV QGEIGPTGPM GPQGVQGVQG IQGATGAQGV QGPQGIQGIQ GPTGATGETG ATGATGEGTT GPTGVTGPTG VTGPSGGPAG PTGPTGPSGP AGVTGPSGGP PGPTGATGAT GATGVTGDTG ATGSTGVTGA TGETGATGVT GLQGPQGIQG VQGEIGPTGP QGIQGPQGIQ GVTGATGAQG PQGIQGPQGD IGPTGPQGIQ GPQGPQGIQG ATGATGAQGP QGIQGPQGEI GPTGPQGPQG IQGPQGIQGP TGATGATGAT GLQGIQGPQG IQGPQGIQGP TGATGATGAT GLQGIQGPQG IQGPQGIQGP TGATGATGAT GLQGIQGPQG IQGPQGIQGP TGATGATGAT GSQGPTGDTG PTGAGATGAT GATGVSTTAT YAFANNTSGT AISVLLGGTN IPLPNNQNIG PGITVSGGNT VFTVASAGNY YIAYTINLTA GLLVSSRITV NGSPLAGTIN APTVATGSFS ATIIANLPAG AAISLQLFGL VAIATLSTTT PGATLTIIRL S
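
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the grouping can be stripped and a GC percentage computed as follows. The helper names are hypothetical, and how the database rounds its "GC content" field is not stated in the record, so this is an illustration rather than a reproduction of that value. The example input is only the first two blocks of the gene sequence.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace used for block grouping and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence, used only as a small example.
first_blocks = "ATGAAAGAGC GCGATAAACA"
seq = clean_sequence(first_blocks)
print(len(seq), gc_percent(seq))  # prints: 20 40.0
```

Running `gc_percent` on the full cleaned gene sequence would give the whole-gene figure.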