Gene Information |
Locus tag | BCG9842_B1680 |
Symbol | |
ID | 7182161 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus G9842 |
Kingdom | Bacteria |
Replicon accession | NC_011772 |
Strand | - |
Start bp | 3461382 |
End bp | 3464297 |
Gene Length | 2916 bp |
Protein Length | 971 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 643551361 |
Product | putative microbial collagenase |
Protein accession | YP_002447031 |
Protein GI | 218898620 |
COG category | [R] General function prediction only |
COG ID | [COG3291] FOG: PKD repeat |
TIGRFAM ID | |
|
|
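The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only and assumes that Start bp and End bp are 1-based inclusive positions and that the gene span includes the stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose span includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 3461382, End bp 3464297.
length_bp = gene_length(3461382, 3464297)
print(length_bp)                  # 2916, matching "Gene Length"
print(protein_length(length_bp))  # 971, matching "Protein Length"
```

Under those assumptions, the 2916 bp span corresponds to 972 codons, one of which is the stop codon, leaving the 971 aa reported as Protein Length.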
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0340823 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 1.3122e-17 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAGGCT ATTCAAAAAA AATGTTAGTA GGGGTAAGTT TTGCTAGTTT AATGTTAGGG AGTTTTCAAG GGGTAAGTTT GGCAGAAGAT ACTAAGGGAG AGCAAGTTTC TTATCGAAAT GTACTCAAAA TGGAGCCAGT CGGTGTACAA CTACCAGTAC AAGAATTAGC TCATTCATCG AAATTGCTCG AAAGCAAGTC TTTTGAGCAA AGGATACAAT TTGCTGATTT ATCACAAAGA CCGCCTGAGG TAAAAAAGGA AAGTAAGCAA TTAGCTGTAG CGAAAACTTA TACAATTGCT GAATTAAATC AATTAAGCAA TCAGCAGTTA GTAGATTTAC TTATAACCAT TGATTGGGAG CAAATTACGG GGCTATTTCA GTTTAATACA GACAGTCTAG CATTCTATCA AAATGATAGT AGAATGCAGG CGATTATTGA TAAATTGAAG CAGCAAGGAC AAGCCTATAC GAAGGATGAT TCAAAAGGGA TTGAGACATT AGTAGAGGTA TTACGTTCAG GATTTTATTT AGGATTTTAT AATTCGGAAC TTAGTAAATT AAATGAACGG AGTTATCATG ATAAATGCTT ACCTGCATTA AAAGCGATAG CTAACAATTC AAATTTCAAG CTGGGTACAT TAGAGCAAAA TAGAGTGGTA TCATCATACG GAAAGTTAAT AGGAAACGCT TCGAGTGATG TGGAAACGAT AACATCGGCT GCAAAGATTT TTAAACAATA TAATGATAAT TTCTCTACAT TAGTAGATAA TCTTTCAGCT GGAAATGCTA TTTACGATAT TATGCAAGGT GTTGATTACG ACATTCAATC GTATTTGTAC GATACGAGGA AAGCACCGAA AGATACAGTA TGGTATCAGA AAATAGATAG TTATATTAAT GAACTAAGTC GTTTTGCATT AATGGGAACG ATTACCGCAA AAAACGGTTG GCTTATTAAT AATGGCATTT ATTATACAGG TAGGCTCGGT TCGTTCCATA GTACAGGAAC GAAAGGATTA CAAGTTGTAA CAGATGCGAT GAAAATTTAT CCGTATTTAG GGGAGCAATA TTTCGTAGCT GCTGAGCAAA TCGCGACGAA TTATGGCGGA AAAGATGCAA ATGGTAAAGT GGTGAATTTA GATCAAATAA GAGAAGATGG AAAGAAAAAA TATTTGCCAA AAACGTATAT GTTTGATGAT GGGGCAATTG TTTTAAAAGC TGGAGATAAA GTGACAGAAG AAAAAGTAAA ACGTTTATAT TGGGCGGCAA AAGAAGTGAA GGCACAATTC CATCGTACGG TTGAAAGTGA TCAACCTTTA GAAAAAGGAA ATCCTGATGA TGTATTAACG ATGGTTATTT ATAATAGCCC AGCAGAATAT CAGTTTAATC GTCAATTGTA CGGATATGAA ACGAATAACG GTGGTCTTTA TATAGAAGGA ACAGGTACGT TCTTTACTTA TGAGCGTACG CCAGAAGAAA GTATTTATAG TTTAGAAGAA TTATTCCGAC ACGAGTTCAC GCATTATTTA CAAGGTAGAT ATGAAGTACC AGGGTTATGG GGTCAAGGGA AAATATATGA AAATGAAAGA TTATCTTGGT TTGAAGAAGG GAATGCTGAA TTTTTTGCAG GGGCAACGAG AACGGATAAT GTTGTACCGA GAAAGAGTAT TATAGGAGGA TTATCTTCAA ACCCAGCAGA GCGTTATACG GCAGAGCGAA CGTTAAATGC AAAGTATGGA ACATGGGATT TCTATAATTA TTCCTTTGCT TTACAATCGT ATATGTACAA TAAGAGATAT 
GATATGTTTG ATAAAATACA TGATCTTATT AGGAAAAATG ATGTAACAGC ATATGATGCA TATCGCTCTA CATTAAGTAA AGATGCGAAT TTAAATAAAG AATATCAAGA TTATATGCAA ATGCTATTTG ATAATCGTGA GAAATATAAC GTACCATTAG TATCAGACGA TTATTTAGCA AATCATGCAC CGAAACCAGT TTCAGATATT GCGGCAGAAA TAACGGCGGA AGCAAAATTG AAAAATGTAT CAGTGAAGAA AAATAAATCA AAGTTCTTTA ATACATTCAC ACTACAAGGA ACATATACAG GAACTGCAGC AAAAGGAGAA TATGAAGATT GGAAAATAAT TACACAAAAC GTTAATGATA CGTTAAAACG TTTAAGCGCA AAAGAATGGA CAGGCTATAA AACGGTAACA GCATACTTCG TAAATTATCG CGTGAATGCA TCAGGACAAT TTGAATATGA TGTTGTGTTC CACGGTATTA ATACAGAAGA AGGTGCAGAG AATAAAGCGC CAGTTGCGGT TATAAACGGT CCGTATAGTG GAAATGTGAA TGAAGCAATT TCATTTAAAA GCGACGCATC AAAAGATGAA GACGGGAAAA TTACTTCATA TAAATGGGAG TTTGGAGACG GTACTGTAAG TAATGAGCAA AATCCAACTC ACGTATATAC AAAAGAAGGA GCGTATACAG CGAAATTAAC AGTAACAGAC GATAAAGGAG CAACTAATAC TGCTACAGCG ACTGTAACTG TTCAAAAGAA AGAAGATAAC AGTTTAGAAA AAGAACCGAA TAACTCATTC CAAACAGCAA ACAAACTGCA GTTAAATCAA GTTTTACGTG CTAGTTTAGG AAATGGAGAT ACAAGTGATT TCTTTGAAAT TAATGTGGAG ACTGCTAAAA ATCTTCAAAT TAACGTAACG AATGAAAATA ATATCGGAAT GAACTGGGTA CTTTATTCTG AAGCAGATTT AAATAATTAT GTTACGTATG CTCAGCAACA AGGGAATAAA CTAGTAGGTA GTTACTACAC GTATCCAGGG AAGTATTATT TACATGTGTA TCAGTATGGT GGTGGAACAG GGAATTATAC GGTGGAAGTA AAGTAG
|
Protein sequence | MKGYSKKMLV GVSFASLMLG SFQGVSLAED TKGEQVSYRN VLKMEPVGVQ LPVQELAHSS KLLESKSFEQ RIQFADLSQR PPEVKKESKQ LAVAKTYTIA ELNQLSNQQL VDLLITIDWE QITGLFQFNT DSLAFYQNDS RMQAIIDKLK QQGQAYTKDD SKGIETLVEV LRSGFYLGFY NSELSKLNER SYHDKCLPAL KAIANNSNFK LGTLEQNRVV SSYGKLIGNA SSDVETITSA AKIFKQYNDN FSTLVDNLSA GNAIYDIMQG VDYDIQSYLY DTRKAPKDTV WYQKIDSYIN ELSRFALMGT ITAKNGWLIN NGIYYTGRLG SFHSTGTKGL QVVTDAMKIY PYLGEQYFVA AEQIATNYGG KDANGKVVNL DQIREDGKKK YLPKTYMFDD GAIVLKAGDK VTEEKVKRLY WAAKEVKAQF HRTVESDQPL EKGNPDDVLT MVIYNSPAEY QFNRQLYGYE TNNGGLYIEG TGTFFTYERT PEESIYSLEE LFRHEFTHYL QGRYEVPGLW GQGKIYENER LSWFEEGNAE FFAGATRTDN VVPRKSIIGG LSSNPAERYT AERTLNAKYG TWDFYNYSFA LQSYMYNKRY DMFDKIHDLI RKNDVTAYDA YRSTLSKDAN LNKEYQDYMQ MLFDNREKYN VPLVSDDYLA NHAPKPVSDI AAEITAEAKL KNVSVKKNKS KFFNTFTLQG TYTGTAAKGE YEDWKIITQN VNDTLKRLSA KEWTGYKTVT AYFVNYRVNA SGQFEYDVVF HGINTEEGAE NKAPVAVING PYSGNVNEAI SFKSDASKDE DGKITSYKWE FGDGTVSNEQ NPTHVYTKEG AYTAKLTVTD DKGATNTATA TVTVQKKEDN SLEKEPNNSF QTANKLQLNQ VLRASLGNGD TSDFFEINVE TAKNLQINVT NENNIGMNWV LYSEADLNNY VTYAQQQGNK LVGSYYTYPG KYYLHVYQYG GGTGNYTVEV K
|
| |
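As a rough cross-check between the Gene sequence and Protein sequence fields, the start of the gene can be translated by hand. The sketch below is a minimal illustration, not the database's own method: it builds a codon table whose amino acid assignments are those of the standard genetic code (which translation table 11 shares, differing only in permitted start codons), strips the space grouping used in the record, and translates until the first stop codon. The sequence as shown begins with ATG and ends with TAG, so it is treated here as already given in coding orientation. The names `CODON_TABLE`, `translate`, and `GENE_PREFIX` are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments of the standard code,
# which translation table 11 shares ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Trailing bases that do not form a full codon are dropped.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the Gene sequence field, copied with its space grouping.
GENE_PREFIX = "ATGAAAGGCT ATTCAAAAAA AATGTTAGTA"
print(translate(GENE_PREFIX))  # MKGYSKKMLV, the first 10 residues of the protein
```

The same function can be applied to the full Gene sequence field after joining its two lines; the final three codons, GTA AAG TAG, give the terminal residues V and K followed by the stop.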