Gene BCG9842_B1680 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCG9842_B1680 
Symbol 
ID7182161 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus G9842 
KingdomBacteria 
Replicon accessionNC_011772 
Strand
Start bp3461382 
End bp3464297 
Gene Length2916 bp 
Protein Length971 aa 
Translation table11 
GC content35% 
IMG OID643551361 
Productputative microbial collagenase 
Protein accessionYP_002447031 
Protein GI218898620 
COG category[R] General function prediction only 
COG ID[COG3291] FOG: PKD repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0340823 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value1.3122e-17 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAGGCT ATTCAAAAAA AATGTTAGTA GGGGTAAGTT TTGCTAGTTT AATGTTAGGG 
AGTTTTCAAG GGGTAAGTTT GGCAGAAGAT ACTAAGGGAG AGCAAGTTTC TTATCGAAAT
GTACTCAAAA TGGAGCCAGT CGGTGTACAA CTACCAGTAC AAGAATTAGC TCATTCATCG
AAATTGCTCG AAAGCAAGTC TTTTGAGCAA AGGATACAAT TTGCTGATTT ATCACAAAGA
CCGCCTGAGG TAAAAAAGGA AAGTAAGCAA TTAGCTGTAG CGAAAACTTA TACAATTGCT
GAATTAAATC AATTAAGCAA TCAGCAGTTA GTAGATTTAC TTATAACCAT TGATTGGGAG
CAAATTACGG GGCTATTTCA GTTTAATACA GACAGTCTAG CATTCTATCA AAATGATAGT
AGAATGCAGG CGATTATTGA TAAATTGAAG CAGCAAGGAC AAGCCTATAC GAAGGATGAT
TCAAAAGGGA TTGAGACATT AGTAGAGGTA TTACGTTCAG GATTTTATTT AGGATTTTAT
AATTCGGAAC TTAGTAAATT AAATGAACGG AGTTATCATG ATAAATGCTT ACCTGCATTA
AAAGCGATAG CTAACAATTC AAATTTCAAG CTGGGTACAT TAGAGCAAAA TAGAGTGGTA
TCATCATACG GAAAGTTAAT AGGAAACGCT TCGAGTGATG TGGAAACGAT AACATCGGCT
GCAAAGATTT TTAAACAATA TAATGATAAT TTCTCTACAT TAGTAGATAA TCTTTCAGCT
GGAAATGCTA TTTACGATAT TATGCAAGGT GTTGATTACG ACATTCAATC GTATTTGTAC
GATACGAGGA AAGCACCGAA AGATACAGTA TGGTATCAGA AAATAGATAG TTATATTAAT
GAACTAAGTC GTTTTGCATT AATGGGAACG ATTACCGCAA AAAACGGTTG GCTTATTAAT
AATGGCATTT ATTATACAGG TAGGCTCGGT TCGTTCCATA GTACAGGAAC GAAAGGATTA
CAAGTTGTAA CAGATGCGAT GAAAATTTAT CCGTATTTAG GGGAGCAATA TTTCGTAGCT
GCTGAGCAAA TCGCGACGAA TTATGGCGGA AAAGATGCAA ATGGTAAAGT GGTGAATTTA
GATCAAATAA GAGAAGATGG AAAGAAAAAA TATTTGCCAA AAACGTATAT GTTTGATGAT
GGGGCAATTG TTTTAAAAGC TGGAGATAAA GTGACAGAAG AAAAAGTAAA ACGTTTATAT
TGGGCGGCAA AAGAAGTGAA GGCACAATTC CATCGTACGG TTGAAAGTGA TCAACCTTTA
GAAAAAGGAA ATCCTGATGA TGTATTAACG ATGGTTATTT ATAATAGCCC AGCAGAATAT
CAGTTTAATC GTCAATTGTA CGGATATGAA ACGAATAACG GTGGTCTTTA TATAGAAGGA
ACAGGTACGT TCTTTACTTA TGAGCGTACG CCAGAAGAAA GTATTTATAG TTTAGAAGAA
TTATTCCGAC ACGAGTTCAC GCATTATTTA CAAGGTAGAT ATGAAGTACC AGGGTTATGG
GGTCAAGGGA AAATATATGA AAATGAAAGA TTATCTTGGT TTGAAGAAGG GAATGCTGAA
TTTTTTGCAG GGGCAACGAG AACGGATAAT GTTGTACCGA GAAAGAGTAT TATAGGAGGA
TTATCTTCAA ACCCAGCAGA GCGTTATACG GCAGAGCGAA CGTTAAATGC AAAGTATGGA
ACATGGGATT TCTATAATTA TTCCTTTGCT TTACAATCGT ATATGTACAA TAAGAGATAT
GATATGTTTG ATAAAATACA TGATCTTATT AGGAAAAATG ATGTAACAGC ATATGATGCA
TATCGCTCTA CATTAAGTAA AGATGCGAAT TTAAATAAAG AATATCAAGA TTATATGCAA
ATGCTATTTG ATAATCGTGA GAAATATAAC GTACCATTAG TATCAGACGA TTATTTAGCA
AATCATGCAC CGAAACCAGT TTCAGATATT GCGGCAGAAA TAACGGCGGA AGCAAAATTG
AAAAATGTAT CAGTGAAGAA AAATAAATCA AAGTTCTTTA ATACATTCAC ACTACAAGGA
ACATATACAG GAACTGCAGC AAAAGGAGAA TATGAAGATT GGAAAATAAT TACACAAAAC
GTTAATGATA CGTTAAAACG TTTAAGCGCA AAAGAATGGA CAGGCTATAA AACGGTAACA
GCATACTTCG TAAATTATCG CGTGAATGCA TCAGGACAAT TTGAATATGA TGTTGTGTTC
CACGGTATTA ATACAGAAGA AGGTGCAGAG AATAAAGCGC CAGTTGCGGT TATAAACGGT
CCGTATAGTG GAAATGTGAA TGAAGCAATT TCATTTAAAA GCGACGCATC AAAAGATGAA
GACGGGAAAA TTACTTCATA TAAATGGGAG TTTGGAGACG GTACTGTAAG TAATGAGCAA
AATCCAACTC ACGTATATAC AAAAGAAGGA GCGTATACAG CGAAATTAAC AGTAACAGAC
GATAAAGGAG CAACTAATAC TGCTACAGCG ACTGTAACTG TTCAAAAGAA AGAAGATAAC
AGTTTAGAAA AAGAACCGAA TAACTCATTC CAAACAGCAA ACAAACTGCA GTTAAATCAA
GTTTTACGTG CTAGTTTAGG AAATGGAGAT ACAAGTGATT TCTTTGAAAT TAATGTGGAG
ACTGCTAAAA ATCTTCAAAT TAACGTAACG AATGAAAATA ATATCGGAAT GAACTGGGTA
CTTTATTCTG AAGCAGATTT AAATAATTAT GTTACGTATG CTCAGCAACA AGGGAATAAA
CTAGTAGGTA GTTACTACAC GTATCCAGGG AAGTATTATT TACATGTGTA TCAGTATGGT
GGTGGAACAG GGAATTATAC GGTGGAAGTA AAGTAG
 
Protein sequence
MKGYSKKMLV GVSFASLMLG SFQGVSLAED TKGEQVSYRN VLKMEPVGVQ LPVQELAHSS 
KLLESKSFEQ RIQFADLSQR PPEVKKESKQ LAVAKTYTIA ELNQLSNQQL VDLLITIDWE
QITGLFQFNT DSLAFYQNDS RMQAIIDKLK QQGQAYTKDD SKGIETLVEV LRSGFYLGFY
NSELSKLNER SYHDKCLPAL KAIANNSNFK LGTLEQNRVV SSYGKLIGNA SSDVETITSA
AKIFKQYNDN FSTLVDNLSA GNAIYDIMQG VDYDIQSYLY DTRKAPKDTV WYQKIDSYIN
ELSRFALMGT ITAKNGWLIN NGIYYTGRLG SFHSTGTKGL QVVTDAMKIY PYLGEQYFVA
AEQIATNYGG KDANGKVVNL DQIREDGKKK YLPKTYMFDD GAIVLKAGDK VTEEKVKRLY
WAAKEVKAQF HRTVESDQPL EKGNPDDVLT MVIYNSPAEY QFNRQLYGYE TNNGGLYIEG
TGTFFTYERT PEESIYSLEE LFRHEFTHYL QGRYEVPGLW GQGKIYENER LSWFEEGNAE
FFAGATRTDN VVPRKSIIGG LSSNPAERYT AERTLNAKYG TWDFYNYSFA LQSYMYNKRY
DMFDKIHDLI RKNDVTAYDA YRSTLSKDAN LNKEYQDYMQ MLFDNREKYN VPLVSDDYLA
NHAPKPVSDI AAEITAEAKL KNVSVKKNKS KFFNTFTLQG TYTGTAAKGE YEDWKIITQN
VNDTLKRLSA KEWTGYKTVT AYFVNYRVNA SGQFEYDVVF HGINTEEGAE NKAPVAVING
PYSGNVNEAI SFKSDASKDE DGKITSYKWE FGDGTVSNEQ NPTHVYTKEG AYTAKLTVTD
DKGATNTATA TVTVQKKEDN SLEKEPNNSF QTANKLQLNQ VLRASLGNGD TSDFFEINVE
TAKNLQINVT NENNIGMNWV LYSEADLNNY VTYAQQQGNK LVGSYYTYPG KYYLHVYQYG
GGTGNYTVEV K