Gene BCE_3539 details


Gene Information

Locus tag: BCE_3539
Symbol:
ID: 2747984
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cereus ATCC 10987
Kingdom: Bacteria
Replicon accession: NC_003909
Strand:
Start bp: 3290215
End bp: 3293130
Gene Length: 2916 bp
Protein Length: 971 aa
Translation table: 11
GC content: 36%
IMG OID: 637280340
Product: collagenase, putative
Protein accession: NP_979836
Protein GI: 42782589
COG category: [R] General function prediction only
COG ID: [COG3291] FOG: PKD repeat
TIGRFAM ID:
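
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the gene length includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues, assuming the CDS length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3290215, 3293130)
print(length_bp)                  # 2916
print(protein_length(length_bp))  # 971
```

Both results match the Gene Length (2916 bp) and Protein Length (971 aa) fields.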


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGGCT ATTCAAAAAA AGTGTTAGTA GGGGTAAGCT TTGCTAGTTT CATGTTAGGG 
AGTTTTCAAG GGGGCGCATT GGCAGAAGGT ACAAAGGGAG AGCAAGTTTC ATATCGGAAT
GTGCTCAAAA TGGAGCCGGT TGGTGTACAA TTACCAGTGC AAGAATTAGC TCATTCATCA
AAAGTGTTGG AAAATAAGTC TTTTGAGAAA AGGCTACAAT TTGCTGATTT GTCACAAAGA
CCGCCTGAAG TAAAAAAGAA AAGTAAACAA TTAACCGCAG CGAAAACGTA TACAATTGCT
GAATTAAATC AATTGAGCAA TCAGCAGTTA GTGGATTTAC TTGTAACAAT TGATTGGGAG
CAAATTACTG GGCTATTTCA GTTTAATAAG GATAGTCTTG CATTCTATCA AAATGATAGT
AGAATGCAGG CAATTATTGA TAAATTGAAG CAGCAAGGAC AAGCTTATAC GAAGGATGAT
TCAAAAGGGA TTGAGACTTT AGTAGAGGTA TTACGCTCAG GGTTTTATCT AGGATTTTAT
CATACAGAAT TAAGTAAACT AAATGATCGA AGCTATCATG ATAAATGCTT ACCTGCATTA
AAAACGATTG CGAATAACCC GAATTTCAAA CTCGGTACGT TAGAACAAAA TAGAGTTGTC
TCATCATACG GAAAATTAAT AGGAAATGCT TCGAGTGATG TGGAAACGAT AACATCAGCT
GCAAAGATTT TTAAACAATA TAACGATAAT TTTTCTACAT TGGTAGACAA TCTATCAGCT
GGAAATGCGA TTTACGATAT TATGCAAGGT GTTGATTACG ATATTCAATC GTATTTATAC
GATACGAGAA AAGCACCGAA AGATACAATG TGGTATCAAA AAATTGATAG CTATATTAAT
GAATTAAGTC GTTTTGCCTT AATTGGAACG GTGACAGCGA AAAATGGTTG GTTAATTAAT
AATGGTATTT ATTATACAGG TAGACTTGGT ACGTTCCATA GTACAGGAAC GAAAGGGTTG
CAAGTTGTAA CGGACGCAAT GAAAATGTAT CCGTATTTAG GAGAGCAATA TTTCGTAGCT
GCTGAGCAAA TTGCGACGAA TTATGGCGGG AAAGATGCGA ATGGTAAAGT GGTTGATCTA
GATCAAATAA GAGAAGATGG TAAGAAAAAA TATTTACCGA AAACGTATAC GTTCGATGAT
GGAGCAATTG TGTTAAAAGC TGGAGATAAG GTGACAGAAG AAAAAGTAAA ACGTCTATAT
TGGGCGGCAA AAGAAGTGAA GGCACAATTC CATCGTACGG TTGAAAGTGA CCAGCCGTTA
GAAAAAGGCA ATCCTGATGA TGTATTAACA ATGGTTATTT ATAATAGCCC ATCTGAATAT
CAATTTAACC GTCAATTATA CGGATATGAA ACGAATAACG GCGGTCTTTA TATAGAAGGA
ACAGGTACGT TCTTTACTTA TGAACGTACG CCAGAAGAAA GTATTTATAG TTTAGAAGAA
TTGTTCCGAC ATGAGTTCAC ACATTACTTA CAAGGTAGAT ATGAAGTGCC AGGACTTTGG
GGACAAGGGA AGATGTATGA GAATGAGAGA TTATCTTGGT TTGAAGAAGG GAATGCGGAG
TTTTTTGCAG GGGCAACGAG AACAGATAAT GTTGTACCGA GAAAGAGCAT TATAGGAGGA
ATATCTTCAA ATCCAGCAGA ACGTTATACA GCAGAGAGAA CGTTAAACGC AAAATATGGA
ACGTGGGATT TCTATAATTA TTCCTTCGCT TTACAATCGT ACATGTACAA TAAGAGATAT
GATATGTTTG ACAAAATTCA TGATCTTATT AGGAAAAATG ATGTAACAGC ATATGATGCA
TACCGCTCTG CTTTAAGTAA AGATGCGAAT TTAAATAAAG AATATCARGA TTATATGCAA
ATGTTAGTAG ATAACCGTGA GAAATATAAT GTTCCGTTAG TATCAGATGA TTATTTAGCA
ACTCACGCAC CGAAACCAGT TTCAGATATT GCGGCAGAAA TTACAGCAGA AGCAAAATTA
AATAATGTAT CAGTTAAGAA AAATAAATCA CAGTTCTTTC ATACATTTAC ACTACAAGGA
ACATACACAG GTACTACTGC AAAAGGAGAA TATGAAGACT GGAAGACAAT TACACAAAAC
GTGAATGATA CGTTAAAACG TTTAAGTGCG AAAGAATGGA CAGGCTATAA AACAGTAACA
GCATACTTCG TAAACTATCG TGTGAATGCA GCAGGACAAT TTGAGTATGA TGTTGTATTC
CATGGTATTA ATACAGAAGA AGGTGCTGTG AATAAAGCGC CAGTTGCGGT TATAAATGGT
CCATATAGCG GAAAGGTAAA TGAAGCAATT TCGTTTAAAA GCGATGGATC AAAAGATGAA
GATGGGAAAA TCATTTCGTA TAAATGGGAG TTTGGCGATG GAGCAGTAAG TGATGAGCAA
AATCCGACTC ACGTGTATAC AAAAGAAGGA ACATATACAG CGAAATTAAC AGTAACAGAT
GACAAAGGAT TAACGAATAC TGCTACAACG AATGTAACGG TTCAAAAGAA AGAAGATAAC
AGTGTGGAAA AAGAGCCGAA TAACTCATTT CAAACAGCAA ATAAACTGCA GCTAAATCAA
ATTTTACGTG CTAGTTTAGG AAATGGCGAT ACGAGTGATT TCTTTGAAAT TAATGTGGAT
ACTGCTAAAA ACCTTCAAAT TAACGTAACG AATGAAAATA ATATCGGAAT GAACTGGGTT
CTTTATTCGG AAGCAGATTT AAATAATTAT GTTACGTATG CACAGCAAGA AGGGAATAAG
TTAGTAGGAA GTTACTACAC GTATCCAGGA AAGTATTACT TACATGTGTA TCAGTATAGC
GGGGGAACAG GGAATTATAC GGTAGAAGTG AAATAG
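
The gene sequence is laid out in space-separated blocks of 10 bases, and it contains one IUPAC ambiguity code (R, meaning A or G). A minimal sketch of how a GC fraction could be computed from text in this layout follows. It assumes that ambiguity codes are left out of both numerator and denominator; the convention behind the record's own GC content value is not stated, and `gc_content` is a hypothetical helper name.

```python
def gc_content(seq: str) -> float:
    """GC fraction over unambiguous bases; whitespace is ignored.

    Assumption: IUPAC ambiguity codes such as R are excluded from the count.
    """
    bases = "".join(seq.split()).upper()
    gc = bases.count("G") + bases.count("C")
    at = bases.count("A") + bases.count("T")
    if gc + at == 0:
        raise ValueError("no unambiguous bases in sequence")
    return gc / (gc + at)


# First line of the gene sequence, pasted in the same 10-base block layout.
first_line = "ATGAAAGGCT ATTCAAAAAA AGTGTTAGTA GGGGTAAGCT TTGCTAGTTT CATGTTAGGG"
print(f"{gc_content(first_line):.1%}")
```

Pasting the complete gene sequence into the same function gives a value to compare against the GC content field.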
 
Protein sequence
MKGYSKKVLV GVSFASFMLG SFQGGALAEG TKGEQVSYRN VLKMEPVGVQ LPVQELAHSS 
KVLENKSFEK RLQFADLSQR PPEVKKKSKQ LTAAKTYTIA ELNQLSNQQL VDLLVTIDWE
QITGLFQFNK DSLAFYQNDS RMQAIIDKLK QQGQAYTKDD SKGIETLVEV LRSGFYLGFY
HTELSKLNDR SYHDKCLPAL KTIANNPNFK LGTLEQNRVV SSYGKLIGNA SSDVETITSA
AKIFKQYNDN FSTLVDNLSA GNAIYDIMQG VDYDIQSYLY DTRKAPKDTM WYQKIDSYIN
ELSRFALIGT VTAKNGWLIN NGIYYTGRLG TFHSTGTKGL QVVTDAMKMY PYLGEQYFVA
AEQIATNYGG KDANGKVVDL DQIREDGKKK YLPKTYTFDD GAIVLKAGDK VTEEKVKRLY
WAAKEVKAQF HRTVESDQPL EKGNPDDVLT MVIYNSPSEY QFNRQLYGYE TNNGGLYIEG
TGTFFTYERT PEESIYSLEE LFRHEFTHYL QGRYEVPGLW GQGKMYENER LSWFEEGNAE
FFAGATRTDN VVPRKSIIGG ISSNPAERYT AERTLNAKYG TWDFYNYSFA LQSYMYNKRY
DMFDKIHDLI RKNDVTAYDA YRSALSKDAN LNKEYQDYMQ MLVDNREKYN VPLVSDDYLA
THAPKPVSDI AAEITAEAKL NNVSVKKNKS QFFHTFTLQG TYTGTTAKGE YEDWKTITQN
VNDTLKRLSA KEWTGYKTVT AYFVNYRVNA AGQFEYDVVF HGINTEEGAV NKAPVAVING
PYSGKVNEAI SFKSDGSKDE DGKIISYKWE FGDGAVSDEQ NPTHVYTKEG TYTAKLTVTD
DKGLTNTATT NVTVQKKEDN SVEKEPNNSF QTANKLQLNQ ILRASLGNGD TSDFFEINVD
TAKNLQINVT NENNIGMNWV LYSEADLNNY VTYAQQEGNK LVGSYYTYPG KYYLHVYQYS
GGTGNYTVEV K
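
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below uses the standard codon assignments, which, to the best of my knowledge, translation table 11 shares for all non-initiation codons (the two tables differ in the set of permitted start codons). An ambiguous codon is resolved only when every expansion encodes the same amino acid, otherwise it is reported as X. All names are illustrative.

```python
from itertools import product

# Standard codon assignments in TCAG order; assumed to match table 11
# for every codon that is not being used as an alternative start codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# IUPAC nucleotide codes and the bases each one stands for.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def translate_codon(codon: str) -> str:
    """Translate one codon; return 'X' if its ambiguity changes the residue."""
    expansions = product(*(IUPAC[base] for base in codon))
    residues = {CODON_TABLE["".join(c)] for c in expansions}
    return residues.pop() if len(residues) == 1 else "X"


def translate(seq: str) -> str:
    """Translate a DNA string (whitespace ignored); '*' marks a stop codon."""
    bases = "".join(seq.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(translate_codon(bases[i:i + 3]) for i in range(0, len(bases), 3))


# First line of the gene sequence: matches the start of the protein sequence.
print(translate("ATGAAAGGCT ATTCAAAAAA AGTGTTAGTA GGGGTAAGCT TTGCTAGTTT CATGTTAGGG"))
# MKGYSKKVLVGVSFASFMLG

# The codon containing the ambiguity code R (CAR) still resolves to Q.
print(translate("GAATATCARGAT"))  # EYQD
```

Because CAA and CAG both encode glutamine, the R in the gene sequence does not introduce any ambiguity into the protein sequence.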