Gene BCZK0466 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCZK0466 
SymbolcolA 
ID3022291 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus E33L 
KingdomBacteria 
Replicon accessionNC_006274 
Strand
Start bp543776 
End bp546673 
Gene Length2898 bp 
Protein Length965 aa 
Translation table11 
GC content34% 
IMG OID637544683 
Productmicrobial collagenase 
Protein accessionYP_082073 
Protein GI52144755 
COG category[R] General function prediction only 
COG ID[COG3291] FOG: PKD repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0113001 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAGA AATCAAAAAT CAATAAAGTT ATGCTTAGCA TTAGTACAAT GGCTCTATCA 
TTAGGGGCGC TTCAAGCTCC TGTATCAGCG GAAGAAAAAG TACCGTATAA TGTGTTGAAA
ACGAAACCAG TTGGAATTGA AAAACCAGTA GATGAGATTG GACACGTTTC TAAAGCGGAG
GAAACATTAT CGTTTCAAGA ACGGTTAAAA GTAGGAGACT TTTCACAGCG ACCAGCATCT
ATTACGAACA AAGCGACAGT AAAGCAAGTT AAAGAAAGCT ATTCAATGGC TGATTTAAAC
AAAATGAATA ATCAAGAATT AGTTGAAACG TTAGGCAGTA TTAAGTGGCA TCAAATTACA
GACTTATTCC AGTTTAATGA AGATGCAAAA GCTTTTTATA AAGATAAAGG GAAAATGCAA
GTCGTTATAG ATGAATTAGC TCATCGGGGT AGTACATTTA CGAAAGATGA TTCAAAAGGA
ATTCAAACGT TTACGGAAGT GCTACGTTCA GCTTTTTATC TGGCATTTTA TAATAACGAA
TTAAGTGAAT TAAATGAAAG AAGTTTCCAG GACAAATGTT TACCTGCTTT AAAAGCAATC
GCAAAAAATC CAAATTTTAA GCTTGGTACA GCTGAACAAG ATACAGTCGT ATCTGCATAC
GGTAAATTAA TTAGTAATGC GTCAAGCGAT GTTGAAACAG TTCAATATGC ATCGAATATT
TTAAAGCAAT ACAATGATAA TTTTACTACA TATGTAAATG ATCGAATGAA GGGACAAGCA
ATATACGATA TTATGCAAGG GATTGACTAT GATATACAGT CTTACTTAAT TGAAGCTCGT
AAAGAAGCGA ATGAAACGAT GTGGTACGGG AAAGTAGATG GTTTTATTAA TGAAATAAAT
CGTATTGCTC TTTTAAATGA AGTAACACAA GAAAATAAAT GGCTCGTTAA TAATGGAATT
TATTTTGCAA GTCGTTTAGG GAAGTTTCAC AGTAATCCAA ATAAAGGTTT AGAAGTTGTT
ACACAAGCAA TGCATATGTA TCCGCGCTTA AGTGAGCCGT ACTTTGTCGC AGTAGAACAA
ATTACAACAA ATTATAATGG AAAAGATTAT AGCGGGAATA CAGTAGATTT AGAGAAAATA
CGTAAAGAAG GAAAAGAGCA GTACTTACCA AAAACGTATA CATTTGATGA TGGATCTATC
GTATTTAAAA CAGGAGATAA AGTATCGGAA GAAAAAATTA AGAGACTATA CTGGGCTGCA
AAGGAAGTAA AAGCACAGTA TCATCGTGTA ATTGGGAATG ACAAAGCGTT AGAGCCGGGC
AATGCAGATG ATATATTAAC GATAGTAATT TATAACAGTC CAGAAGAGTA CCAGTTAAAT
AGACAACTGT ATGGATACGA AACAAATAAC GGTGGAATTT ATATTGAAGA AACAGGAACA
TTCTTTACAT ATGAGCGCAC GCCAGAACAA AGCATTTATA GTTTAGAAGA GTTATTCCGT
CATGAGTTTA CTCATTATCT TCAAGGGAGA TATGAAGTGC CAGGTTTATT CGGAAGAGGA
GATATGTATC AAAATGAAAG GTTAACTTGG TTCCAAGAAG GCAATGCAGA GTTTTTCGCA
GGGGCTACTC GAACAAATAA TGTAGTGCCG AGAAAGAGCA TCATTAGTGG ATTATCATCT
GATCCTGCAA GCCGTTATAC AGCAGAGCGC ACATTATTTG CTAAATATGG TTCTTGGGAT
TTCTATAATT ACTCGTTCGC ATTGCAATCT TACTTATATA CACATCAATT TGAAACGTTT
GATAAAATTC AAGATTTAAT TCGTGCAAAT GACGTAAAAA ATTATGATGC CTATCGTGAG
AATTTAAGTA AAGACCTTAA ACTAAACGAA GAGTATCAAG AGTATATGCA GCACCTAATC
GATAATCAAG ATAAATATAA TGTACCGGAA GTAGCAGATG ATTATTTAGC TGAACATGCC
CCAAAATCAT TAACGGCAGT AGAGAAAGAA ATTACTGAAA CGTTGCCGAT GAAAGATGCA
AAAATGACAA AACATAGCTC CCAATTCTTT AATACATTTA CATTAGAAGG TACGTATACA
GGTAGTGTAA CAAAAGGTGA GTCAGAAGAT TGGAACGCAA TGAGTAAGAA AGTAAATGAA
GCTTTGGAAC AACTTGCGCA AAAAGAATGG AGTGGCTATA AAACTGTTAC AGCATATTTT
GTCAATTACC GTGTAAATAG CTCAAATCAA TTTGAATATG ATGTAGTCTT CCACGGTATC
GCAAAAGATG ACGAAGAAAA TAAAGCTCCA ACGGTTAATA TAAATGGCCC TTATAATGGA
CTTGTAAAAG AAGGTATTCA ATTTAAAAGT GACGGCTCAA AAGATGAAGA TGGAAAAATC
GTTTCTTATT TATGGGACTT TGGAGATGGA AGCACAAGTG CAGAAGTAAA TCCGGTACAT
GTATATGAAA GCGAAGGTTC ATATAAAGTA GCGTTAATAG TAAAAGATGA TAAAGGAAAA
GAGAGCAAAA GCGAAATAAC GGTTACGGTT AAAGGTGGAA GTTTAACAGA ATCAGAACCA
AATAATCGCC CAGAGGAAGC AAATCGTATT GGACTAAACA CTACTATAAA AGGTAGTCTT
ATCGGCGGGG ATCACACTGA TGTTTATACA TTTAATGTAG CATCAGCGAA AAATATTGAT
ATTTCCGTTT TAAATGAATA TGGAATCGGG ATGACATGGG TACTTCACCA TGAATCAGAT
ATGCAAAATT ATGCTGCTTA CGGTCAAGCA AATGGAAATC ATATAGAGGC AAACTTTAAT
GCAAAACCAG GTAAGTATTA CTTGTATGTA TATAAATATG ATAATGGTGA TGGAACATAC
GAATTATCAG TAAAATAA
 
Protein sequence
MNKKSKINKV MLSISTMALS LGALQAPVSA EEKVPYNVLK TKPVGIEKPV DEIGHVSKAE 
ETLSFQERLK VGDFSQRPAS ITNKATVKQV KESYSMADLN KMNNQELVET LGSIKWHQIT
DLFQFNEDAK AFYKDKGKMQ VVIDELAHRG STFTKDDSKG IQTFTEVLRS AFYLAFYNNE
LSELNERSFQ DKCLPALKAI AKNPNFKLGT AEQDTVVSAY GKLISNASSD VETVQYASNI
LKQYNDNFTT YVNDRMKGQA IYDIMQGIDY DIQSYLIEAR KEANETMWYG KVDGFINEIN
RIALLNEVTQ ENKWLVNNGI YFASRLGKFH SNPNKGLEVV TQAMHMYPRL SEPYFVAVEQ
ITTNYNGKDY SGNTVDLEKI RKEGKEQYLP KTYTFDDGSI VFKTGDKVSE EKIKRLYWAA
KEVKAQYHRV IGNDKALEPG NADDILTIVI YNSPEEYQLN RQLYGYETNN GGIYIEETGT
FFTYERTPEQ SIYSLEELFR HEFTHYLQGR YEVPGLFGRG DMYQNERLTW FQEGNAEFFA
GATRTNNVVP RKSIISGLSS DPASRYTAER TLFAKYGSWD FYNYSFALQS YLYTHQFETF
DKIQDLIRAN DVKNYDAYRE NLSKDLKLNE EYQEYMQHLI DNQDKYNVPE VADDYLAEHA
PKSLTAVEKE ITETLPMKDA KMTKHSSQFF NTFTLEGTYT GSVTKGESED WNAMSKKVNE
ALEQLAQKEW SGYKTVTAYF VNYRVNSSNQ FEYDVVFHGI AKDDEENKAP TVNINGPYNG
LVKEGIQFKS DGSKDEDGKI VSYLWDFGDG STSAEVNPVH VYESEGSYKV ALIVKDDKGK
ESKSEITVTV KGGSLTESEP NNRPEEANRI GLNTTIKGSL IGGDHTDVYT FNVASAKNID
ISVLNEYGIG MTWVLHHESD MQNYAAYGQA NGNHIEANFN AKPGKYYLYV YKYDNGDGTY
ELSVK