Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCZK0466 |
Symbol | colA |
ID | 3022291 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus E33L |
Kingdom | Bacteria |
Replicon accession | NC_006274 |
Strand | + |
Start bp | 543776 |
End bp | 546673 |
Gene Length | 2898 bp |
Protein Length | 965 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 637544683 |
Product | microbial collagenase |
Protein accession | YP_082073 |
Protein GI | 52144755 |
COG category | [R] General function prediction only |
COG ID | [COG3291] FOG: PKD repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0113001 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAGA AATCAAAAAT CAATAAAGTT ATGCTTAGCA TTAGTACAAT GGCTCTATCA TTAGGGGCGC TTCAAGCTCC TGTATCAGCG GAAGAAAAAG TACCGTATAA TGTGTTGAAA ACGAAACCAG TTGGAATTGA AAAACCAGTA GATGAGATTG GACACGTTTC TAAAGCGGAG GAAACATTAT CGTTTCAAGA ACGGTTAAAA GTAGGAGACT TTTCACAGCG ACCAGCATCT ATTACGAACA AAGCGACAGT AAAGCAAGTT AAAGAAAGCT ATTCAATGGC TGATTTAAAC AAAATGAATA ATCAAGAATT AGTTGAAACG TTAGGCAGTA TTAAGTGGCA TCAAATTACA GACTTATTCC AGTTTAATGA AGATGCAAAA GCTTTTTATA AAGATAAAGG GAAAATGCAA GTCGTTATAG ATGAATTAGC TCATCGGGGT AGTACATTTA CGAAAGATGA TTCAAAAGGA ATTCAAACGT TTACGGAAGT GCTACGTTCA GCTTTTTATC TGGCATTTTA TAATAACGAA TTAAGTGAAT TAAATGAAAG AAGTTTCCAG GACAAATGTT TACCTGCTTT AAAAGCAATC GCAAAAAATC CAAATTTTAA GCTTGGTACA GCTGAACAAG ATACAGTCGT ATCTGCATAC GGTAAATTAA TTAGTAATGC GTCAAGCGAT GTTGAAACAG TTCAATATGC ATCGAATATT TTAAAGCAAT ACAATGATAA TTTTACTACA TATGTAAATG ATCGAATGAA GGGACAAGCA ATATACGATA TTATGCAAGG GATTGACTAT GATATACAGT CTTACTTAAT TGAAGCTCGT AAAGAAGCGA ATGAAACGAT GTGGTACGGG AAAGTAGATG GTTTTATTAA TGAAATAAAT CGTATTGCTC TTTTAAATGA AGTAACACAA GAAAATAAAT GGCTCGTTAA TAATGGAATT TATTTTGCAA GTCGTTTAGG GAAGTTTCAC AGTAATCCAA ATAAAGGTTT AGAAGTTGTT ACACAAGCAA TGCATATGTA TCCGCGCTTA AGTGAGCCGT ACTTTGTCGC AGTAGAACAA ATTACAACAA ATTATAATGG AAAAGATTAT AGCGGGAATA CAGTAGATTT AGAGAAAATA CGTAAAGAAG GAAAAGAGCA GTACTTACCA AAAACGTATA CATTTGATGA TGGATCTATC GTATTTAAAA CAGGAGATAA AGTATCGGAA GAAAAAATTA AGAGACTATA CTGGGCTGCA AAGGAAGTAA AAGCACAGTA TCATCGTGTA ATTGGGAATG ACAAAGCGTT AGAGCCGGGC AATGCAGATG ATATATTAAC GATAGTAATT TATAACAGTC CAGAAGAGTA CCAGTTAAAT AGACAACTGT ATGGATACGA AACAAATAAC GGTGGAATTT ATATTGAAGA AACAGGAACA TTCTTTACAT ATGAGCGCAC GCCAGAACAA AGCATTTATA GTTTAGAAGA GTTATTCCGT CATGAGTTTA CTCATTATCT TCAAGGGAGA TATGAAGTGC CAGGTTTATT CGGAAGAGGA GATATGTATC AAAATGAAAG GTTAACTTGG TTCCAAGAAG GCAATGCAGA GTTTTTCGCA GGGGCTACTC GAACAAATAA TGTAGTGCCG AGAAAGAGCA TCATTAGTGG ATTATCATCT GATCCTGCAA GCCGTTATAC AGCAGAGCGC ACATTATTTG CTAAATATGG TTCTTGGGAT TTCTATAATT ACTCGTTCGC ATTGCAATCT TACTTATATA CACATCAATT TGAAACGTTT GATAAAATTC AAGATTTAAT TCGTGCAAAT GACGTAAAAA ATTATGATGC CTATCGTGAG AATTTAAGTA AAGACCTTAA ACTAAACGAA GAGTATCAAG AGTATATGCA GCACCTAATC GATAATCAAG ATAAATATAA TGTACCGGAA GTAGCAGATG ATTATTTAGC TGAACATGCC CCAAAATCAT TAACGGCAGT AGAGAAAGAA ATTACTGAAA CGTTGCCGAT GAAAGATGCA AAAATGACAA AACATAGCTC CCAATTCTTT AATACATTTA CATTAGAAGG TACGTATACA GGTAGTGTAA CAAAAGGTGA GTCAGAAGAT TGGAACGCAA TGAGTAAGAA AGTAAATGAA GCTTTGGAAC AACTTGCGCA AAAAGAATGG AGTGGCTATA AAACTGTTAC AGCATATTTT GTCAATTACC GTGTAAATAG CTCAAATCAA TTTGAATATG ATGTAGTCTT CCACGGTATC GCAAAAGATG ACGAAGAAAA TAAAGCTCCA ACGGTTAATA TAAATGGCCC TTATAATGGA CTTGTAAAAG AAGGTATTCA ATTTAAAAGT GACGGCTCAA AAGATGAAGA TGGAAAAATC GTTTCTTATT TATGGGACTT TGGAGATGGA AGCACAAGTG CAGAAGTAAA TCCGGTACAT GTATATGAAA GCGAAGGTTC ATATAAAGTA GCGTTAATAG TAAAAGATGA TAAAGGAAAA GAGAGCAAAA GCGAAATAAC GGTTACGGTT AAAGGTGGAA GTTTAACAGA ATCAGAACCA AATAATCGCC CAGAGGAAGC AAATCGTATT GGACTAAACA CTACTATAAA AGGTAGTCTT ATCGGCGGGG ATCACACTGA TGTTTATACA TTTAATGTAG CATCAGCGAA AAATATTGAT ATTTCCGTTT TAAATGAATA TGGAATCGGG ATGACATGGG TACTTCACCA TGAATCAGAT ATGCAAAATT ATGCTGCTTA CGGTCAAGCA AATGGAAATC ATATAGAGGC AAACTTTAAT GCAAAACCAG GTAAGTATTA CTTGTATGTA TATAAATATG ATAATGGTGA TGGAACATAC GAATTATCAG TAAAATAA
|
Protein sequence | MNKKSKINKV MLSISTMALS LGALQAPVSA EEKVPYNVLK TKPVGIEKPV DEIGHVSKAE ETLSFQERLK VGDFSQRPAS ITNKATVKQV KESYSMADLN KMNNQELVET LGSIKWHQIT DLFQFNEDAK AFYKDKGKMQ VVIDELAHRG STFTKDDSKG IQTFTEVLRS AFYLAFYNNE LSELNERSFQ DKCLPALKAI AKNPNFKLGT AEQDTVVSAY GKLISNASSD VETVQYASNI LKQYNDNFTT YVNDRMKGQA IYDIMQGIDY DIQSYLIEAR KEANETMWYG KVDGFINEIN RIALLNEVTQ ENKWLVNNGI YFASRLGKFH SNPNKGLEVV TQAMHMYPRL SEPYFVAVEQ ITTNYNGKDY SGNTVDLEKI RKEGKEQYLP KTYTFDDGSI VFKTGDKVSE EKIKRLYWAA KEVKAQYHRV IGNDKALEPG NADDILTIVI YNSPEEYQLN RQLYGYETNN GGIYIEETGT FFTYERTPEQ SIYSLEELFR HEFTHYLQGR YEVPGLFGRG DMYQNERLTW FQEGNAEFFA GATRTNNVVP RKSIISGLSS DPASRYTAER TLFAKYGSWD FYNYSFALQS YLYTHQFETF DKIQDLIRAN DVKNYDAYRE NLSKDLKLNE EYQEYMQHLI DNQDKYNVPE VADDYLAEHA PKSLTAVEKE ITETLPMKDA KMTKHSSQFF NTFTLEGTYT GSVTKGESED WNAMSKKVNE ALEQLAQKEW SGYKTVTAYF VNYRVNSSNQ FEYDVVFHGI AKDDEENKAP TVNINGPYNG LVKEGIQFKS DGSKDEDGKI VSYLWDFGDG STSAEVNPVH VYESEGSYKV ALIVKDDKGK ESKSEITVTV KGGSLTESEP NNRPEEANRI GLNTTIKGSL IGGDHTDVYT FNVASAKNID ISVLNEYGIG MTWVLHHESD MQNYAAYGQA NGNHIEANFN AKPGKYYLYV YKYDNGDGTY ELSVK
|
| |