Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCB4264_A0591 |
Symbol | |
ID | 7097861 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus B4264 |
Kingdom | Bacteria |
Replicon accession | NC_011725 |
Strand | + |
Start bp | 567364 |
End bp | 570261 |
Gene Length | 2898 bp |
Protein Length | 965 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 643468146 |
Product | putative microbial collagenase |
Protein accession | YP_002365351 |
Protein GI | 218231943 |
COG category | [R] General function prediction only |
COG ID | [COG3291] FOG: PKD repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.129925 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAGA AATCAAAGAT CAATAAAGTG ATGCTTAGCA TTAGTACAAT GGCTTTATCG TTAGGCGCAC TTCAAACTCA TGCAGCAGCG GAAGAAAAAG TACCGTATAA CGTGTTAAAA ACGAAACCGG TTGGAATTGA AAAGTCGGTA GATGAAGTTG GACATATTTC AAAAGTTGAT GAAACTTTAT CATTTCAAGA ACGTTTAAAA GTAGGAGATT TTTCACAGCG ACCAGCATCT ATTACGAAGA AAACTGCAGT AAAGCAGGTT AAAGAAAGCT ATTCAATGGC TGATTTAAAC AAAATGAATG ACCAAGAATT AGTTGAAACG TTAGGCAGTA TTAAATGGCA CCAAATTACA GACTTATTCC AGTTTAATGA AGATGCAAAG GCTTTTTATA AAGATAAAGG AAAAATGCAA GTCATTATAG ATGAATTAGC TCATAGAGGT AGTACATTTA CGAAAGATGA TTCAAGAGGA ATTCAAACGT TTACTGAAGT GCTGCGTTCA GCTTTTTATC TTGCATTTTA TAATAGTGAA TTAAGCGACT TAAATGAAAG AAGCTTCCAG GATAAATGTT TACCTGCTTT AAAAGCAATC GCAAAAAATC CAAACTTTAA GCTTGGTACA GTTGAACAAG ATACAGTCGT ATCTGCGTAC GGTAAATTAA TTAGTAATGC TTCAAGCGAT GTTGAAACGG TTCAATATGC ATCGAATATT TTAAAGCAAT ACAATGATAA TTATACTACT TATGTAAATG ATCGAATGAA GGGACAAGCA ATATACGATA TTATGCAAGG TATTGACTAT GATATGCAGT CGTACTTAAC TGAGGCTCGT AAAGAAGCGA ATGAAACGAT GTGGTATGGA AAAGTAGATG GGTTTATTAA TGAAATAAAT CGTATTGCTC TTCTAAATGA AGTAACGCCA GAAAATAAAT GGCTCGTTAA TAATGGCATT TATTTTGCTA GCCGTTTAGG GAAGTTTCAT AGCAATCCAA ATAAAGGATT AGAGGTTGTT ACACAAGCAA TGCATATGTA CCCGCGCTTA AGTGAACCGT ATTTTGTTGC GGTAGAACAA ATTACAACAA ATTATAATGG TAAAGATTAT AGCGGGAATA CAGTAGATTT AGAGAAAATA CGTAAAGAAG GAAAAGAGCA ATACCTACCA AAAACGTATA CATTCGACGA TGGATCAATT GTGTTCAAAA CAGGAGATAA AGTATCAGAA GAAAAAATTA AGAGACTATA TTGGGCTGCG AAGGAAGTAA AGGCACAGTA TCACCGTGTA ATTGGAAATG ACAAAGCGTT AGAGCCAGGA AATGCGGATG ATGTATTAAC GATCGTAATT TATAATAGTC CAGATGAATA TCAGTTAAAT AGACAATTGT ATGGATATGA AACAAACAAC GGTGGAATTT ATATTGAAGA GACAGGTACA TTCTTTACAT ATGAGCGTAC ACCAGAGCAA AGTATTTATA GTTTAGAAGA GTTATTCCGT CATGAATTTA CTCATTATCT GCAAGGGAGA TATGAAGTTC CTGGTTTATT TGGAAGAGGA GATATGTATC AAAATGAAAG GCTAACTTGG TTCCAAGAAG GAAATGCAGA GTTTTTCGCA GGATCTACTC GTACGAATAA CGTTGTACCA AGAAAGAGTA TAATTAGCGG ATTATCATCT GATCCTGCAA GCCGTTATAC AGCAGAGCGT ACACTATTTG CTAAATACGG TTCTTGGGAT TTCTATAATT ACTCGTTCGC ATTGCAGTCT TACTTATATA CGCATCAGTT TGAAACATTT GATAAAATTC AAGATTTGAT TCGTGCGAAT GACGTGAAAA ATTATGATGC ATATCGTGAA AATCTAAGTA AAGATCCTAA GTTAAATAAA GAGTATCAAG AGTATATGCA GCAGTTAATT GATAATCAAG ATAAATATAA TGTACCGGAA GTAGCAGATG ATTATTTAGC TGAACATGCA CCGAAATCGT TAACTGAAGT GAAAAAAGAA ATTAGTGATA CGTTGCCTAT GAAAGATACA AAAATGACAA AACATAATTC TCAATTCTTT AATACATTTA CATTAGAAGG TACGTATACA GGTAGTGTCA CAAAAGGTGA ATCAGAAGAT TGGAAAGCAA TGAGTAAAAG AGTAAATGAA TCTTTAGAAC AATTGGCGCA AAAAGAATGG AGTGGCTACA AAACTGTTAC AGCATACTTC GTCAATTATC GTGTGAATAG CTCAAATGAA TTTGAATATG ATGTAGTCTT CCATGGAATC GCAAAAGATG ATGGAGAAAA TAAAGCTCCA ACGGTTAATA TAAATGGCCC TTATAGCGGT CTTGTAAAAG AGGGAATTCA ATTTAAAAGT GATGGCTCAA ACGATGAAGA TGGAAAAATT GTTTCTTATT TATGGGAATT TGGAGATGGA AGCACAAGTG TAGAAGTGAA TCCAGTACAT GTATATGAAA GAGAAGGTTC TTATAAAGTA TCGTTAAGAG TAAAAGATGA TAAAGGCAAA GAGAGTAGAA GCGAAACAAC TGTTACGATT AAAGATGGAA GTTTAACAGA ATCAGAACCA AATAATCGTC CAGAGGAAGC AAATCGTATC GGGCTAAATA GTACGATAAA AGGTAATCTT ATTGGCGGGG ACCACACTGA TGTTTATACA TTTAATGTAG CATCAGCGAA AGATATCGAC ATTTCTGTTT TAAATGAGTA TGGAATTGGG ATGACATGGG TACTTCACCA TGAATCAGAT ATGCAAAATT ATGCAGCTTA CGGTCAAGCC AATGGGAATC ATATAGAAGC GAAATTTAAT GCAAAACCAG GCAAGTATTA CTTGTATGTA TATAAATATG ATAATGGCGA TGGAACGTAC TCATTATCAG TAAAGTGA
|
Protein sequence | MNKKSKINKV MLSISTMALS LGALQTHAAA EEKVPYNVLK TKPVGIEKSV DEVGHISKVD ETLSFQERLK VGDFSQRPAS ITKKTAVKQV KESYSMADLN KMNDQELVET LGSIKWHQIT DLFQFNEDAK AFYKDKGKMQ VIIDELAHRG STFTKDDSRG IQTFTEVLRS AFYLAFYNSE LSDLNERSFQ DKCLPALKAI AKNPNFKLGT VEQDTVVSAY GKLISNASSD VETVQYASNI LKQYNDNYTT YVNDRMKGQA IYDIMQGIDY DMQSYLTEAR KEANETMWYG KVDGFINEIN RIALLNEVTP ENKWLVNNGI YFASRLGKFH SNPNKGLEVV TQAMHMYPRL SEPYFVAVEQ ITTNYNGKDY SGNTVDLEKI RKEGKEQYLP KTYTFDDGSI VFKTGDKVSE EKIKRLYWAA KEVKAQYHRV IGNDKALEPG NADDVLTIVI YNSPDEYQLN RQLYGYETNN GGIYIEETGT FFTYERTPEQ SIYSLEELFR HEFTHYLQGR YEVPGLFGRG DMYQNERLTW FQEGNAEFFA GSTRTNNVVP RKSIISGLSS DPASRYTAER TLFAKYGSWD FYNYSFALQS YLYTHQFETF DKIQDLIRAN DVKNYDAYRE NLSKDPKLNK EYQEYMQQLI DNQDKYNVPE VADDYLAEHA PKSLTEVKKE ISDTLPMKDT KMTKHNSQFF NTFTLEGTYT GSVTKGESED WKAMSKRVNE SLEQLAQKEW SGYKTVTAYF VNYRVNSSNE FEYDVVFHGI AKDDGENKAP TVNINGPYSG LVKEGIQFKS DGSNDEDGKI VSYLWEFGDG STSVEVNPVH VYEREGSYKV SLRVKDDKGK ESRSETTVTI KDGSLTESEP NNRPEEANRI GLNSTIKGNL IGGDHTDVYT FNVASAKDID ISVLNEYGIG MTWVLHHESD MQNYAAYGQA NGNHIEAKFN AKPGKYYLYV YKYDNGDGTY SLSVK
|
| |