Gene Information |
Locus tag | Bcer98_0486 |
Symbol | |
ID | 5345068 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cytotoxicus NVH 391-98 |
Kingdom | Bacteria |
Replicon accession | NC_009674 |
Strand | + |
Start bp | 548387 |
End bp | 551290 |
Gene Length | 2904 bp |
Protein Length | 967 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640838085 |
Product | collagenase |
Protein accession | YP_001373836 |
Protein GI | 152974319 |
COG category | [R] General function prediction only |
COG ID | [COG3291] FOG: PKD repeat |
TIGRFAM ID | |
|
|
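The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the reported gene length includes the stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon does not encode an amino acid.
    return codons - 1 if includes_stop else codons


length = gene_length_bp(548387, 551290)
print(length)                     # 2904
print(protein_length_aa(length))  # 967
```

Both values agree with the Gene Length and Protein Length rows of the record.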
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.008095 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAGA AATCAAAAAT TACGCAAATG ATGCTCGGTA TTAGTACAGT AGCTTTATCA TTTGGAAGTA TTCAAACACA AGTATCAGCA GAAGAAAAAA TACCTTATAA TGTCATGCAA ATGAAGCCGA TTGGAGTTGA AACTCAGACG GATCAAATTG CTCATGTTTC GCAGGAAAAT AAAACATTAT CTTTTGAAGA ACGATTAAGA ACGGGAGATT TTTCTCAACG TCCAGCTTTT GGAATGAAGA GAACGGAATC TAAACAGTTT CAAGAAAGTT ATTCTATGGC TGATTTCAAT AAAATGAATA ATGAAGAATT GATTGATACA TTGGCAAATA TTCGTTGGAA TCAAATCACA GATTTATTTC AATTTAATCA AGACACAAAA ACGTTTTATC AAAATAAGGA AAGAATGCAA GTTCTGATTG ATGAATTAGG TCGTCGTGGG AGTACATTTA CAAAAGATGA TGCAAAAGGA ATTGATACAT TTGTTGAAGT GTTACGTTCA GCTTTTTACC TTGGTTTCTA TCATAAAGAA TTAAACTATT TAAATGAGCG TAGTTTCCAT GATAAATGCT TGCCAGCATT AAAAGCCATT GCGAAAAATC CAAACTTTAA ACTTGGGACA GAACAACAAG ATACGGTAGT ATCTGCATAC GGGAAATTAA TTAGTAACGC TTCGAGCGAT GTTGAAACAG TGCAATATGC AGCAAATATT TTAAAACAGT ACAATGATAA TCTCGCTACT TATGTAAGCG ATTTTATGAA AGGGCAAGCT GTATATAGTC TTATACAAGG GATTGATTAT GATATAGATT CTTATGTGTA TGCTTCGTAT AAAGAAGCAA ATCAAACAAT GTGGTACGGT AAAATGGATG CTTTTGTAAA TGAAGTGAGC CGTATTGCGC TTTTACAGCA TGTAACAACA GAAAATAGTT GGTTAATTAA TAATGGCATT TATTTTGCAG GTCGTTTAGG GAAATTTCAT AGTGACCCAA ATAAAGGATT AGCAGTTGTT ACACAAGCAA TGCATATGTA TCCGCATTTA AGTGATGCTT ATTTTGTTGC AGTAGAGCAG ATTAAAACGA ATTATGGCGG AAAAGATTAC AATGGAAATA AAATAGATGC AGAGAAAATA CGTGAAGAGG GAAAACAACG GTACTTGCCA AAAACATATA CATTTGATGA TGGATCTATT GTATTTAAAA CAGGGGATAA AGTAACAGAA GAAAAAGTGA AGCGATTGTA CTGGGCTGCT AAAGAAGTAA AAGCGCAATA TCATCGTGTG ATTGGAAATG ATAAAGCGCT AGAGCCAGGT AATCCGGATG ATGTGCTAAC AGTTGTTATT TATAATAGTC CAGATGAATA TAAGCTTAAT AGACAACTAT ATGGATATGA AACAAATAAT GGTGGAATTT ATATTGAACA AGACGGAACT TTCTTTACAT ATGAACGTAC CCCAGAACAA AGTATTTATA GTTTAGAAGA GTTATTCCGT CATGAATTTA CTCATTATTT ACAAGGGAGA TATGAGGTTC CTGGTTTATT TGGAGAAGGA GATATGTATC AAAATGAGAG ATTAACTTGG TTTCAAGAAG GGAATGCAGA GTTCTTTGCA GGATCTACTC GTATGAACAA TGTTATTCCA CGTAAAAGTA TAATTAGTGG ATTATCATCT GATCCTGCAA ATCGTTATAC AGCAGAACGA ACATTATTTG CAAAATATGG TTCATGGGAT TTTTATAACT ATTCTTTTGC ACTACAGTCC TATTTATATA ATCATCGTTT TGAAATATTT 
GATCAAATTC AAGACTTAAT TCGAGCGAAT GATGTGAAGA ATTATGATGC ATATCGTGAG GCTTTAAGTA AGGATGCGAA TTTAAATACT GAGTACCAGG CGTATATGCA AAAATTAATT GATAATCAAG AAAGATATAA CGTGCCAGAG GTATCAGATG ATTATTTAGC AGAGCATGCA CCAAAGGCAT TAACAGAAGT GAAGAAAGAT ATCGAGGATA CAACGAATCT GAAAGGGGCA ACGATTAAAA AGCATAAGTC CCAATTTTTT GATACATTTA CGGTAGAAGG AACGTATACA GGTAGTACGA CAAAGGGAGA AGCTGAGGAC TGGAAAGAGA TGAGTAAGCA CATCAACCAA TCTCTAGAAA AATTGACTCA GAAGGAATGG AGCGGTTATA AAACAGTTAC AGCTTATTTT GTGAACTATC GTGTAAATGC AGAAAATCAA TTCGAATACG ATGTTGTATT TCATGGTATC GCAACAGATA GTGGAGAAAA TCAAGCTCCA GTTGTAAATA TAAATGGTCC ATACAATGGA AATGTAAATC AAGCCATTCA GTTTAATAGT GATGGTTCCA AAGATGAGGA TGGAAAAATT ACTTCTTACT TATGGGATTT TGGAGATGGT ACAACAACTA CGGAAGCAAA TCCAAAACAT GTATATAAGC AGGAAGGAAC ATATAAAGTA ACATTAACAG TAAAAGACGA TAAAGGAAAA GAAGCGAAAA CCGAAACAAC CGTTACTGTT AGAAAAGGAC ACGAAACAAC GGTAGAATCC GAACCAAATA ATCGTCTGGA GGAAGCGAAT CCTATTGCAT TCCATACATT GATAAAAGGT AGTCTTATGA ATGAAGATCG TACAGATATT TATACTTTTG ATGTCACTTC TTCAAAAAAT ATAGATATTT CTGTTGTAAA TGAACATAAC ATTGGGATGA CATGGGTTCT TCATCATGAA TCAGATATGC AAAATTATGT TGCGTATGGC CAAGCTAATG GAAATGATAT AAAAGGGAAA TTTGAAGCAA AACCAGGAAA GTATTATTTA TATGTATATA AATTTGATAA CGGAAATGGG ACATACACAT TATCAGTAAA ATAA
|
Protein sequence | MNKKSKITQM MLGISTVALS FGSIQTQVSA EEKIPYNVMQ MKPIGVETQT DQIAHVSQEN KTLSFEERLR TGDFSQRPAF GMKRTESKQF QESYSMADFN KMNNEELIDT LANIRWNQIT DLFQFNQDTK TFYQNKERMQ VLIDELGRRG STFTKDDAKG IDTFVEVLRS AFYLGFYHKE LNYLNERSFH DKCLPALKAI AKNPNFKLGT EQQDTVVSAY GKLISNASSD VETVQYAANI LKQYNDNLAT YVSDFMKGQA VYSLIQGIDY DIDSYVYASY KEANQTMWYG KMDAFVNEVS RIALLQHVTT ENSWLINNGI YFAGRLGKFH SDPNKGLAVV TQAMHMYPHL SDAYFVAVEQ IKTNYGGKDY NGNKIDAEKI REEGKQRYLP KTYTFDDGSI VFKTGDKVTE EKVKRLYWAA KEVKAQYHRV IGNDKALEPG NPDDVLTVVI YNSPDEYKLN RQLYGYETNN GGIYIEQDGT FFTYERTPEQ SIYSLEELFR HEFTHYLQGR YEVPGLFGEG DMYQNERLTW FQEGNAEFFA GSTRMNNVIP RKSIISGLSS DPANRYTAER TLFAKYGSWD FYNYSFALQS YLYNHRFEIF DQIQDLIRAN DVKNYDAYRE ALSKDANLNT EYQAYMQKLI DNQERYNVPE VSDDYLAEHA PKALTEVKKD IEDTTNLKGA TIKKHKSQFF DTFTVEGTYT GSTTKGEAED WKEMSKHINQ SLEKLTQKEW SGYKTVTAYF VNYRVNAENQ FEYDVVFHGI ATDSGENQAP VVNINGPYNG NVNQAIQFNS DGSKDEDGKI TSYLWDFGDG TTTTEANPKH VYKQEGTYKV TLTVKDDKGK EAKTETTVTV RKGHETTVES EPNNRLEEAN PIAFHTLIKG SLMNEDRTDI YTFDVTSSKN IDISVVNEHN IGMTWVLHHE SDMQNYVAYG QANGNDIKGK FEAKPGKYYL YVYKFDNGNG TYTLSVK
|
| |
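The protein sequence can be related back to the gene sequence by translating codon by codon. The following is a minimal sketch, not the database's own method: it builds the amino acid assignments of NCBI translation table 11 (which match the standard code; the tables differ in permitted start codons) and translates the first and last few codons of the gene sequence shown above. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Amino acid assignments for NCBI translation table 11, listed in the
# conventional TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases of the gene sequence, as printed in the record.
print(translate("ATGAATAAGA AATCAAAAAT TACGCAAATG"))  # MNKKSKITQM
# Last 27 bases, ending in the TAA stop codon ('*').
print(translate("GGGACATACACATTATCAGTAAAATAA"))       # GTYTLSVK*
```

The two outputs match the first ten and last eight residues of the protein sequence above. For the full sequence, an established library such as Biopython would be a more robust choice than a hand-built table.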