Gene Bcer98_0486 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcer98_0486 
Symbol 
ID5345068 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cytotoxicus NVH 391-98 
KingdomBacteria 
Replicon accessionNC_009674 
Strand
Start bp548387 
End bp551290 
Gene Length2904 bp 
Protein Length967 aa 
Translation table11 
GC content33% 
IMG OID640838085 
Productcollagenase 
Protein accessionYP_001373836 
Protein GI152974319 
COG category[R] General function prediction only 
COG ID[COG3291] FOG: PKD repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.008095 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAGA AATCAAAAAT TACGCAAATG ATGCTCGGTA TTAGTACAGT AGCTTTATCA 
TTTGGAAGTA TTCAAACACA AGTATCAGCA GAAGAAAAAA TACCTTATAA TGTCATGCAA
ATGAAGCCGA TTGGAGTTGA AACTCAGACG GATCAAATTG CTCATGTTTC GCAGGAAAAT
AAAACATTAT CTTTTGAAGA ACGATTAAGA ACGGGAGATT TTTCTCAACG TCCAGCTTTT
GGAATGAAGA GAACGGAATC TAAACAGTTT CAAGAAAGTT ATTCTATGGC TGATTTCAAT
AAAATGAATA ATGAAGAATT GATTGATACA TTGGCAAATA TTCGTTGGAA TCAAATCACA
GATTTATTTC AATTTAATCA AGACACAAAA ACGTTTTATC AAAATAAGGA AAGAATGCAA
GTTCTGATTG ATGAATTAGG TCGTCGTGGG AGTACATTTA CAAAAGATGA TGCAAAAGGA
ATTGATACAT TTGTTGAAGT GTTACGTTCA GCTTTTTACC TTGGTTTCTA TCATAAAGAA
TTAAACTATT TAAATGAGCG TAGTTTCCAT GATAAATGCT TGCCAGCATT AAAAGCCATT
GCGAAAAATC CAAACTTTAA ACTTGGGACA GAACAACAAG ATACGGTAGT ATCTGCATAC
GGGAAATTAA TTAGTAACGC TTCGAGCGAT GTTGAAACAG TGCAATATGC AGCAAATATT
TTAAAACAGT ACAATGATAA TCTCGCTACT TATGTAAGCG ATTTTATGAA AGGGCAAGCT
GTATATAGTC TTATACAAGG GATTGATTAT GATATAGATT CTTATGTGTA TGCTTCGTAT
AAAGAAGCAA ATCAAACAAT GTGGTACGGT AAAATGGATG CTTTTGTAAA TGAAGTGAGC
CGTATTGCGC TTTTACAGCA TGTAACAACA GAAAATAGTT GGTTAATTAA TAATGGCATT
TATTTTGCAG GTCGTTTAGG GAAATTTCAT AGTGACCCAA ATAAAGGATT AGCAGTTGTT
ACACAAGCAA TGCATATGTA TCCGCATTTA AGTGATGCTT ATTTTGTTGC AGTAGAGCAG
ATTAAAACGA ATTATGGCGG AAAAGATTAC AATGGAAATA AAATAGATGC AGAGAAAATA
CGTGAAGAGG GAAAACAACG GTACTTGCCA AAAACATATA CATTTGATGA TGGATCTATT
GTATTTAAAA CAGGGGATAA AGTAACAGAA GAAAAAGTGA AGCGATTGTA CTGGGCTGCT
AAAGAAGTAA AAGCGCAATA TCATCGTGTG ATTGGAAATG ATAAAGCGCT AGAGCCAGGT
AATCCGGATG ATGTGCTAAC AGTTGTTATT TATAATAGTC CAGATGAATA TAAGCTTAAT
AGACAACTAT ATGGATATGA AACAAATAAT GGTGGAATTT ATATTGAACA AGACGGAACT
TTCTTTACAT ATGAACGTAC CCCAGAACAA AGTATTTATA GTTTAGAAGA GTTATTCCGT
CATGAATTTA CTCATTATTT ACAAGGGAGA TATGAGGTTC CTGGTTTATT TGGAGAAGGA
GATATGTATC AAAATGAGAG ATTAACTTGG TTTCAAGAAG GGAATGCAGA GTTCTTTGCA
GGATCTACTC GTATGAACAA TGTTATTCCA CGTAAAAGTA TAATTAGTGG ATTATCATCT
GATCCTGCAA ATCGTTATAC AGCAGAACGA ACATTATTTG CAAAATATGG TTCATGGGAT
TTTTATAACT ATTCTTTTGC ACTACAGTCC TATTTATATA ATCATCGTTT TGAAATATTT
GATCAAATTC AAGACTTAAT TCGAGCGAAT GATGTGAAGA ATTATGATGC ATATCGTGAG
GCTTTAAGTA AGGATGCGAA TTTAAATACT GAGTACCAGG CGTATATGCA AAAATTAATT
GATAATCAAG AAAGATATAA CGTGCCAGAG GTATCAGATG ATTATTTAGC AGAGCATGCA
CCAAAGGCAT TAACAGAAGT GAAGAAAGAT ATCGAGGATA CAACGAATCT GAAAGGGGCA
ACGATTAAAA AGCATAAGTC CCAATTTTTT GATACATTTA CGGTAGAAGG AACGTATACA
GGTAGTACGA CAAAGGGAGA AGCTGAGGAC TGGAAAGAGA TGAGTAAGCA CATCAACCAA
TCTCTAGAAA AATTGACTCA GAAGGAATGG AGCGGTTATA AAACAGTTAC AGCTTATTTT
GTGAACTATC GTGTAAATGC AGAAAATCAA TTCGAATACG ATGTTGTATT TCATGGTATC
GCAACAGATA GTGGAGAAAA TCAAGCTCCA GTTGTAAATA TAAATGGTCC ATACAATGGA
AATGTAAATC AAGCCATTCA GTTTAATAGT GATGGTTCCA AAGATGAGGA TGGAAAAATT
ACTTCTTACT TATGGGATTT TGGAGATGGT ACAACAACTA CGGAAGCAAA TCCAAAACAT
GTATATAAGC AGGAAGGAAC ATATAAAGTA ACATTAACAG TAAAAGACGA TAAAGGAAAA
GAAGCGAAAA CCGAAACAAC CGTTACTGTT AGAAAAGGAC ACGAAACAAC GGTAGAATCC
GAACCAAATA ATCGTCTGGA GGAAGCGAAT CCTATTGCAT TCCATACATT GATAAAAGGT
AGTCTTATGA ATGAAGATCG TACAGATATT TATACTTTTG ATGTCACTTC TTCAAAAAAT
ATAGATATTT CTGTTGTAAA TGAACATAAC ATTGGGATGA CATGGGTTCT TCATCATGAA
TCAGATATGC AAAATTATGT TGCGTATGGC CAAGCTAATG GAAATGATAT AAAAGGGAAA
TTTGAAGCAA AACCAGGAAA GTATTATTTA TATGTATATA AATTTGATAA CGGAAATGGG
ACATACACAT TATCAGTAAA ATAA
 
Protein sequence
MNKKSKITQM MLGISTVALS FGSIQTQVSA EEKIPYNVMQ MKPIGVETQT DQIAHVSQEN 
KTLSFEERLR TGDFSQRPAF GMKRTESKQF QESYSMADFN KMNNEELIDT LANIRWNQIT
DLFQFNQDTK TFYQNKERMQ VLIDELGRRG STFTKDDAKG IDTFVEVLRS AFYLGFYHKE
LNYLNERSFH DKCLPALKAI AKNPNFKLGT EQQDTVVSAY GKLISNASSD VETVQYAANI
LKQYNDNLAT YVSDFMKGQA VYSLIQGIDY DIDSYVYASY KEANQTMWYG KMDAFVNEVS
RIALLQHVTT ENSWLINNGI YFAGRLGKFH SDPNKGLAVV TQAMHMYPHL SDAYFVAVEQ
IKTNYGGKDY NGNKIDAEKI REEGKQRYLP KTYTFDDGSI VFKTGDKVTE EKVKRLYWAA
KEVKAQYHRV IGNDKALEPG NPDDVLTVVI YNSPDEYKLN RQLYGYETNN GGIYIEQDGT
FFTYERTPEQ SIYSLEELFR HEFTHYLQGR YEVPGLFGEG DMYQNERLTW FQEGNAEFFA
GSTRMNNVIP RKSIISGLSS DPANRYTAER TLFAKYGSWD FYNYSFALQS YLYNHRFEIF
DQIQDLIRAN DVKNYDAYRE ALSKDANLNT EYQAYMQKLI DNQERYNVPE VSDDYLAEHA
PKALTEVKKD IEDTTNLKGA TIKKHKSQFF DTFTVEGTYT GSTTKGEAED WKEMSKHINQ
SLEKLTQKEW SGYKTVTAYF VNYRVNAENQ FEYDVVFHGI ATDSGENQAP VVNINGPYNG
NVNQAIQFNS DGSKDEDGKI TSYLWDFGDG TTTTEANPKH VYKQEGTYKV TLTVKDDKGK
EAKTETTVTV RKGHETTVES EPNNRLEEAN PIAFHTLIKG SLMNEDRTDI YTFDVTSSKN
IDISVVNEHN IGMTWVLHHE SDMQNYVAYG QANGNDIKGK FEAKPGKYYL YVYKFDNGNG
TYTLSVK