Gene Bcer98_2213 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcer98_2213 
Symbol 
ID5345736 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cytotoxicus NVH 391-98 
KingdomBacteria 
Replicon accessionNC_009674 
Strand
Start bp2298356 
End bp2300026 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content34% 
IMG OID640839732 
Productpeptidase M4 thermolysin 
Protein accessionYP_001375458 
Protein GI152975941 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3227] Zinc metalloprotease (elastase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000134163 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAAA CGGTTGTTAC ATTGCTGACT GCTGGTGCAG TTTTAGGTGC ACCATTTTCA 
ACTGCTTTTG CAGAAGAACA AGCGTCACAA CAAAAAATTA TGGATCAAAT GGAAGTCGTA
CAAAAAGATT GGAATGACGA AAAAGGGAAC CCGTCTTTCC TAGCGGGAGA ATTATCAACT
AAAAATGTAC AGACTCAAAA GGAAGTCGAG AAATTTTTGG AAGATAATAA AGCTTTATTT
AAACTTGACC CAAAAACAGA CTTGACACTT AAAGAAGTGA AATCAGATGA TTTGGGAATG
AAACACTATG TTTACACTCA ATCTATCAAT AAGGTACCAG TTGATGGTGC AAGATTTATG
ATTCATACAA ATAAAGAGGG TAAAGTAACA ACAGTAAATG GAAATATACA CCCGGATGCT
GCGGAGAATG TAAGAAAGAA TACAACTGCT AAAATTTCGA AAGAAGCAGC TCTTTCAAAT
GCGTGGAAAC ATATTAATCT TACAAAAAAT GACACGTTAG TCGAAGGGAA TGGAAATGTA
TTAGATAAAG TAAAAGACAA CTTAGAATCT ACAAGTGAAA AAGCTGATTT AGTCGTATAT
GAAAAAGAAG GAAAATATTA TTTAACTTTT AAAGTACAAT TGCAATTTGT TAAACCTTAT
GGAGCTAATT GGCAAATTTA TGTTAATGCA GAAGATGGAA CAATTGTTGA TTCATATAAT
GCAGTTACAG ATGCAGATAT TGCTAAAAAA GGCTATGGTT ATGGTGTATT AGGTGACCGA
AAAGAACTAA ACACTACTTA TAATAGTACA AAGGGAAAAT ATTATTTACG TGATATGACA
AAGCCTATGA ATGGAGGAGT CATTGAAACA TTTACAGTCA ACCACAGTGA TGCAGATTAT
CCGGTTAACT ACTACCTATT AGATGATGAT AATGCATGGA TCAATAAAAA TCAAAGACCT
GCGGTTGATG CTCATTATCA TGCAGGAAAA GTCTATGATT ATTATAAAAA TATTCATAAC
CGAAACAGTA TTGATGGAAA AGGTAAAACG ATTCGTTCAG CTGTCAATTA TGGAACTAAT
GTCAATAATG CATTTTGGAA TGGGCAGCAA ATGATTTATG GAGATGGAGA CGGTTATGAA
TTTATTCCAC TTTCAGGTTC ACTTGATGTT GTTGCCCATG AATTAACACA TGCAGTAACT
CAATATTCAG CTGATTTGCA ATATGTGAAT CAATCTGGTG CATTAAATGA ATCTTTCTCT
GATGTATTTG GATATTTTGT CGATCCAAAC AACTGGGATT TAGGGGAAGC TGTATATACA
CCTAGAGTTG CTGGAGATGC ACTTCGTAGT TTATCGGATC CAGAAAAATA TGGACAGCCT
GCTCATATGA AAGATTATCA ATTTTTACCT CCTACTGAAG AAGGAGATAA TGGGGGCGTG
CATATTAACA GCGGGATTCC AAATAAGGCG GCTTATTTAA CAATTAACGC TATTGGTAAA
GAAAAAGCGG AAAAAATCTA TTACCGTGCT TTAACAACTT ATTTAACACC AACAAGTAAC
TTTAAACAAG CTCGTGCAGC TTTATTACAA TCTGCAGCTG ATTATGATGG TTATGATAGC
GCAACATATA AGGCAGTAGA AAATGCTTGG AATCAAGTTG GTATAAATTA A
 
Protein sequence
MKKTVVTLLT AGAVLGAPFS TAFAEEQASQ QKIMDQMEVV QKDWNDEKGN PSFLAGELST 
KNVQTQKEVE KFLEDNKALF KLDPKTDLTL KEVKSDDLGM KHYVYTQSIN KVPVDGARFM
IHTNKEGKVT TVNGNIHPDA AENVRKNTTA KISKEAALSN AWKHINLTKN DTLVEGNGNV
LDKVKDNLES TSEKADLVVY EKEGKYYLTF KVQLQFVKPY GANWQIYVNA EDGTIVDSYN
AVTDADIAKK GYGYGVLGDR KELNTTYNST KGKYYLRDMT KPMNGGVIET FTVNHSDADY
PVNYYLLDDD NAWINKNQRP AVDAHYHAGK VYDYYKNIHN RNSIDGKGKT IRSAVNYGTN
VNNAFWNGQQ MIYGDGDGYE FIPLSGSLDV VAHELTHAVT QYSADLQYVN QSGALNESFS
DVFGYFVDPN NWDLGEAVYT PRVAGDALRS LSDPEKYGQP AHMKDYQFLP PTEEGDNGGV
HINSGIPNKA AYLTINAIGK EKAEKIYYRA LTTYLTPTSN FKQARAALLQ SAADYDGYDS
ATYKAVENAW NQVGIN