Gene Bcer98_1599 details


Gene Information

Locus tag: Bcer98_1599
Symbol:
ID: 5343534
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cytotoxicus NVH 391-98
Kingdom: Bacteria
Replicon accession: NC_009674
Strand:
Start bp: 1703778
End bp: 1705448
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 35%
IMG OID: 640839178
Product: peptidase M4 thermolysin
Protein accession: YP_001374904
Protein GI: 152975387
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3227] Zinc metalloprotease (elastase)
TIGRFAM ID:
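
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: the function name `check_record` is hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length.

```python
def check_record(start_bp: int, end_bp: int, gene_length_bp: int,
                 protein_length_aa: int) -> dict:
    """Cross-check the coordinate and length fields of a gene record.

    Assumptions (not stated in the record itself): coordinates are 1-based
    and inclusive, and the last codon is a stop codon that is excluded
    from the protein length.
    """
    span = end_bp - start_bp + 1            # inclusive coordinate span
    codons, remainder = divmod(gene_length_bp, 3)
    return {
        "span_matches_length": span == gene_length_bp,
        "length_is_multiple_of_3": remainder == 0,
        "protein_length_matches": codons - 1 == protein_length_aa,
    }


# Values copied from the Gene Information fields of this record.
result = check_record(start_bp=1703778, end_bp=1705448,
                      gene_length_bp=1671, protein_length_aa=556)
print(result)
```

For this record the span is 1705448 - 1703778 + 1 = 1671 bp, which gives 557 codons, or 556 amino acids once the terminal stop codon is set aside.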


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000990098
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAA CTGTTATTAC ATTGCTTGCT GCAGGAACAA TGTTAGGTGC ACCTTTTTCA 
ACTGCGTTTG CAGAAGAACA AGCACTTCAA AAAGAAGCAA TGGATAAAAT GGAAATCCAA
CAAAAAAATT GGAATGAGGG ACAAGGAAGT CCAGCATTTC TCTCAGGGGA ATTATCTAAT
AAGAAGGTAG AAAGTCAAAA AGCAGTAAAA GAGTTTCTTG AAGAAAATAA AGAACTATTT
AAAATCAATC CACAAACGGA TCTAACACTT AAAGAAGTGA AGTCTGATGA TTTAGGTATG
AAACATTATG TTTATACAAG GTCTGTAAAT AAGGTACCTG TTGATGGTGC ACAATTCGTT
GTTCATACAG ATAAAGAGGG TAAGGTAACA ACAGTAAATG GAGATATTCA CCCAGCTGCT
GAAGAGAACC TAAAAGGGGA TACAAAAGCA AAAATCACAA AAGAAACAGC TCTTTCAAAT
GCTTGGAAAC ATATTAAACT TACAAAGAAT GATACTTTAG TAAAAGTGGA TGGAAATACG
TTAGATCAAG TAAAAGAAAA CTTAGAATCT ACAAATGAAA AAGCAGACTT AGTTGTATAT
GAAAAAGACG GAACTTATTA TCTAGCGTTT AAAGTACAAC TGCAATTTAT CAAACCTTAC
GGAGCGAACT GGCAGATTTA TGTGAATGCG GAAGATGGAA CAATTATAGA TTCATATAAC
GCAGTTACAG ATGCAGATAG TCCTCGAAAA GGATATGGAT ACGGAGTATT AGGTGATCGA
AAAGAATTGA ATACAACTTT TGACAGTGTA AAAGGGAAAT ACTATTTAAA GGATACGACA
AAGCCTATGA ATGGAGGGTA TATTGAAACA TTTACGGTAA ATCATAGTAA TGCAGATTAC
CCAGTTAACT ATCGTTTATG GGATGATGAT AATGCTTGGA TAAATAAAGA GCAAAGACCT
GCGGTTGATG CTCATTATCA TGCAGGAAAA GTCTATGATT ACTATAAAAA TGTTCATAAT
CGCAACAGTT TTGATGGAAA AGGAAAAACA ATTCGTTCTG GTGTGAATTA TGGAGTGAAT
GTAAATAATG CATTTTGGAA TGGACAGCAA ATGGTTTATG GAGATGGCGA TGGGCGCGTA
TTCGCTCCTC TTTCTGGTTC TCTTGATGTT GTTGCGCACG AACTAACTCA TGCTGTGACA
CAATATTCAG CTGATCTTCG TTATGTAAAT CAATCCGGTG CATTAAATGA ATCGTTCTCT
GACGTATTTG GATATTTTGT GGATCCTGCA AACTGGGATT TAGGAGAAGC TGTATATACA
CCTGGTATTT CTGGAGATGC ACTTCGTAGT TTATCAAACC CTGAGAAATA TGGACAACCT
TCTCATATGA GGGATTATCA ATATCTTCCG GCAACTGAAG AAGGAGATAA TGGTGGTGTG
CATATTAATA GTGGTATCCC AAATAAGGCT GCATATTTGA CAATTAATGC TATTGGTAAA
GAAAAAGCAG AAAAAATCTA TTATCGTGCG TTAACAACAT ATTTAACACC GACAAGTGAC
TTTAAACAAG CTCGTACAGC TTTACTACAA TCTGCAGCTG ATTATGATGG TTATGGTAGT
GCAACATATA AAGCAGTAGA AACGGCTTGG AATCAAGTAG GAGTAAAATA G
 
Protein sequence
MKKTVITLLA AGTMLGAPFS TAFAEEQALQ KEAMDKMEIQ QKNWNEGQGS PAFLSGELSN 
KKVESQKAVK EFLEENKELF KINPQTDLTL KEVKSDDLGM KHYVYTRSVN KVPVDGAQFV
VHTDKEGKVT TVNGDIHPAA EENLKGDTKA KITKETALSN AWKHIKLTKN DTLVKVDGNT
LDQVKENLES TNEKADLVVY EKDGTYYLAF KVQLQFIKPY GANWQIYVNA EDGTIIDSYN
AVTDADSPRK GYGYGVLGDR KELNTTFDSV KGKYYLKDTT KPMNGGYIET FTVNHSNADY
PVNYRLWDDD NAWINKEQRP AVDAHYHAGK VYDYYKNVHN RNSFDGKGKT IRSGVNYGVN
VNNAFWNGQQ MVYGDGDGRV FAPLSGSLDV VAHELTHAVT QYSADLRYVN QSGALNESFS
DVFGYFVDPA NWDLGEAVYT PGISGDALRS LSNPEKYGQP SHMRDYQYLP ATEEGDNGGV
HINSGIPNKA AYLTINAIGK EKAEKIYYRA LTTYLTPTSD FKQARTALLQ SAADYDGYGS
ATYKAVETAW NQVGVK
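
The sequences above are printed in space-separated blocks of ten residues, so they need to be joined before any analysis. The sketch below is an illustration only: the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input, so the printed GC value describes that line and not the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used to print 10-residue blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First line of the gene sequence from this record, used as sample input.
first_line = ("ATGAAAAAAA CTGTTATTAC ATTGCTTGCT "
              "GCAGGAACAA TGTTAGGTGC ACCTTTTTCA")

seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Applying the same two helpers to the full gene sequence would allow a comparison with the GC content field reported in the record.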