Gene Bcer98_3622 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcer98_3622 
Symbol 
ID5347135 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cytotoxicus NVH 391-98 
KingdomBacteria 
Replicon accessionNC_009674 
Strand
Start bp3679767 
End bp3681203 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content35% 
IMG OID640841118 
Productpeptidase M4 thermolysin 
Protein accessionYP_001376816 
Protein GI152977299 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3227] Zinc metalloprotease (elastase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCAAAT TAGCCGGAAA TGTAGAAAAA CATTTCAAGA TTGTTGGGGA AGAAAAAGAT 
GAGAAGTCGG AGACAACTCA TATTAAGCTA GTTGAGAAAT ATAATGACAT TCCTGTGTAT
GGCTCAGATC AAACGATCAC ATTTGATAAA GAAAATAATG TGAAGGCTTT TTTCGGACAA
GTCATTCCAA ATTTAGAGGA TAAAAATATT CCGACTGCAA CGAGCATTAC GGATGAACAA
GCTGTAAACA TTGCAAAGAA AAATATTGAA AAAGAAATTG GAAAAGTGAA TCAATATGAT
GGTGTAAAAA AAGATTTATA TGTGTATGAA AAAGACGGGA ACTACTATCT TACTTATCTT
GTAAAAGCAT CTATTTCCAA ACCAGCTCCA GGATATTGGC ATTATTTTAT TGATGCAACA
AATGGAAATG TGATTGAGAA ATATAATGCA ATCGATTCTA TTACAGGATT TGGATACGGT
GTATTAGGAA ACAAAGCATC ATTTGAAATT GCTCAAGATG AGAAAACAGG TGTATATAAC
TTGTTTGATG GGAAACGTGG GCAAGGTGTT CATACATTTG ACGCAAAAAA TATGGACGAG
AATATATTTA TAATTTTATC ACAATGGTTT GGTTATACAG GAGAAGAAAT AGAGAGTAAA
TCTAAGTTCT TTGAGGACAA AGCTGCAGTT GACGCACATG TAAATGCCGG AAAAGTATAT
GATTATTATA AAAAGACTTT TAATCGCAAT TCTTTCGATA ATAAAGGTGC AAAGCTCATT
TCAGCTGTTC ACGTAGGTGA GGCTTGGAAT AATGCGGCAT GGAATGGCGT ACAAATGGTA
TATGGTGATG GTGATGGAAA AACATTTATT CCATTATCTG CAGGATTAGA TGTAATTGGT
CATGAATTAA CACATGCTGT AACAGAATAT ACAGCGAATT TAGTTTATCA AAATGAATCA
GGTGCATTGA ACGAATCGAT ATCGGATATT ATGGGTGTTA TGGTTGAGAA GAAAAACTGG
GATATAGGGG CTGATATTTA CACGCCTGAT ATTGAAGGAG ATGCGCTTCG TTCTCTGAAA
GATCCAGCTT CGATTCCAAA TCCGTTAAAG CCAGGTGAAG GGTATCCAGA TCATTATAGC
AAACGCTATG TAGGACCATA TGATAACGGT GGTGTTCATA TTAATAGTAG TATTAATAAT
AAAGCGGCAT ATTTAGTTTC TGAGGGCGGA GAGCATTACG GTGTAAAAGT AACTGGCATT
GGCCGCGAAG CGACAGAAAA AATTTACTAT CATGCGCTTA CGAAATATTT AACTGCAAAT
GCTGATTTTA AAATGATGCG TCAAGCTGCT CTGCAATCTG CTGAAGATTT ATATGGTGAA
AATTCTAAAG CAGTACAAGC TGTAGACAAA GCTTATGAGT CAGTAGGCGT AAAATAA
 
Protein sequence
MFKLAGNVEK HFKIVGEEKD EKSETTHIKL VEKYNDIPVY GSDQTITFDK ENNVKAFFGQ 
VIPNLEDKNI PTATSITDEQ AVNIAKKNIE KEIGKVNQYD GVKKDLYVYE KDGNYYLTYL
VKASISKPAP GYWHYFIDAT NGNVIEKYNA IDSITGFGYG VLGNKASFEI AQDEKTGVYN
LFDGKRGQGV HTFDAKNMDE NIFIILSQWF GYTGEEIESK SKFFEDKAAV DAHVNAGKVY
DYYKKTFNRN SFDNKGAKLI SAVHVGEAWN NAAWNGVQMV YGDGDGKTFI PLSAGLDVIG
HELTHAVTEY TANLVYQNES GALNESISDI MGVMVEKKNW DIGADIYTPD IEGDALRSLK
DPASIPNPLK PGEGYPDHYS KRYVGPYDNG GVHINSSINN KAAYLVSEGG EHYGVKVTGI
GREATEKIYY HALTKYLTAN ADFKMMRQAA LQSAEDLYGE NSKAVQAVDK AYESVGVK