Gene Msil_1362 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_1362 
Symbol 
ID7091700 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp1470470 
End bp1472020 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content67% 
IMG OID643464700 
Productpeptidase M48 Ste24p 
Protein accessionYP_002361689 
Protein GI217977542 
COG category[R] General function prediction only 
COG ID[COG4784] Putative Zn-dependent protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.0606393 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTTTCA TGAGAATTGT TGGCGCTCGA CATCTTGCCG GCGTCGCTCC CCGCCGCAAA 
GCCCTGACGA CCCGGCCTCC CGCGTCCTCG CTGCGCGCCG CGATCCTCGT CGCATTCGCC
GCATCCCTCG GGCTCGCGGG CTGCGCGTCG ATCGAGCCGC AGGGCGAGCA ATCCTTCAAA
CTCGAGACGC CGCCGCTGCC GCCGCGCCCG CCAAAGCCGG AAAGCGAGGC GAGCGCCGAG
CACAACCGCA TGGTCGCCCT GTTCGACGGC GAGTACAAGG ACCCCGCCGC CGAGCGCTAT
CTCAATGATA TTCTGGCGAA GCTCGCCAAG GCCGACGACC GCCGCAGCGA GCCCTATAAG
GTGACGATTC TCGATTCGCC GATCGTCAAC GCTTTCGCGC TGCCGCCGCA CGATCTCTTC
ATCACCCGCG GCCTGCTGGC GCTCGCCAAT GACGCGTCGG AAGTCGCGGC CGTCATGGCC
CATGAGATCG CCCATCTCAC TGCAAAGCAC GCGGTCCGTC GGGAGGAAGA GGAAAAGCGC
GCGGCGGTGA TCAGCCGGGC CGCGAGCGTC GTCCAGAACA AGGAAAAGGG CCAGGAGATC
GAAGCCTCCG CGCGGCGCAC GATCGCGACC TTCTCGCGCC AGCAGGAGCT CGACGCCGAC
CAGATGGGCA TCAAGGGGAT CGCCAAGGCT GGCTTCGACC CTTACGGCGC CTCGCGTTTC
CTTGGCGCGC TCGGACGATC CGCCGTCCTG CGCACCTCGC TGATCGGCCA GAACGCCAGC
GCGGACAAGC CGGATATCCT CGCAACCCAC CCTTCGACGC CGGAGCGCGT GACGCAGGCG
ATCGCCGTCG CGCGCCAGAT CGCCGCCCCG GGCATCGGCG TCACGGACCG CGACGCCTAT
CTTTCCGCCA TCGACGGCAT GGTGTTCGGC GACAGTCCAA CGGAAGGAAC GGTGCGCGGA
CGCAACTTCC TTCACGCCCG GCTCGGCTTC GCCTTTGCCG CTCCCGAAGG CTTTGTCCTC
GAGAATGCGT CGAAGGCGCT GCTGGGCGTC TCGGCCGACG GCAATCAGGC GCTGCGGCTC
GACAGCGTCA AGGCCCCGCC GGGCATGTCG CTTGAGGCCT ATATGGCCTC CGACTGGATC
GACGGTCTGG AACAGGGCTC GATCGAGACC GTCGACGTCA ACGGTTCGCC GGCGGCCATC
GCCAGAGCCA AAGCCGGCGA CTGGAGTTTT CGGCTCGCCG TGATCCGCTT CGAGGCGGGC
GAGTTCTACC GGCTGATCTT CGCGACCCGC ATCGAAAGCC AAGACAGCGA GCGGCAGTTC
AAGGACGCGC TCTCGTCGTT CCATCGCGCA AGTCCGGAGG AAATCCGCGC CGTACACCCG
CTGCGAATCG AAATCGTCAC GGCAAAGCCC GGCGAGCGCG CGGAAGATCT CGCGGAGAAA
ATGGCGACGC CCGACCGCAC GCTCGAATTC TTCCGGCTGA TCAACGGCCT TGAGGCCTCG
GCGCCGCTGC AGGCGGGCGA GCGCTACAAG ATCGTCGCCG AGCAGAAATA G
 
Protein sequence
MPFMRIVGAR HLAGVAPRRK ALTTRPPASS LRAAILVAFA ASLGLAGCAS IEPQGEQSFK 
LETPPLPPRP PKPESEASAE HNRMVALFDG EYKDPAAERY LNDILAKLAK ADDRRSEPYK
VTILDSPIVN AFALPPHDLF ITRGLLALAN DASEVAAVMA HEIAHLTAKH AVRREEEEKR
AAVISRAASV VQNKEKGQEI EASARRTIAT FSRQQELDAD QMGIKGIAKA GFDPYGASRF
LGALGRSAVL RTSLIGQNAS ADKPDILATH PSTPERVTQA IAVARQIAAP GIGVTDRDAY
LSAIDGMVFG DSPTEGTVRG RNFLHARLGF AFAAPEGFVL ENASKALLGV SADGNQALRL
DSVKAPPGMS LEAYMASDWI DGLEQGSIET VDVNGSPAAI ARAKAGDWSF RLAVIRFEAG
EFYRLIFATR IESQDSERQF KDALSSFHRA SPEEIRAVHP LRIEIVTAKP GERAEDLAEK
MATPDRTLEF FRLINGLEAS APLQAGERYK IVAEQK