Gene Msil_0188 details


Gene Information

Locus tag: Msil_0188
Symbol:
ID: 7090505
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 192430
End bp: 194004
Gene Length: 1575 bp
Protein Length: 524 aa
Translation table: 11
GC content: 67%
IMG OID: 643463522
Product: protease Do
Protein accession: YP_002360531
Protein GI: 217976384
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
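
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the last codon of the CDS is a stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(192430, 194004)
print(length, protein_length_aa(length))  # prints: 1575 524
```

Both values match the Gene Length and Protein Length fields of this record.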


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value: 0.463194
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTTTCC CTTTGCTTCC TGGTTCTGAA CAGAATGGCC AATCGCGTCC CGCGCCCCGG 
CGCGGCCTTC GGGTCGTCCT GCTTGGCGCC GTGGCGACGG CTGCCTTGAC CGGAGCGCTG
ACGACCGGCT TCGTTTCGCC GCATTCCGCG GTCGCCGAAA CCGCCGCTCC CATCGCGGCG
CAAGCGCCTT CCGCCTCGCC GGTATCCTTC GCCGATGTCG TCGACCATGT TCGCGACGCA
GTCGTTTCGG TCAAGGTCAA GATCACTGAG ACCGCCGACG CGTCGGATGA CGACGATGAC
AACAGCGATT CGCCGCGCCC CGGCCAGATT CCGCGCCTTC AGCCGGGCGA TCCGCTGGAG
CGCTTCTTCA AGCGCTTCGG CCAGCCGGGC ATGCCGCATC CGGGCGGTCC CGGCAAGCCG
CATTCGGCGC AGGCGCAGGG CTCGGGCTTC ATCATTTCGT CGGACGGCTA TGTGGTGACC
AATAACCACG TCGTCGAGAA GGCGACCGAA GTCACGCTCA CCACCGACGA GGGCAAGACC
CTTCATGCGA CGGTCGTCGG CACCGACAAG AAGACCGATC TGGCCTTGTT GAAGATCAAG
GAAGACGGCT CCTATCCTCA CGTGAAATTC TCCAGCGCCA CCCCCCGCGT CGGCGACTGG
GTGATTGCGG TCGGCAATCC GTTCGGCCTT GGCGGAACGG TGACGGCCGG CATCGTCTCG
GCGCGCGGCC GCGACATCGG CGCCGGCCCC TATGACGATT TCCTGCAGAT CGACGCCCCG
GTGAACCGCG GCAACTCGGG CGGCCCGACC TTCAACACGC TGGGCGACGT CGTCGGCGTC
AACACGGCGA TCTTCTCGCC GTCCGGCGGC AGCGTCGGCA TCGGCTTCGC CATCCCCTCG
GAGACGGCGC AGTCGATCAT TGCGAGCCTC AAGGACAAGG GCGCCGTGGC GCGCGGCTGG
ATCGGCGTGC AGATTCAGCC GGTCACCGAT GAGATCGCCG ACAGCCTTGG CCTCAAGTCG
AGCAAGGGCG CACTCGTCGC CGACGCGCAG GACAATTCGC CCGCCAAGGA AGCGGGCATC
AAATCCGGCG ACGTGATCCT CGGCGTCAAT GGCGAGCGCG TCGATGGACC GCGCGATCTC
GCCAAGAAAG TGGCGGCGCT TGGTCCGGGC AAGAAGGCCG ATCTGCTCTA TTGGCACGAC
GGCGCGGAGA AGACCGTAGC GGTGAAGCTC GGCTCGCTCC CCGACGAGAA AGAGGCGGCA
AAGCCGGCGG CGCTGCAGGA TAATTCTGCG CTTGCGGGCC TCGGGCTGAA GCTGGCTCCG
GCGTCTTCCG TGCAAGGCGC GGGCAATGAT GGCGTCGTTG TCGCCGACAT CGATCCCGAA
GGCTCGGCCG CGCAGAAGGG CCTCAGGGTC GGCGATCTCA TCCTCGAGGC CGGCGGCCGC
GCGGTGAGCA AGCCGTCCGA AATTGCGGCG ATTATCGCTG ACGCCAAGAA GGATGGCCGC
AAGGCCGTGC TGCTGCGGGT CAAGAGCGGC GAAGGCACGC GCTTCGTCGC CGTGGCGACC
AACCCCGCTT CCTAA
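
A GC content figure such as the one in the Gene Information section can be recomputed from a sequence laid out in 10-base blocks like the one above. This is a minimal sketch, assuming the sequence contains only unambiguous bases once whitespace is removed; `gc_percent` is a hypothetical helper name, and the database may round or count differently.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence: 5 of 10 bases are G or C.
print(gc_percent("ATGGCTTTCC"))  # prints: 50.0
```

Passing the full gene sequence, with its spaces and line breaks left in place, gives the value for the whole gene.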
 
Protein sequence
MAFPLLPGSE QNGQSRPAPR RGLRVVLLGA VATAALTGAL TTGFVSPHSA VAETAAPIAA 
QAPSASPVSF ADVVDHVRDA VVSVKVKITE TADASDDDDD NSDSPRPGQI PRLQPGDPLE
RFFKRFGQPG MPHPGGPGKP HSAQAQGSGF IISSDGYVVT NNHVVEKATE VTLTTDEGKT
LHATVVGTDK KTDLALLKIK EDGSYPHVKF SSATPRVGDW VIAVGNPFGL GGTVTAGIVS
ARGRDIGAGP YDDFLQIDAP VNRGNSGGPT FNTLGDVVGV NTAIFSPSGG SVGIGFAIPS
ETAQSIIASL KDKGAVARGW IGVQIQPVTD EIADSLGLKS SKGALVADAQ DNSPAKEAGI
KSGDVILGVN GERVDGPRDL AKKVAALGPG KKADLLYWHD GAEKTVAVKL GSLPDEKEAA
KPAALQDNSA LAGLGLKLAP ASSVQGAGND GVVVADIDPE GSAAQKGLRV GDLILEAGGR
AVSKPSEIAA IIADAKKDGR KAVLLRVKSG EGTRFVAVAT NPAS
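
The relationship between the gene sequence and the protein sequence can be sketched with a small translator. The code assumes the amino acid assignments of NCBI translation table 11, which are the same as those of the standard genetic code; table 11 additionally permits alternative start codons, which this sketch does not handle (this gene starts with ATG). The names `CODON_TABLE` and `translate_cds` are illustrative.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(sequence: str) -> str:
    """Translate a coding sequence, ignoring whitespace and dropping trailing stops."""
    bases = "".join(sequence.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = (CODON_TABLE[bases[i:i + 3]] for i in range(0, len(bases), 3))
    return "".join(residues).rstrip("*")


# First 30 bases and last 15 bases of the gene sequence.
print(translate_cds("ATGGCTTTCC CTTTGCTTCC TGGTTCTGAA"))  # prints: MAFPLLPGSE
print(translate_cds("AACCCCGCTT CCTAA"))                  # prints: NPAS
```

The two outputs correspond to the first ten and last four residues of the protein sequence listed above.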