Gene Msil_2993 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_2993 
Symbol 
ID7093488 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp3304239 
End bp3305669 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content65% 
IMG OID643466304 
Productprotease Do 
Protein accessionYP_002363266 
Protein GI217979119 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value0.700896 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACGG AATTTTTCGC AAGGGGCGCG TCTCGCCTGC GCCGCGCTTA CGGCGTCGCG 
CTAGCGCTTG TCTGCTGCCT TGGCCTTCCC GCCCAGGCGG AGACGGCGCG TCAGGCGCCG
CAGGCCCCAG CCGAGGTGAT GCTCTCTTTC GCGCCCGTGG TGAAAAAGGC GCAACCTGCG
GTCGTCAACG TCTATGCGTC GCGGACGGAG AAGCGGCCGC GCAGCGCACT CTACGACGAT
CCGATTTTCG AGCGGTTTTT TGGCGGCGGC GGCCGTCCCG GCGGCTCGAC CTCGCGCTCG
CTTGGGTCCG GCGTGCTGGT CGATTCCTCG GGCCTCGTCG TCACCAATTA CCACGTCATC
GAAGGCATGA CGGACGTCAA GATCGCGCTT GCCGACAAGC GCGAGTTTGA CGCGGACATT
GTGCTGCGCG ACCAGCGCAC CGATCTTGCT GTGCTGCGGC TGAAGGGCGG CGCCAATTTT
CCGGTGATGG AGCTTGGCGA TTCCGACGCG CTCGAAGTCG GCGACTTCGT GCTGGCCATC
GGCAATCCGT TTGGAGTCGG CCAGACCGTG ACGCAGGGGA TCGTCTCGGC GCTCGCCCGC
ACCCAGGCGG GCATTTCCGA TTCCGGCTTC TTCATCCAGA CCGACGCGGC GATTAATCCC
GGCAATTCCG GCGGCGGCCT CGTCGATATG AGGGGCCGTC TCGTCGGCAT CAACTCGGCG
ATCTTCTCGC AGACGGGCAA TTCGGTCGGC ATCGGCTTCG CCGTCCCGAG CAATATGGTG
CGGGTCGTCA TCGCGGCGGC CAAATCCGGC CAGCGGGTGC GGCGGCCCTG GTTAGGGGCA
AGCCTGCAGG CGGTCTCGCG TGAAATCGCC GATTCGCTCG GCCTCGACCG CCCGTCGGGC
GCCCTGGTCG CCGAGGTGAC GGACGGAGGT CCGGCGGACA AGGCCGGAGT CAAGCGCGGC
GACATCATCG CCGCTGTCGA CGGCCAGACC ATCGACGATC CCGAAAGCTT CGGATATCGC
CTGTCGACCA AAGCGCTCGG CGGCGAGACC TCGCTTTCGC TGGTGCGCAA CGGCAAGCCG
CTGAACGTCA AGCTGGCGCT ATCCCCCGCG GCTGAAATCC CGGCGCGCGA TCCGGTCAAA
CTGAAAGGTC CGTCGCCTTT TTCCGGCGCG ACGGTGATCA ATCTATCCCC GGCTGTCATC
GAGGAAATGT CGGTCCACGG CGTCAACGAT GGGGTCGTGA TCGGCGATAT CGAGGACGGA
TCGACCGCTG CGGAGGTCAA TTTCCAAAAG GGCGACGTCA TTCTTCTCAT CAATGACGTC
AAGATCCAGA CGACCCGCGA TCTTGAGAAG GCGGTCAACG GCCGCCATAC CTATTGGAAG
CTGACCATTT TACGCGGCGG ACAGGTGGAG ACGACGGTGC TCGGGGGCTG A
 
Protein sequence
MTTEFFARGA SRLRRAYGVA LALVCCLGLP AQAETARQAP QAPAEVMLSF APVVKKAQPA 
VVNVYASRTE KRPRSALYDD PIFERFFGGG GRPGGSTSRS LGSGVLVDSS GLVVTNYHVI
EGMTDVKIAL ADKREFDADI VLRDQRTDLA VLRLKGGANF PVMELGDSDA LEVGDFVLAI
GNPFGVGQTV TQGIVSALAR TQAGISDSGF FIQTDAAINP GNSGGGLVDM RGRLVGINSA
IFSQTGNSVG IGFAVPSNMV RVVIAAAKSG QRVRRPWLGA SLQAVSREIA DSLGLDRPSG
ALVAEVTDGG PADKAGVKRG DIIAAVDGQT IDDPESFGYR LSTKALGGET SLSLVRNGKP
LNVKLALSPA AEIPARDPVK LKGPSPFSGA TVINLSPAVI EEMSVHGVND GVVIGDIEDG
STAAEVNFQK GDVILLINDV KIQTTRDLEK AVNGRHTYWK LTILRGGQVE TTVLGG