Gene Msil_0606 details


Gene Information

Locus tag: Msil_0606
Symbol:
ID: 7093684
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 656365
End bp: 657936
Gene Length: 1572 bp
Protein Length: 523 aa
Translation table: 11
GC content: 64%
IMG OID: 643463938
Product: protease Do
Protein accession: YP_002360940
Protein GI: 217976793
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DegQ family
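
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record above.
start_bp = 656365
end_bp = 657936
gene_length_bp = 1572
protein_length_aa = 523

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = span // 3
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # coordinates agree with the gene length
print(span % 3 == 0)                                 # the gene is a whole number of codons
print(expected_protein_length == protein_length_aa)  # codons minus stop agree with the protein length
```

Under these assumptions all three checks print True for this record.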


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.0210206
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCCTGT TCAAGACTGC GCGGCGCGGC GCGATTGCGT CTGCCGCCGG CGGGGCCGCT 
CCGCGAAACG CCTGCTCCCT GCTCGCCATT GCCGCCGTGC TCATGTTTGG AGCCGCGTCC
GTCGCGCCCG GGCTCGGCGC GCCGCCGGCC TATGCGAAGG GACCGGACTC CCTAGCGGAT
CTTGCCGCTG ACGTCAGCGA CGCAGTCGTC AACATCTCGG CGACGCAGAC GATGGACGAA
AAGCGCTCGG GCGGGGCTCC GCAGCTTGAG CCGGGCACGC CCTTCGATGA TCTGTTCGAG
GAGTTCTTCC GCCGCCGCCA GCAAGGCCAA GGCGGGCCGG ACCAACCCAC GCCGCGCGCG
CCGCGCGAGC GCAAGTCGAA TTCGCTCGGC TCAGGCTTCG TCGTCGATCC GTCAGGCATT
ATCATTACCA ATAATCACGT CATCGCCGAC GCCAATGACG TCACGGTGAT TTTTACCGAT
GGACAGAAAC TGAAGGCCGA AGTCCTCGGC AAGGATTCGA AAGTCGACGT CGCCGTCCTG
AAAGTGAAGC CCGACAAGCC GCTGAAGGCG GTCAAATTCG GCGACAGCGA CAAGATGCGC
GTTGGCGATT GGGTCATCGC CGTCGGCAAT CCGTTCGGCC TTGGCGGCAC TGTCACCGCC
GGAATCATTT CCGCCCTGAA ACGGAACATC GATTCCGGCC CCTATGACAA TTATTTCCAG
ACGGACGCCG CGATCAACAA GGGCAATTCG GGCGGACCGC TCTTCAACAT GGCCGGCGAG
GTCGTCGGCA TCAACACCGC GATCCTCTCG CCCTCGGGCG GATCGATCGG TATCGGCTTC
TCGACGCCTG CTGCGACTGT GACGCCGGTC ATCGATCAGC TGCAGAAATT TGGCGAGACG
CGGCGCGGAT GGCTCGGCGT CCGGATCCAG AACGTCGACG ACACGATCGC AGAAACGCTG
AACCTTGGCT CGGTGCGGGG CGCGCTGGTC GCCGGCGCGG ACGACAAGGG ACCGGCTAAG
GCAGCCGGGA TCGAGGCTGG CGACGTCATC TTGAAATTCG ACGGCGTGCC GATCAAGGAA
TCCCACGACC TGCCCAAGAT CGTCGCCTCC GCGCCAGTCG GCAAGGATGT CGAAGTGGTG
CTGCTGCGGC AAGGCAAGGA GATCACTAAG ACGATCAAGC TCGGCCGGCT CGAGGACAAT
GAAAAGCAAA AGGCCGCCTT GACCGTCAGG CCCGGCGACG ACGACAAGCC GCCGGCGGCC
AACGCCTCGA TGGAGCGCGC GCTCGGCATG GCCTTCTCAG GGCTTAATGA CGGCGCGCGC
CGGAAATATT CGATCAAACA GAGCGTCGCC GCCGGCGTCA TTGTGACCGA TGTCGAGCCG
GATTCGGGCG CCGCGGAAAA GCACATCCAA CCCGGCGACG TGATCATGGA GATCAATCAG
GAGCCGGTGA AGGAGCCGGC CGACGTCGCC AAGAAAGTCG CCAAGCTGAA GGACGACGGC
AAGAAGTCGG CGCTGCTTCT CGTCGCCAAT GGCCAGGGAG AAATGCGCTT TGTCGCGCTT
CCCTTCCCAT AA
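
A GC content figure like the one reported in the record can be recomputed from a sequence in this layout. The following is a minimal sketch, assuming GC content means the fraction of G and C bases over all bases; the function name gc_fraction is hypothetical. For brevity it is applied only to the first 60-base line of the gene sequence, so its result is not expected to equal the gene-wide value.

```python
def gc_fraction(dna):
    """Return the fraction of G and C bases, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)

# First line of the gene sequence above (60 bases in blocks of 10).
first_line = "ATGGGCCTGT TCAAGACTGC GCGGCGCGGC GCGATTGCGT CTGCCGCCGG CGGGGCCGCT"

print(gc_fraction(first_line))  # 0.75 for this 60-base line only
```

Passing the full gene sequence to the same function would give the gene-wide fraction to compare against the reported GC content.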
 
Protein sequence
MGLFKTARRG AIASAAGGAA PRNACSLLAI AAVLMFGAAS VAPGLGAPPA YAKGPDSLAD 
LAADVSDAVV NISATQTMDE KRSGGAPQLE PGTPFDDLFE EFFRRRQQGQ GGPDQPTPRA
PRERKSNSLG SGFVVDPSGI IITNNHVIAD ANDVTVIFTD GQKLKAEVLG KDSKVDVAVL
KVKPDKPLKA VKFGDSDKMR VGDWVIAVGN PFGLGGTVTA GIISALKRNI DSGPYDNYFQ
TDAAINKGNS GGPLFNMAGE VVGINTAILS PSGGSIGIGF STPAATVTPV IDQLQKFGET
RRGWLGVRIQ NVDDTIAETL NLGSVRGALV AGADDKGPAK AAGIEAGDVI LKFDGVPIKE
SHDLPKIVAS APVGKDVEVV LLRQGKEITK TIKLGRLEDN EKQKAALTVR PGDDDKPPAA
NASMERALGM AFSGLNDGAR RKYSIKQSVA AGVIVTDVEP DSGAAEKHIQ PGDVIMEINQ
EPVKEPADVA KKVAKLKDDG KKSALLLVAN GQGEMRFVAL PFP
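
The relationship between the gene sequence and the protein sequence can be illustrated by translating codon by codon. This is a sketch, not the database's own pipeline: it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for internal codons, and it stops at the first stop codon. The names CODON_TABLE and translate are hypothetical. Only the first and last lines of the gene sequence are used to keep the example short.

```python
# Standard codon assignments, listed in T, C, A, G order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)

# First and last lines of the gene sequence above; each full line holds
# 60 bases (20 codons), so both lines start in frame.
first_line = "ATGGGCCTGT TCAAGACTGC GCGGCGCGGC GCGATTGCGT CTGCCGCCGG CGGGGCCGCT"
last_line = "CCCTTCCCAT AA"

print(translate(first_line))  # MGLFKTARRGAIASAAGGAA
print(translate(last_line))   # PFP
```

The two outputs match the first 20 and last 3 residues of the protein sequence listed above.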