Gene Mchl_1052 details


Gene Information

Locus tag: Mchl_1052
Symbol:
ID: 7118555
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium chloromethanicum CM4
Kingdom: Bacteria
Replicon accession: NC_011757
Strand:
Start bp: 1067543
End bp: 1070701
Gene Length: 3159 bp
Protein Length: 1052 aa
Translation table: 11
GC content: 73%
IMG OID: 643523845
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_002419887
Protein GI: 218529071
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
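
The coordinate and length fields above can be checked against one another. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by a complete CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 1067543, 1070701
length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))  # prints: 3159 1052
```

Under these assumptions the computed values agree with the Gene Length (3159 bp) and Protein Length (1052 aa) fields.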


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.865489
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGAGTG CCTTCGTTCG GTCCATGCTC GGATGGTCCT GCCGCGTCCT GACTGCGGCA 
GTGCTCCTCT TCTCCGGCTG CCTCGTCGCC GAAGCGCAGA TGCGCGGCCC GGCCGGCTCC
GGTACCGGAG CCCCCTACGG TGGCGGCGGT TCCGGAGGGG GCGGCTACGG TGGCGGCTAT
CGCGGCGGCC CGATGATCGG CCTGCCGGGC ATGATCGGCA TCATTCCGCG GCTGATCCCA
CCGACGCGAC AGCGGGTGGA AGAGGTCGAG GACGAGGAAC CGCCGCCGCG GCGGCCGGTC
CGCCAACCCC AGTACGAGGA CGAGCCTCCC GCATACCGGC CAAGGCCACG TCCCCGCCCG
CAGGCCCCCC CGCCCGTTCA TCGCGCGGAG CCGGCACCGA CACCACATCC GAAGCAGGCC
GCACCCGTTC GCGAACCTCC GCCCAAGGTG GTGCAGAAGC CGGAGCGCCC GAAGCCCGTC
GCGGCGCCGG TCGCCAAGCC GCAGCCGCCG AAGCCGGCAC TCGCTAAGGC ACCGCCTCCG
CCGCGTCGTC AGGCGCCCGC TCCCGCGCTG GTCCCAGCCG CAAGCCAACC GGCCCCCGTG
GACCCAGGCG AGGTTCCCGG CGAGGTCCTG TTCGTCCTCA AGGCGGAGGT GCCCGCCGAG
AGCCTTCCGG AGATCCTGCG CCGCGAGCGT CTGGCGCTGA TCTCCGCGGA CACCTTCACG
CTTGTGCCTG TGACCCTGCA CCGTACCCGT ATCCGCGACC GCCGGTCGGT CGCGGAGGTG
GTCGCCGCCC TGTCGCGCGA CCCTCGCGTC GCCTCGGCAC AGGCGAACCA CGTCTACGCG
CTCGTCGGGG AGGCCATGCC GACCCTCGCC GGTGCGCAGT ACGTCGTGGG CAAGCTCCGC
CTGAAGGAGG CGCACGCCTC GGCCACGGGC AAGGACGTGA CCGTCGCCGT GATCGATTCC
GACGTCGACC TGGGACATCC CTCGCTGCAG GGTGCCGTGG CGAACCGGCA CGACGCGCTG
GACGGCGGCA AGCCGGCGGC GGCCCACCCG CACGGGACGG CCATCGCGGG CATCGTCGGC
GCCCGCGCCC AACTGGCCAG CGCGGCACCG GAAGCCAGCC TCCTCGCGGT GCGCGCCTTC
TCGGGCGAGA CGCGGGCCGG GGCGCAGGGC ACCACGCTGC ACGTCCTGCG AGCCCTGGAC
TGGTCCGGGA AAATGGGCGC GAGAGTGGTC AACATGAGCT TCGCCGGTCC TTGGGATGCC
GCGCTCTCCG AGTTCCTGGC CGCCGGAACC GGGCGCGGCG TCGTCTACGT GGCCGCGGGT
GGCAATGCCG GCCCGGCCTC GCCGCCCCTG TTCCCGGCCG CCGACCCAAA CGTCATCGCC
GTGACCGCGA CGGACGCCGA GGACAGGCTC TTCCCGGCGG CCAACCGCGG CTCACACCTC
TGCGTTGCCG CGCCGGGCGT CGACATCCTC GTGGCCGCGC CGAACGGCGG ATACGGGCTT
CTCTCGGGCA CCTCGACGGC GGCCGCGCAG GTCAGCGGCG TGGTCGCCCT GATGCTGCAG
GCGAGGCCCG ACCTGAAGCC CGCCGAAGTG CGGGCCGCCC TGACGCGCAG CGCCCGCGAC
CTCGGACCGC CGGGACCGGA CCGGGAGTTC GGCGCCGGCT TCGCCGACGC GGAGGGTGCT
ATGCGGTCCC TGACGGCCCC CATGGCCGTG CAGGAGCCGG CCGGACCCGT ACCGGCGGGT
GATCCGCCCC CACTGAAGTG GCTCTTCGTC GGGTTTGGTG GATCACGCTG CGAGGAGGAA
GCGGCGACGC AGGAGGTCGA GCTTGGCGCG GCCATACATC GAACGCTTCA ACAGCTTGAG
GCGGTTGACC TGCCCCTCGT CCTGCCCGCT GCTCCAGGGC AGGCGAAGGC CTGCACGAAC
AGCGGCTCCA TCCTGAGCGA GGCTGACGGC GAAGCTCTCG ACCACCCGCA CACCGCAGGC
ACGGGCGTCG GACAACCAAG CGTCGAAGGC GGTGGTGATG CCGGGCTTGG CCGGCGCGGC
GCCGCAACGG CTGCGAACGA TTCGACAGAA CCGCCGACCG AGCCCGACGA CCTTGGCAGC
CTCGTCATCC TGCAGCACGC GCGCGACCAC TGCTGCGGCG TCGGCGTCGA GGTCATCCGG
CTCGCGCAGG AGGTGCCACG ACAACTGCTT CGGCGAGGGC AGCGGCGGCG ACGGGGGCGC
GACCTGCGCC ATGGGCGACG GCGTCTTGAG GCGCCAGATC GTGGTCCTGG CGGGGCGCGT
GCGCCGCTCG GACAGCCAGC GTCTGACCTG CTTGACGGTG CCGGGAAAGC CGCGATCCCG
CAGCTCTCGC CAGAGCTGCA TGGCGTTCTC GCACCCCGCA GCCTGGCGAG CGTGCAGGTG
ATCGAGATGA GGATCAAGGA TGCTGCGCCC CGGCCCGCGC CCGTCATGGC GTGGGAAGCC
CTCGGCGAAG GCGTACTTGC GCACCGTGGC GCGAGCCAAG CCGGTCTCGC GATTGATGAG
CCGGAGCGAC TGCCCGGCCG CACGGCGGCG ACGGACATCG TCGTAGAGCT CCTCCCACCG
ACCGACCGCT GCAGCGCGGG CGAGCGTCTC GGCGGGTGCA CGCGGATAGG CTCTGGTGCG
CCGGGTCGAA GGCGCTGGCG CCGTGATCGG CGGCAAGAGC TTCAGGCGCG GGTGGACACG
GGCGAGCCAA CGCTCGATCA TCTGGCGGGT GTTGAGCAGG AGATGCCACC GGTCAGCGAC
CTGCACGGCC GCCGGCGCGC CAAGCGTGGT GCCGCGGGCA TACTCGGTCG AACGATCGCG
CGCCACGAGC CGGATCTGTG GCTGGCGGCG GAGCCATGCG GCCCAGGTCT CGGCCGAGCG
GTCAGGGAGC AGATCGAGAG GGCGATGGCG TTCGAGGTCG ACGACGATCG TGCCGTAGGT
CCGCCCTTTG CGCAGCGCCC AGTCATCGAC GCCGACGACA CAGGGTCGAG GCGCCTTCGG
CAGCGGCACT CCCCGGATCG TCCGCAGCAG AGTCGTGGCG CTGGACGGCA TGGCCAGGTG
TGCGAGCAAC CGGGCAGCTG GCTGTCCGCC GAGTGCGAGA CCGGTCCGGG CCTGCGCCCC
GGCCAGCCGG CGGGTGCGTT GGGCATGGCG GGCGAGTAG
 
Protein sequence
MRSAFVRSML GWSCRVLTAA VLLFSGCLVA EAQMRGPAGS GTGAPYGGGG SGGGGYGGGY 
RGGPMIGLPG MIGIIPRLIP PTRQRVEEVE DEEPPPRRPV RQPQYEDEPP AYRPRPRPRP
QAPPPVHRAE PAPTPHPKQA APVREPPPKV VQKPERPKPV AAPVAKPQPP KPALAKAPPP
PRRQAPAPAL VPAASQPAPV DPGEVPGEVL FVLKAEVPAE SLPEILRRER LALISADTFT
LVPVTLHRTR IRDRRSVAEV VAALSRDPRV ASAQANHVYA LVGEAMPTLA GAQYVVGKLR
LKEAHASATG KDVTVAVIDS DVDLGHPSLQ GAVANRHDAL DGGKPAAAHP HGTAIAGIVG
ARAQLASAAP EASLLAVRAF SGETRAGAQG TTLHVLRALD WSGKMGARVV NMSFAGPWDA
ALSEFLAAGT GRGVVYVAAG GNAGPASPPL FPAADPNVIA VTATDAEDRL FPAANRGSHL
CVAAPGVDIL VAAPNGGYGL LSGTSTAAAQ VSGVVALMLQ ARPDLKPAEV RAALTRSARD
LGPPGPDREF GAGFADAEGA MRSLTAPMAV QEPAGPVPAG DPPPLKWLFV GFGGSRCEEE
AATQEVELGA AIHRTLQQLE AVDLPLVLPA APGQAKACTN SGSILSEADG EALDHPHTAG
TGVGQPSVEG GGDAGLGRRG AATAANDSTE PPTEPDDLGS LVILQHARDH CCGVGVEVIR
LAQEVPRQLL RRGQRRRRGR DLRHGRRRLE APDRGPGGAR APLGQPASDL LDGAGKAAIP
QLSPELHGVL APRSLASVQV IEMRIKDAAP RPAPVMAWEA LGEGVLAHRG ASQAGLAIDE
PERLPGRTAA TDIVVELLPP TDRCSAGERL GGCTRIGSGA PGRRRWRRDR RQELQARVDT
GEPTLDHLAG VEQEMPPVSD LHGRRRAKRG AAGILGRTIA RHEPDLWLAA EPCGPGLGRA
VREQIERAMA FEVDDDRAVG PPFAQRPVID ADDTGSRRLR QRHSPDRPQQ SRGAGRHGQV
CEQPGSWLSA ECETGPGLRP GQPAGALGMA GE
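
The sequences above are printed in space-separated blocks of ten characters, six blocks per line, so they need to be joined before any computation. The sketch below shows one way to do that and to compute a GC fraction. The helper names `join_blocks` and `gc_fraction` are hypothetical, and the sketch does not claim to reproduce how the record's "GC content" field was derived. The example input is only the first three blocks of the gene sequence, so its GC fraction is not expected to match the value reported for the whole gene.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already-joined nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence, copied from the record.
example = "ATGCGGAGTG CCTTCGTTCG GTCCATGCTC"
seq = join_blocks(example)
print(len(seq), gc_fraction(seq))
```

To apply this to the full gene, pass the complete "Gene sequence" text (all lines) to `join_blocks`; the joined length should then equal the Gene Length field if the record is internally consistent.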