Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mchl_1052 |
Symbol | |
ID | 7118555 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011757 |
Strand | + |
Start bp | 1067543 |
End bp | 1070701 |
Gene Length | 3159 bp |
Protein Length | 1052 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643523845 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_002419887 |
Protein GI | 218529071 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.865489 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGAGTG CCTTCGTTCG GTCCATGCTC GGATGGTCCT GCCGCGTCCT GACTGCGGCA GTGCTCCTCT TCTCCGGCTG CCTCGTCGCC GAAGCGCAGA TGCGCGGCCC GGCCGGCTCC GGTACCGGAG CCCCCTACGG TGGCGGCGGT TCCGGAGGGG GCGGCTACGG TGGCGGCTAT CGCGGCGGCC CGATGATCGG CCTGCCGGGC ATGATCGGCA TCATTCCGCG GCTGATCCCA CCGACGCGAC AGCGGGTGGA AGAGGTCGAG GACGAGGAAC CGCCGCCGCG GCGGCCGGTC CGCCAACCCC AGTACGAGGA CGAGCCTCCC GCATACCGGC CAAGGCCACG TCCCCGCCCG CAGGCCCCCC CGCCCGTTCA TCGCGCGGAG CCGGCACCGA CACCACATCC GAAGCAGGCC GCACCCGTTC GCGAACCTCC GCCCAAGGTG GTGCAGAAGC CGGAGCGCCC GAAGCCCGTC GCGGCGCCGG TCGCCAAGCC GCAGCCGCCG AAGCCGGCAC TCGCTAAGGC ACCGCCTCCG CCGCGTCGTC AGGCGCCCGC TCCCGCGCTG GTCCCAGCCG CAAGCCAACC GGCCCCCGTG GACCCAGGCG AGGTTCCCGG CGAGGTCCTG TTCGTCCTCA AGGCGGAGGT GCCCGCCGAG AGCCTTCCGG AGATCCTGCG CCGCGAGCGT CTGGCGCTGA TCTCCGCGGA CACCTTCACG CTTGTGCCTG TGACCCTGCA CCGTACCCGT ATCCGCGACC GCCGGTCGGT CGCGGAGGTG GTCGCCGCCC TGTCGCGCGA CCCTCGCGTC GCCTCGGCAC AGGCGAACCA CGTCTACGCG CTCGTCGGGG AGGCCATGCC GACCCTCGCC GGTGCGCAGT ACGTCGTGGG CAAGCTCCGC CTGAAGGAGG CGCACGCCTC GGCCACGGGC AAGGACGTGA CCGTCGCCGT GATCGATTCC GACGTCGACC TGGGACATCC CTCGCTGCAG GGTGCCGTGG CGAACCGGCA CGACGCGCTG GACGGCGGCA AGCCGGCGGC GGCCCACCCG CACGGGACGG CCATCGCGGG CATCGTCGGC GCCCGCGCCC AACTGGCCAG CGCGGCACCG GAAGCCAGCC TCCTCGCGGT GCGCGCCTTC TCGGGCGAGA CGCGGGCCGG GGCGCAGGGC ACCACGCTGC ACGTCCTGCG AGCCCTGGAC TGGTCCGGGA AAATGGGCGC GAGAGTGGTC AACATGAGCT TCGCCGGTCC TTGGGATGCC GCGCTCTCCG AGTTCCTGGC CGCCGGAACC GGGCGCGGCG TCGTCTACGT GGCCGCGGGT GGCAATGCCG GCCCGGCCTC GCCGCCCCTG TTCCCGGCCG CCGACCCAAA CGTCATCGCC GTGACCGCGA CGGACGCCGA GGACAGGCTC TTCCCGGCGG CCAACCGCGG CTCACACCTC TGCGTTGCCG CGCCGGGCGT CGACATCCTC GTGGCCGCGC CGAACGGCGG ATACGGGCTT CTCTCGGGCA CCTCGACGGC GGCCGCGCAG GTCAGCGGCG TGGTCGCCCT GATGCTGCAG GCGAGGCCCG ACCTGAAGCC CGCCGAAGTG CGGGCCGCCC TGACGCGCAG CGCCCGCGAC CTCGGACCGC CGGGACCGGA CCGGGAGTTC GGCGCCGGCT TCGCCGACGC GGAGGGTGCT ATGCGGTCCC TGACGGCCCC CATGGCCGTG CAGGAGCCGG CCGGACCCGT ACCGGCGGGT GATCCGCCCC CACTGAAGTG GCTCTTCGTC GGGTTTGGTG GATCACGCTG CGAGGAGGAA GCGGCGACGC AGGAGGTCGA GCTTGGCGCG GCCATACATC GAACGCTTCA ACAGCTTGAG GCGGTTGACC TGCCCCTCGT CCTGCCCGCT GCTCCAGGGC AGGCGAAGGC CTGCACGAAC AGCGGCTCCA TCCTGAGCGA GGCTGACGGC GAAGCTCTCG ACCACCCGCA CACCGCAGGC ACGGGCGTCG GACAACCAAG CGTCGAAGGC GGTGGTGATG CCGGGCTTGG CCGGCGCGGC GCCGCAACGG CTGCGAACGA TTCGACAGAA CCGCCGACCG AGCCCGACGA CCTTGGCAGC CTCGTCATCC TGCAGCACGC GCGCGACCAC TGCTGCGGCG TCGGCGTCGA GGTCATCCGG CTCGCGCAGG AGGTGCCACG ACAACTGCTT CGGCGAGGGC AGCGGCGGCG ACGGGGGCGC GACCTGCGCC ATGGGCGACG GCGTCTTGAG GCGCCAGATC GTGGTCCTGG CGGGGCGCGT GCGCCGCTCG GACAGCCAGC GTCTGACCTG CTTGACGGTG CCGGGAAAGC CGCGATCCCG CAGCTCTCGC CAGAGCTGCA TGGCGTTCTC GCACCCCGCA GCCTGGCGAG CGTGCAGGTG ATCGAGATGA GGATCAAGGA TGCTGCGCCC CGGCCCGCGC CCGTCATGGC GTGGGAAGCC CTCGGCGAAG GCGTACTTGC GCACCGTGGC GCGAGCCAAG CCGGTCTCGC GATTGATGAG CCGGAGCGAC TGCCCGGCCG CACGGCGGCG ACGGACATCG TCGTAGAGCT CCTCCCACCG ACCGACCGCT GCAGCGCGGG CGAGCGTCTC GGCGGGTGCA CGCGGATAGG CTCTGGTGCG CCGGGTCGAA GGCGCTGGCG CCGTGATCGG CGGCAAGAGC TTCAGGCGCG GGTGGACACG GGCGAGCCAA CGCTCGATCA TCTGGCGGGT GTTGAGCAGG AGATGCCACC GGTCAGCGAC CTGCACGGCC GCCGGCGCGC CAAGCGTGGT GCCGCGGGCA TACTCGGTCG AACGATCGCG CGCCACGAGC CGGATCTGTG GCTGGCGGCG GAGCCATGCG GCCCAGGTCT CGGCCGAGCG GTCAGGGAGC AGATCGAGAG GGCGATGGCG TTCGAGGTCG ACGACGATCG TGCCGTAGGT CCGCCCTTTG CGCAGCGCCC AGTCATCGAC GCCGACGACA CAGGGTCGAG GCGCCTTCGG CAGCGGCACT CCCCGGATCG TCCGCAGCAG AGTCGTGGCG CTGGACGGCA TGGCCAGGTG TGCGAGCAAC CGGGCAGCTG GCTGTCCGCC GAGTGCGAGA CCGGTCCGGG CCTGCGCCCC GGCCAGCCGG CGGGTGCGTT GGGCATGGCG GGCGAGTAG
|
Protein sequence | MRSAFVRSML GWSCRVLTAA VLLFSGCLVA EAQMRGPAGS GTGAPYGGGG SGGGGYGGGY RGGPMIGLPG MIGIIPRLIP PTRQRVEEVE DEEPPPRRPV RQPQYEDEPP AYRPRPRPRP QAPPPVHRAE PAPTPHPKQA APVREPPPKV VQKPERPKPV AAPVAKPQPP KPALAKAPPP PRRQAPAPAL VPAASQPAPV DPGEVPGEVL FVLKAEVPAE SLPEILRRER LALISADTFT LVPVTLHRTR IRDRRSVAEV VAALSRDPRV ASAQANHVYA LVGEAMPTLA GAQYVVGKLR LKEAHASATG KDVTVAVIDS DVDLGHPSLQ GAVANRHDAL DGGKPAAAHP HGTAIAGIVG ARAQLASAAP EASLLAVRAF SGETRAGAQG TTLHVLRALD WSGKMGARVV NMSFAGPWDA ALSEFLAAGT GRGVVYVAAG GNAGPASPPL FPAADPNVIA VTATDAEDRL FPAANRGSHL CVAAPGVDIL VAAPNGGYGL LSGTSTAAAQ VSGVVALMLQ ARPDLKPAEV RAALTRSARD LGPPGPDREF GAGFADAEGA MRSLTAPMAV QEPAGPVPAG DPPPLKWLFV GFGGSRCEEE AATQEVELGA AIHRTLQQLE AVDLPLVLPA APGQAKACTN SGSILSEADG EALDHPHTAG TGVGQPSVEG GGDAGLGRRG AATAANDSTE PPTEPDDLGS LVILQHARDH CCGVGVEVIR LAQEVPRQLL RRGQRRRRGR DLRHGRRRLE APDRGPGGAR APLGQPASDL LDGAGKAAIP QLSPELHGVL APRSLASVQV IEMRIKDAAP RPAPVMAWEA LGEGVLAHRG ASQAGLAIDE PERLPGRTAA TDIVVELLPP TDRCSAGERL GGCTRIGSGA PGRRRWRRDR RQELQARVDT GEPTLDHLAG VEQEMPPVSD LHGRRRAKRG AAGILGRTIA RHEPDLWLAA EPCGPGLGRA VREQIERAMA FEVDDDRAVG PPFAQRPVID ADDTGSRRLR QRHSPDRPQQ SRGAGRHGQV CEQPGSWLSA ECETGPGLRP GQPAGALGMA GE
|
| |