Gene Information |
Locus tag | Mchl_4033 |
Symbol | |
ID | 7118038 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011757 |
Strand | + |
Start bp | 4244507 |
End bp | 4247494 |
Gene Length | 2988 bp |
Protein Length | 995 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643526752 |
Product | sarcosine oxidase, alpha subunit family |
Protein accession | YP_002422761 |
Protein GI | 218531945 |
COG category | [E] Amino acid transport and metabolism; [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase); [COG0492] Thioredoxin reductase |
TIGRFAM ID | [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
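
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon; the function names are illustrative and not part of the database.

```python
# Values copied from the record above.
START_BP = 4244507
END_BP = 4247494


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2988, matches "Gene Length"
print(protein_length(length_bp))  # 995, matches "Protein Length"
```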
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.319981 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.145852 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACAGC CGTTCAGATT GTCCCGCGGC GGACGCGTCG ACCGCACCCG CCCCATCGTC TTCGAATTCA ACGGAAAGCC GGTCCACGGA TTTGCTGGCG ACACCGTCGC CTCGGCGCTG CTGGCCAACG GCATCCACCT CGTCGGACGC TCGTTCAAGT ACCACCGCCC CCGCGGCATC CTGAGCCACG GCCCCGACGA GCCGAGCGCG CTGCTCTCGG TCGATCGCGG GCCCGGCCGG ATCGACCCGA ACAACCGCGC CTCCGTGGTC GAGGCGCGCT CGGGCCTGCG CACGACCTCG CAGAACCATT GGCCGTCGCT CGAATTCGAC GTCGGCGCCG TCAACGACCT CCTCTCGCCG GTCTTCGTGG CGGGCTTCTA CTACAAGACC TTCATGTGGC CCCGGAAGTT CTGGGACCGG GTCTATGAGC CGTTCATCCG CGCTGCCGCC GGTCTCGGAA AGGCGCCGAC GGTGGCCGAT CCCGACCGCT ACGCCAACCG CCACGCCCAT TGCGACGTGC TGATCGTCGG CGCTGGCCCG GCGGGGCTTG CTGCGGCGCT CGCCGCGGCA CGCACCGGCA AGCGGGTGAT CCTGGCCGAC GAGGGCGCGG AGCCCGGCGG CACGCTCCTG CACGACACGA CCTCGCAGAT CAACGGACGC CCGGCGGCGG ACTGGCTCGC CGAGACGCTG GCCGAGCTCG ATGCCCGCGA GAACGTCATC CTGCTGCCCC GCACCACCGC CTTCGGCTAT TACAACCACA ACCACGTGGC GATGACCGAG CGTGTCACCG ACCACCTGTC GTCCGCCGCG GGCCAAGCGC CCCGCGAGCG CCTATGGCAG GTGCGGGCGG AACAGGTCGT GCTCGCTGGT GGCTCCCACG AGCGCCCCCT CGTCTTCGCC GACAACGACC GGCCGGGCAT CCTGCTCGCC GAGAGCGTGA GGGTCTTCCT CAACCGTTAC GGCGTGGCGC CGGGCCGCAA GCTCGTCTTC GCTACGAGCG GCGCCTCCGC CTACCAGGCC GCGCTCGATG CGCGTGCGGC GGGCCTCGAC GTCACCCTCG TCGACCTGCG CCTGGAAGCG GATTGCGGAC CGGAGTTGGC ACGCCTGCGC AGCGCCGGAG TCGACGTGTT GACCGGCCAC ACCGTGGTCG GATCGAAGGG CCGGAAGCGC GTCACGGGTC TCATCGTGGC GCCTGTCGGG AGCGACGGCC GGTGCGGCGG CCGTCGCATT CTCCCTTGCG ACTGCGTCGG CATGTCCGGC GGCTGGACGC CCGCCGTCCA CCTGTTCTCG CAGTCCCGCG GCAAGCTCGC CTACGATGAG GGCATCGATG CCTTCGTGCC GAGCCGCTCG GCGCAAGACG AGCGCTCGGC GGGTGCGGCC CGCGGCACCT ACGACCTCGC CGCCTGCCTC GCGGAGGGCT TTGCCGCCGG TGCCGCCGCG GCGGGTTCCG ACGCACGGCA GGACTTCAGG GCGACGGAGA CGCTGACCGG TTTCCAGCCG GTGCGGATCA TGCCCACCGA CGCGAACCCG ACCAAGGTCC GCGCCTTCGT CGATTACCAG AACGACGTCA CCGCCAAGGA CATCAAGCTC GCGGTGCGCG AGGGCTTCCA GTCGATCGAG CACGTCAAGC GCTACACCAC GACCGGCATG GCGACCGATC AGGGCAAGAC CTCGAACATG AACGCGCTCG GCATCGTCGC CGGGCAGCTC GACAAGGCGC TGCCCGCCGT CGGCACCACG ACCTTCCGGC CGCCCTATAC CCCGGTGACC TTCGGTGCGC TGGTGGGCCC CGCCCGCCAC 
GCCCTGTTCG ATCCGATCCG CACCACGCCG ATCCACGAAT GGGCGGAGGC CCACGGCGCC CTGTTCGAGA ACGTCGCCCT GTGGCGGCGC GCCTGGTACT TCCCGAAGGC CGGCGAGGAC CTGCACGCGG CGGTCGCCCG CGAGTGCAAG GCGGTGCGCG AGGGCGTCGG CATCTTCGAC GCCTCGACGC TCGGCAAGAT CGAGATCGTC GGCCGGGATG CGGCCGAGTT CATGAACCGC CTCTACATCA ACCCCTGGAC CAAGCTCGCG CCGGGCCGCT GCCGCTACGG GCTGATGCTC AAGGAGGACG GCTACATCCT CGACGACGGC GTCGTCGCCC GCGTCTCGGA TACCTGCTTC CACGTCACCA CCACGACCGG CGGCGCGGCG CGGGTGCTCG GCCATATGGA GGACTACCTC CAAACCGAGT GGCCGGAGCT TGAAGTGTTC CTGACCTCGA CCACCGAGCA ATGGGCGGTG ATCGCGCTCC AGGGCCCGAA GGCCCGCGCC GTGATCGCGC CGCTGGTCGA CGGCATCGAC CTCGCGCCGG ACGCCTTCCC GCATATGGCG ATGCGCTCAG GCACGATCTG CGGCGTGCCG ACCCGGCTGT TCCGGGTGTC GTTCACCGGT GAACTCGGCT TCGAGATCAA CGTGCCCGCC GACCACGCCC GCGCGGTCTG GGAGGCGGTG TTCGAGGCGG GCCGGGCCCA CGGCATCACG CCCTACGGCA CCGAGACGAT GCACGTGCTG CGCGCCGAGA AGGGCTACAT CATCGTCGGC CAGGAGACCG ACGGCACGGT GACCCCGGAC GATGTCGGCA TGGCCGGGAT GATCCCGAAG GCCAAGGGGG ACTTCGTCGG CAAGCGCTCG CTGGCGCGCC CCGACGTGGT CGCCACCGGC CGCAAGCAGC TCGTCGGCCT CATGACCGAC GATCTCAAGC TCGTCCTCGA CGAGGGCGCG CAGATCGTCA CCGATACCCA TCAGCCGATC CCGATGCGCA TGCTCGGCCA CGTCACGTCG AGCTACTGGA GCGCCAATTG CGGCCGCTCC ATCGCGCTGG CCCTGGTCGA GGGCGGACGC GAGCGGATGA ACGGCCATCT CTTCGTCACC ACGCCGGACG GGTTCACCCG CGTCACCGTC TGCGAACCGG TCTTCTTCGA CGTCCAGGGG GAGCGCATCA ATGCTTGA
|
Protein sequence | MAQPFRLSRG GRVDRTRPIV FEFNGKPVHG FAGDTVASAL LANGIHLVGR SFKYHRPRGI LSHGPDEPSA LLSVDRGPGR IDPNNRASVV EARSGLRTTS QNHWPSLEFD VGAVNDLLSP VFVAGFYYKT FMWPRKFWDR VYEPFIRAAA GLGKAPTVAD PDRYANRHAH CDVLIVGAGP AGLAAALAAA RTGKRVILAD EGAEPGGTLL HDTTSQINGR PAADWLAETL AELDARENVI LLPRTTAFGY YNHNHVAMTE RVTDHLSSAA GQAPRERLWQ VRAEQVVLAG GSHERPLVFA DNDRPGILLA ESVRVFLNRY GVAPGRKLVF ATSGASAYQA ALDARAAGLD VTLVDLRLEA DCGPELARLR SAGVDVLTGH TVVGSKGRKR VTGLIVAPVG SDGRCGGRRI LPCDCVGMSG GWTPAVHLFS QSRGKLAYDE GIDAFVPSRS AQDERSAGAA RGTYDLAACL AEGFAAGAAA AGSDARQDFR ATETLTGFQP VRIMPTDANP TKVRAFVDYQ NDVTAKDIKL AVREGFQSIE HVKRYTTTGM ATDQGKTSNM NALGIVAGQL DKALPAVGTT TFRPPYTPVT FGALVGPARH ALFDPIRTTP IHEWAEAHGA LFENVALWRR AWYFPKAGED LHAAVARECK AVREGVGIFD ASTLGKIEIV GRDAAEFMNR LYINPWTKLA PGRCRYGLML KEDGYILDDG VVARVSDTCF HVTTTTGGAA RVLGHMEDYL QTEWPELEVF LTSTTEQWAV IALQGPKARA VIAPLVDGID LAPDAFPHMA MRSGTICGVP TRLFRVSFTG ELGFEINVPA DHARAVWEAV FEAGRAHGIT PYGTETMHVL RAEKGYIIVG QETDGTVTPD DVGMAGMIPK AKGDFVGKRS LARPDVVATG RKQLVGLMTD DLKLVLDEGA QIVTDTHQPI PMRMLGHVTS SYWSANCGRS IALALVEGGR ERMNGHLFVT TPDGFTRVTV CEPVFFDVQG ERINA
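
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. This is a minimal sketch, not the database's own method: it builds the codon-to-amino-acid mapping shared by NCBI translation tables 1 and 11 (the two differ only in permitted start codons) and translates the first 30 and last 12 nucleotides copied from the gene sequence above. The `translate` helper is a hypothetical name.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Tables 1 and 11 share this amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the gene sequence, as grouped in the record.
print(translate("ATGGCACAGC CGTTCAGATT GTCCCGCGGC"))  # MAQPFRLSRG
# Last 12 nucleotides, ending in the TGA stop codon.
print(translate("ATCAATGCTTGA"))  # INA
```

Both fragments agree with the start (MAQPFRLSRG) and end (INA) of the protein sequence listed in the record.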