Gene Mchl_4033 details


Gene Information

Locus tag: Mchl_4033
Symbol:
ID: 7118038
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium chloromethanicum CM4
Kingdom: Bacteria
Replicon accession: NC_011757
Strand:
Start bp: 4244507
End bp: 4247494
Gene Length: 2988 bp
Protein Length: 995 aa
Translation table: 11
GC content: 70%
IMG OID: 643526752
Product: sarcosine oxidase, alpha subunit family
Protein accession: YP_002422761
Protein GI: 218531945
COG category: [E] Amino acid transport and metabolism
              [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0404] Glycine cleavage system T protein (aminomethyltransferase)
        [COG0492] Thioredoxin reductase
TIGRFAM ID: [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form
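
The length fields above are mutually consistent, which can be checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive (the record does not say so, but this convention reproduces the reported 2988 bp) and that the protein length excludes the stop codon. The function names are illustrative, not part of any database API.

```python
# Coordinates copied from the record above.
# Assumption: 1-based, inclusive coordinates.
START_BP = 4244507
END_BP = 4247494


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2988
print(protein_length(length_bp))  # 995
```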


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.319981
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.145852
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACAGC CGTTCAGATT GTCCCGCGGC GGACGCGTCG ACCGCACCCG CCCCATCGTC 
TTCGAATTCA ACGGAAAGCC GGTCCACGGA TTTGCTGGCG ACACCGTCGC CTCGGCGCTG
CTGGCCAACG GCATCCACCT CGTCGGACGC TCGTTCAAGT ACCACCGCCC CCGCGGCATC
CTGAGCCACG GCCCCGACGA GCCGAGCGCG CTGCTCTCGG TCGATCGCGG GCCCGGCCGG
ATCGACCCGA ACAACCGCGC CTCCGTGGTC GAGGCGCGCT CGGGCCTGCG CACGACCTCG
CAGAACCATT GGCCGTCGCT CGAATTCGAC GTCGGCGCCG TCAACGACCT CCTCTCGCCG
GTCTTCGTGG CGGGCTTCTA CTACAAGACC TTCATGTGGC CCCGGAAGTT CTGGGACCGG
GTCTATGAGC CGTTCATCCG CGCTGCCGCC GGTCTCGGAA AGGCGCCGAC GGTGGCCGAT
CCCGACCGCT ACGCCAACCG CCACGCCCAT TGCGACGTGC TGATCGTCGG CGCTGGCCCG
GCGGGGCTTG CTGCGGCGCT CGCCGCGGCA CGCACCGGCA AGCGGGTGAT CCTGGCCGAC
GAGGGCGCGG AGCCCGGCGG CACGCTCCTG CACGACACGA CCTCGCAGAT CAACGGACGC
CCGGCGGCGG ACTGGCTCGC CGAGACGCTG GCCGAGCTCG ATGCCCGCGA GAACGTCATC
CTGCTGCCCC GCACCACCGC CTTCGGCTAT TACAACCACA ACCACGTGGC GATGACCGAG
CGTGTCACCG ACCACCTGTC GTCCGCCGCG GGCCAAGCGC CCCGCGAGCG CCTATGGCAG
GTGCGGGCGG AACAGGTCGT GCTCGCTGGT GGCTCCCACG AGCGCCCCCT CGTCTTCGCC
GACAACGACC GGCCGGGCAT CCTGCTCGCC GAGAGCGTGA GGGTCTTCCT CAACCGTTAC
GGCGTGGCGC CGGGCCGCAA GCTCGTCTTC GCTACGAGCG GCGCCTCCGC CTACCAGGCC
GCGCTCGATG CGCGTGCGGC GGGCCTCGAC GTCACCCTCG TCGACCTGCG CCTGGAAGCG
GATTGCGGAC CGGAGTTGGC ACGCCTGCGC AGCGCCGGAG TCGACGTGTT GACCGGCCAC
ACCGTGGTCG GATCGAAGGG CCGGAAGCGC GTCACGGGTC TCATCGTGGC GCCTGTCGGG
AGCGACGGCC GGTGCGGCGG CCGTCGCATT CTCCCTTGCG ACTGCGTCGG CATGTCCGGC
GGCTGGACGC CCGCCGTCCA CCTGTTCTCG CAGTCCCGCG GCAAGCTCGC CTACGATGAG
GGCATCGATG CCTTCGTGCC GAGCCGCTCG GCGCAAGACG AGCGCTCGGC GGGTGCGGCC
CGCGGCACCT ACGACCTCGC CGCCTGCCTC GCGGAGGGCT TTGCCGCCGG TGCCGCCGCG
GCGGGTTCCG ACGCACGGCA GGACTTCAGG GCGACGGAGA CGCTGACCGG TTTCCAGCCG
GTGCGGATCA TGCCCACCGA CGCGAACCCG ACCAAGGTCC GCGCCTTCGT CGATTACCAG
AACGACGTCA CCGCCAAGGA CATCAAGCTC GCGGTGCGCG AGGGCTTCCA GTCGATCGAG
CACGTCAAGC GCTACACCAC GACCGGCATG GCGACCGATC AGGGCAAGAC CTCGAACATG
AACGCGCTCG GCATCGTCGC CGGGCAGCTC GACAAGGCGC TGCCCGCCGT CGGCACCACG
ACCTTCCGGC CGCCCTATAC CCCGGTGACC TTCGGTGCGC TGGTGGGCCC CGCCCGCCAC
GCCCTGTTCG ATCCGATCCG CACCACGCCG ATCCACGAAT GGGCGGAGGC CCACGGCGCC
CTGTTCGAGA ACGTCGCCCT GTGGCGGCGC GCCTGGTACT TCCCGAAGGC CGGCGAGGAC
CTGCACGCGG CGGTCGCCCG CGAGTGCAAG GCGGTGCGCG AGGGCGTCGG CATCTTCGAC
GCCTCGACGC TCGGCAAGAT CGAGATCGTC GGCCGGGATG CGGCCGAGTT CATGAACCGC
CTCTACATCA ACCCCTGGAC CAAGCTCGCG CCGGGCCGCT GCCGCTACGG GCTGATGCTC
AAGGAGGACG GCTACATCCT CGACGACGGC GTCGTCGCCC GCGTCTCGGA TACCTGCTTC
CACGTCACCA CCACGACCGG CGGCGCGGCG CGGGTGCTCG GCCATATGGA GGACTACCTC
CAAACCGAGT GGCCGGAGCT TGAAGTGTTC CTGACCTCGA CCACCGAGCA ATGGGCGGTG
ATCGCGCTCC AGGGCCCGAA GGCCCGCGCC GTGATCGCGC CGCTGGTCGA CGGCATCGAC
CTCGCGCCGG ACGCCTTCCC GCATATGGCG ATGCGCTCAG GCACGATCTG CGGCGTGCCG
ACCCGGCTGT TCCGGGTGTC GTTCACCGGT GAACTCGGCT TCGAGATCAA CGTGCCCGCC
GACCACGCCC GCGCGGTCTG GGAGGCGGTG TTCGAGGCGG GCCGGGCCCA CGGCATCACG
CCCTACGGCA CCGAGACGAT GCACGTGCTG CGCGCCGAGA AGGGCTACAT CATCGTCGGC
CAGGAGACCG ACGGCACGGT GACCCCGGAC GATGTCGGCA TGGCCGGGAT GATCCCGAAG
GCCAAGGGGG ACTTCGTCGG CAAGCGCTCG CTGGCGCGCC CCGACGTGGT CGCCACCGGC
CGCAAGCAGC TCGTCGGCCT CATGACCGAC GATCTCAAGC TCGTCCTCGA CGAGGGCGCG
CAGATCGTCA CCGATACCCA TCAGCCGATC CCGATGCGCA TGCTCGGCCA CGTCACGTCG
AGCTACTGGA GCGCCAATTG CGGCCGCTCC ATCGCGCTGG CCCTGGTCGA GGGCGGACGC
GAGCGGATGA ACGGCCATCT CTTCGTCACC ACGCCGGACG GGTTCACCCG CGTCACCGTC
TGCGAACCGG TCTTCTTCGA CGTCCAGGGG GAGCGCATCA ATGCTTGA
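
The gene sequence is displayed in space-separated blocks of 10 bases, so it needs to be flattened before any calculation such as GC content. The following is a minimal sketch of that step; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example uses only the first displayed line, so its result is not expected to equal the 70% reported for the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = (
    "ATGGCACAGC CGTTCAGATT GTCCCGCGGC "
    "GGACGCGTCG ACCGCACCCG CCCCATCGTC"
)

seq = clean_sequence(first_line)
print(len(seq))                   # number of bases on that line
print(f"{gc_fraction(seq):.0%}")  # GC content of that line only
```

To check the full gene, paste all sequence lines into one string and pass it through the same two functions.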
 
Protein sequence
MAQPFRLSRG GRVDRTRPIV FEFNGKPVHG FAGDTVASAL LANGIHLVGR SFKYHRPRGI 
LSHGPDEPSA LLSVDRGPGR IDPNNRASVV EARSGLRTTS QNHWPSLEFD VGAVNDLLSP
VFVAGFYYKT FMWPRKFWDR VYEPFIRAAA GLGKAPTVAD PDRYANRHAH CDVLIVGAGP
AGLAAALAAA RTGKRVILAD EGAEPGGTLL HDTTSQINGR PAADWLAETL AELDARENVI
LLPRTTAFGY YNHNHVAMTE RVTDHLSSAA GQAPRERLWQ VRAEQVVLAG GSHERPLVFA
DNDRPGILLA ESVRVFLNRY GVAPGRKLVF ATSGASAYQA ALDARAAGLD VTLVDLRLEA
DCGPELARLR SAGVDVLTGH TVVGSKGRKR VTGLIVAPVG SDGRCGGRRI LPCDCVGMSG
GWTPAVHLFS QSRGKLAYDE GIDAFVPSRS AQDERSAGAA RGTYDLAACL AEGFAAGAAA
AGSDARQDFR ATETLTGFQP VRIMPTDANP TKVRAFVDYQ NDVTAKDIKL AVREGFQSIE
HVKRYTTTGM ATDQGKTSNM NALGIVAGQL DKALPAVGTT TFRPPYTPVT FGALVGPARH
ALFDPIRTTP IHEWAEAHGA LFENVALWRR AWYFPKAGED LHAAVARECK AVREGVGIFD
ASTLGKIEIV GRDAAEFMNR LYINPWTKLA PGRCRYGLML KEDGYILDDG VVARVSDTCF
HVTTTTGGAA RVLGHMEDYL QTEWPELEVF LTSTTEQWAV IALQGPKARA VIAPLVDGID
LAPDAFPHMA MRSGTICGVP TRLFRVSFTG ELGFEINVPA DHARAVWEAV FEAGRAHGIT
PYGTETMHVL RAEKGYIIVG QETDGTVTPD DVGMAGMIPK AKGDFVGKRS LARPDVVATG
RKQLVGLMTD DLKLVLDEGA QIVTDTHQPI PMRMLGHVTS SYWSANCGRS IALALVEGGR
ERMNGHLFVT TPDGFTRVTV CEPVFFDVQG ERINA
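
The protein sequence can be related back to the gene sequence by translating codon by codon. The following is a minimal sketch, assuming the usual compact encoding of the codon table with bases ordered T, C, A, G; translation table 11 assigns the same amino acids to sense codons as the standard code and differs in its permitted start codons, which this sketch does not model. `translate` is a hypothetical helper, not a library function.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First three displayed blocks and last six codons of the gene sequence.
print(translate("ATGGCACAGC CGTTCAGATT GTCCCGCGGC"))  # MAQPFRLSRG
print(translate("GAGCGCATCA ATGCTTGA"))               # ERINA
```

Both results match the first and last residues of the protein sequence above.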