Gene Mchl_1934 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMchl_1934 
Symbol 
ID7116749 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium chloromethanicum CM4 
KingdomBacteria 
Replicon accessionNC_011757 
Strand
Start bp2000503 
End bp2003529 
Gene Length3027 bp 
Protein Length1008 aa 
Translation table11 
GC content72% 
IMG OID643524698 
Productsarcosine oxidase, alpha subunit family 
Protein accessionYP_002420725 
Protein GI218529909 
COG category[E] Amino acid transport and metabolism
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0404] Glycine cleavage system T protein (aminomethyltransferase)
[COG0492] Thioredoxin reductase 
TIGRFAM ID[TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.857913 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACGC TCGCCCTTCA GCGCGCGGAG GCCCCGGCTT CCGCAAACGC CGGCGACCAG 
CCGTTCCGCA CCGCCACCGG CGGCCTGATC GACCGGAACC GGCCGCGCGA CTTCACCTTC
GACGGACGGC GCCTCACCGG CTGCCACGGC GACACGCTGG CCTCGGCCCT GCTCGCCAAC
GGCGTCCGCC TCGTCGGCCG CTCGTTCAAG TATCACCGGC CGCGCGGCAT CCTCTCGGCC
GGCTCGGAGG AGCCGAACGC GCTGGTGGAG CTGCGCTCGG GCGCCCGGCG CGAGCCCAAC
ACCCGCGCCA CCATGGCCGA GCTCTACGAA GGGCTGGAGG CGACGAGCCA GAACCGCTGG
CCATCCCTCG CCGTGGATGC GCTCTCGGTC AACGCGCTGC TCTCACCGGT CTTCGCGGCG
GGCTTCTACT ACAAGACCTT CATGTGGCCG GCCGGCTTCT GGGAGAAGGT GTACGAGCCG
ATGATCCGGC GCGCGGCCGG GCTCGGTCGG GCCGCCGACG CGCCCGATCC CGACACCTAC
GACCACGCCC ACGCCCATTG CGACGTGCTC GTCATCGGCG GCGGCCCGGC CGGCCTGTCG
GCGGCGCTCG CAGCCGGCCG CTCCGGGGCG CGGGTCATCC TGGTCGACGA GGATTTCGCC
ACCGGCGGGC GCCTGCTCGC CGAGCGGCGC GAGATCGGCG GCGCGAGCGG GGTCGACTGG
GCCGCCCGCG CAGTGGCCGA GTTGGAGAGC CTGCCCGAGG TGCGCATCCT GTCGCGCACC
ACCCTGTTCG GCGTCTACGA TCACGGCGCC TACGGCGCGG TCGAGCGCGT TTCCGATCAT
CTCGCGGTTC CGGCCGCCCA CACCCCGCGC CAGCGGCTGT GGCGGATCGT GGCGCGGCGC
GCGGTACTCG CGGCCGGCGC GATCGAGCGC CCGCACGTCT TCGGCGGCAA CGATCGGCCC
GGCGTGATGC TGGCCGGCGC GGTGCGGACC TATCTCAACC GTTACGGCGT GATGCCGGGG
CGCCGCTTGG CCGTGTTCAC GTCGAGCGAC GACGGCTGGC GCACCGCCGC CGATATCCTC
GCCACGGGCG GCGGGCTCGC GGCGGTGATC GACACCCGCA CCACGGTCCC TCCGCATCTG
CGCCGGATGG CGGAAGCCGC CGGGGCGCGG GTGGTGGCCG GTGGGTATGT CGCCGGCACC
AAGGGGCATC TCGGCCTGAG CGCGATCCAG GTCGTGGACG GCGACAACAG CACCGAGACG
ATTGCCTGCG ACGGCCTCGC CATGGCCAAT GGCTGGAACC CGGTCGTCCA CCTCGATTCG
CACCTGTCGC GCCGGCCGGT CTGGGACGCG GCGATCCACG CCTTCGTGCC GGGCACCCTT
CCCTCCGGCA TGCAGGCGGC CGGCGCCGCC GCCGGACGCT TCACCCTGGC CGAATGCCTG
GAGACCGGCG CCCGGGCCGG CGCCGAGGCC GCGGCCGATT GCGGCTTCAC CGCCTCGGCC
GAGGCCGCTC CGCCGACCGA CCCGGAGAGC GTGGACCACA CCCCGCTCTG GCGCGTGCCA
AAGCCGAAGG GAAAAGCCTT CGTCGATTTC CAGAACGACG TGGCGGCCTC CGACGTCGAA
CTCGCCCACC GTGAGGGCTT TCGCGCGGTC GAACTGTTGA AGCGCTACAC CACGCTTGGC
ATGGCGACCG ACCAGGGCAA GACCTCGAAC CTCGCCGGCC TCTCGATCAT GGCCGAGCTG
ACCGGCAAGG ACATTCCGAG CGTCGGCACC ACGGTGTTCC GCCCGCCCTT CACCCCCGTC
GCCATCGGGG CGTTTGCCGG CCATCACCGC GGCAAGGAGT TCCGCGCCAC CCGCCATGTC
CCCTCCCATG CCTGGGCGGA GGAGAACGGC GCGGTCTTCG TGGAGACCGG CCTGTGGCTG
CGCCCGGCCT ATTTCCCGCG TGCCGGCGAG ACCGACTGGC TCGACACGGT GGTGCGGGAG
GTCGAGACCG TGCGCGCCCG CGTCGGGATC TGCGACGTCA CCACGCTGGG CAAGATCGAC
ATCCAGGGCC GCGACGCGCT GGCCTTCATC GAACGGGTCT GCGCCAACCC CTTCGCGACA
TTGCCCGTCG GCAAGGCGCG CTACGCCGTG CTGCTGCGCG AGGACGGCTT CATCTTGGAC
GACGGCACGA TCGCGCGGAT GGGCGAGACC CATTACGTCA TGACCGCCTC GACGGCGAAC
GCCGCGCGGG TGATGCAGCA CCTCGAATTT TGCCGGCAAT GGTTGTGGCC GGAACTCGAC
GTGCAGCTCG CCTCGGTCAG CGAGCAATGG GCGCAATACG CGGTCGCCGG CCCCCGCGCC
CGCGACACCC TGCGCCGCAT CGTCGATCCC GGTTTCGATC TCTCCAACGA GGCCTTCCCG
TTTCTCGCCT GCGCCGACGT CACCGTCGGC GGCATCCCGG CGCGGCTGTT CCGCATCTCG
TTCTCCGGCG AACTCGCCTA CGAGCTCGCT GTGCCAGCCG CCTACGGCGA CGCGGCGTGG
CGGGCGATCA TGCAGGCGGG GCTGCCCTAC GGCATCACCG CCTACGGCTC CGAGGCGCTC
TCGGTGATGC GCATCGAGAA GGGCCACGCG GCCGGGGCCG AGATCAACGG CCAGACAACC
GCCCGCGACC TCGGGCTCGG CGGGATGCTC GCCAAGAAGA AGGACTATAT CGGCCGCCTG
ATGAAGGAGC GCCCGGCGCT GGTCGATCCG GACCGGCCGA TCCTCGTGGG TTTTCGCCCC
GTCGATCCGA GCGCGCGACT CCGGGCGGGG GCGCATTTCC TGAGCCTCGA CGCCGCGCCG
AGCCTAGAGG CCGACGAGGG CGTGATGACC TCGGTAGCCT ACTCGCCGAG CCTGAAGAGC
TGGATCGGCA TCGGCCTGAT CCGGCGCGGA CCCGAGCGCC ACGGCGAGCG GGTGCGTGCC
TACGATCCGG TGCGCGGCGC CGAGATCGAG GTCGAGATCT GTTCGCCGGT CTTCGTGGAC
CCCAAGGAGG AGAAGCTGCG TGTCTGA
 
Protein sequence
MTTLALQRAE APASANAGDQ PFRTATGGLI DRNRPRDFTF DGRRLTGCHG DTLASALLAN 
GVRLVGRSFK YHRPRGILSA GSEEPNALVE LRSGARREPN TRATMAELYE GLEATSQNRW
PSLAVDALSV NALLSPVFAA GFYYKTFMWP AGFWEKVYEP MIRRAAGLGR AADAPDPDTY
DHAHAHCDVL VIGGGPAGLS AALAAGRSGA RVILVDEDFA TGGRLLAERR EIGGASGVDW
AARAVAELES LPEVRILSRT TLFGVYDHGA YGAVERVSDH LAVPAAHTPR QRLWRIVARR
AVLAAGAIER PHVFGGNDRP GVMLAGAVRT YLNRYGVMPG RRLAVFTSSD DGWRTAADIL
ATGGGLAAVI DTRTTVPPHL RRMAEAAGAR VVAGGYVAGT KGHLGLSAIQ VVDGDNSTET
IACDGLAMAN GWNPVVHLDS HLSRRPVWDA AIHAFVPGTL PSGMQAAGAA AGRFTLAECL
ETGARAGAEA AADCGFTASA EAAPPTDPES VDHTPLWRVP KPKGKAFVDF QNDVAASDVE
LAHREGFRAV ELLKRYTTLG MATDQGKTSN LAGLSIMAEL TGKDIPSVGT TVFRPPFTPV
AIGAFAGHHR GKEFRATRHV PSHAWAEENG AVFVETGLWL RPAYFPRAGE TDWLDTVVRE
VETVRARVGI CDVTTLGKID IQGRDALAFI ERVCANPFAT LPVGKARYAV LLREDGFILD
DGTIARMGET HYVMTASTAN AARVMQHLEF CRQWLWPELD VQLASVSEQW AQYAVAGPRA
RDTLRRIVDP GFDLSNEAFP FLACADVTVG GIPARLFRIS FSGELAYELA VPAAYGDAAW
RAIMQAGLPY GITAYGSEAL SVMRIEKGHA AGAEINGQTT ARDLGLGGML AKKKDYIGRL
MKERPALVDP DRPILVGFRP VDPSARLRAG AHFLSLDAAP SLEADEGVMT SVAYSPSLKS
WIGIGLIRRG PERHGERVRA YDPVRGAEIE VEICSPVFVD PKEEKLRV