Gene Information |
Locus tag | Mchl_1934 |
Symbol | |
ID | 7116749 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011757 |
Strand | - |
Start bp | 2000503 |
End bp | 2003529 |
Gene Length | 3027 bp |
Protein Length | 1008 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643524698 |
Product | sarcosine oxidase, alpha subunit family |
Protein accession | YP_002420725 |
Protein GI | 218529909 |
COG category | [E] Amino acid transport and metabolism; [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase); [COG0492] Thioredoxin reductase |
TIGRFAM ID | [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
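
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; neither assumption is stated in the record, though both agree with the listed values. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2000503
END_BP = 2003529
GENE_LENGTH_BP = 3027
PROTEIN_LENGTH_AA = 1008


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive ends."""
    return end - start + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

The strand field does not enter this check, because the span length is the same on either strand.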
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.857913 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACCACGC TCGCCCTTCA GCGCGCGGAG GCCCCGGCTT CCGCAAACGC CGGCGACCAG CCGTTCCGCA CCGCCACCGG CGGCCTGATC GACCGGAACC GGCCGCGCGA CTTCACCTTC GACGGACGGC GCCTCACCGG CTGCCACGGC GACACGCTGG CCTCGGCCCT GCTCGCCAAC GGCGTCCGCC TCGTCGGCCG CTCGTTCAAG TATCACCGGC CGCGCGGCAT CCTCTCGGCC GGCTCGGAGG AGCCGAACGC GCTGGTGGAG CTGCGCTCGG GCGCCCGGCG CGAGCCCAAC ACCCGCGCCA CCATGGCCGA GCTCTACGAA GGGCTGGAGG CGACGAGCCA GAACCGCTGG CCATCCCTCG CCGTGGATGC GCTCTCGGTC AACGCGCTGC TCTCACCGGT CTTCGCGGCG GGCTTCTACT ACAAGACCTT CATGTGGCCG GCCGGCTTCT GGGAGAAGGT GTACGAGCCG ATGATCCGGC GCGCGGCCGG GCTCGGTCGG GCCGCCGACG CGCCCGATCC CGACACCTAC GACCACGCCC ACGCCCATTG CGACGTGCTC GTCATCGGCG GCGGCCCGGC CGGCCTGTCG GCGGCGCTCG CAGCCGGCCG CTCCGGGGCG CGGGTCATCC TGGTCGACGA GGATTTCGCC ACCGGCGGGC GCCTGCTCGC CGAGCGGCGC GAGATCGGCG GCGCGAGCGG GGTCGACTGG GCCGCCCGCG CAGTGGCCGA GTTGGAGAGC CTGCCCGAGG TGCGCATCCT GTCGCGCACC ACCCTGTTCG GCGTCTACGA TCACGGCGCC TACGGCGCGG TCGAGCGCGT TTCCGATCAT CTCGCGGTTC CGGCCGCCCA CACCCCGCGC CAGCGGCTGT GGCGGATCGT GGCGCGGCGC GCGGTACTCG CGGCCGGCGC GATCGAGCGC CCGCACGTCT TCGGCGGCAA CGATCGGCCC GGCGTGATGC TGGCCGGCGC GGTGCGGACC TATCTCAACC GTTACGGCGT GATGCCGGGG CGCCGCTTGG CCGTGTTCAC GTCGAGCGAC GACGGCTGGC GCACCGCCGC CGATATCCTC GCCACGGGCG GCGGGCTCGC GGCGGTGATC GACACCCGCA CCACGGTCCC TCCGCATCTG CGCCGGATGG CGGAAGCCGC CGGGGCGCGG GTGGTGGCCG GTGGGTATGT CGCCGGCACC AAGGGGCATC TCGGCCTGAG CGCGATCCAG GTCGTGGACG GCGACAACAG CACCGAGACG ATTGCCTGCG ACGGCCTCGC CATGGCCAAT GGCTGGAACC CGGTCGTCCA CCTCGATTCG CACCTGTCGC GCCGGCCGGT CTGGGACGCG GCGATCCACG CCTTCGTGCC GGGCACCCTT CCCTCCGGCA TGCAGGCGGC CGGCGCCGCC GCCGGACGCT TCACCCTGGC CGAATGCCTG GAGACCGGCG CCCGGGCCGG CGCCGAGGCC GCGGCCGATT GCGGCTTCAC CGCCTCGGCC GAGGCCGCTC CGCCGACCGA CCCGGAGAGC GTGGACCACA CCCCGCTCTG GCGCGTGCCA AAGCCGAAGG GAAAAGCCTT CGTCGATTTC CAGAACGACG TGGCGGCCTC CGACGTCGAA CTCGCCCACC GTGAGGGCTT TCGCGCGGTC GAACTGTTGA AGCGCTACAC CACGCTTGGC ATGGCGACCG ACCAGGGCAA GACCTCGAAC CTCGCCGGCC TCTCGATCAT GGCCGAGCTG ACCGGCAAGG ACATTCCGAG CGTCGGCACC ACGGTGTTCC GCCCGCCCTT CACCCCCGTC 
GCCATCGGGG CGTTTGCCGG CCATCACCGC GGCAAGGAGT TCCGCGCCAC CCGCCATGTC CCCTCCCATG CCTGGGCGGA GGAGAACGGC GCGGTCTTCG TGGAGACCGG CCTGTGGCTG CGCCCGGCCT ATTTCCCGCG TGCCGGCGAG ACCGACTGGC TCGACACGGT GGTGCGGGAG GTCGAGACCG TGCGCGCCCG CGTCGGGATC TGCGACGTCA CCACGCTGGG CAAGATCGAC ATCCAGGGCC GCGACGCGCT GGCCTTCATC GAACGGGTCT GCGCCAACCC CTTCGCGACA TTGCCCGTCG GCAAGGCGCG CTACGCCGTG CTGCTGCGCG AGGACGGCTT CATCTTGGAC GACGGCACGA TCGCGCGGAT GGGCGAGACC CATTACGTCA TGACCGCCTC GACGGCGAAC GCCGCGCGGG TGATGCAGCA CCTCGAATTT TGCCGGCAAT GGTTGTGGCC GGAACTCGAC GTGCAGCTCG CCTCGGTCAG CGAGCAATGG GCGCAATACG CGGTCGCCGG CCCCCGCGCC CGCGACACCC TGCGCCGCAT CGTCGATCCC GGTTTCGATC TCTCCAACGA GGCCTTCCCG TTTCTCGCCT GCGCCGACGT CACCGTCGGC GGCATCCCGG CGCGGCTGTT CCGCATCTCG TTCTCCGGCG AACTCGCCTA CGAGCTCGCT GTGCCAGCCG CCTACGGCGA CGCGGCGTGG CGGGCGATCA TGCAGGCGGG GCTGCCCTAC GGCATCACCG CCTACGGCTC CGAGGCGCTC TCGGTGATGC GCATCGAGAA GGGCCACGCG GCCGGGGCCG AGATCAACGG CCAGACAACC GCCCGCGACC TCGGGCTCGG CGGGATGCTC GCCAAGAAGA AGGACTATAT CGGCCGCCTG ATGAAGGAGC GCCCGGCGCT GGTCGATCCG GACCGGCCGA TCCTCGTGGG TTTTCGCCCC GTCGATCCGA GCGCGCGACT CCGGGCGGGG GCGCATTTCC TGAGCCTCGA CGCCGCGCCG AGCCTAGAGG CCGACGAGGG CGTGATGACC TCGGTAGCCT ACTCGCCGAG CCTGAAGAGC TGGATCGGCA TCGGCCTGAT CCGGCGCGGA CCCGAGCGCC ACGGCGAGCG GGTGCGTGCC TACGATCCGG TGCGCGGCGC CGAGATCGAG GTCGAGATCT GTTCGCCGGT CTTCGTGGAC CCCAAGGAGG AGAAGCTGCG TGTCTGA
Protein sequence | MTTLALQRAE APASANAGDQ PFRTATGGLI DRNRPRDFTF DGRRLTGCHG DTLASALLAN GVRLVGRSFK YHRPRGILSA GSEEPNALVE LRSGARREPN TRATMAELYE GLEATSQNRW PSLAVDALSV NALLSPVFAA GFYYKTFMWP AGFWEKVYEP MIRRAAGLGR AADAPDPDTY DHAHAHCDVL VIGGGPAGLS AALAAGRSGA RVILVDEDFA TGGRLLAERR EIGGASGVDW AARAVAELES LPEVRILSRT TLFGVYDHGA YGAVERVSDH LAVPAAHTPR QRLWRIVARR AVLAAGAIER PHVFGGNDRP GVMLAGAVRT YLNRYGVMPG RRLAVFTSSD DGWRTAADIL ATGGGLAAVI DTRTTVPPHL RRMAEAAGAR VVAGGYVAGT KGHLGLSAIQ VVDGDNSTET IACDGLAMAN GWNPVVHLDS HLSRRPVWDA AIHAFVPGTL PSGMQAAGAA AGRFTLAECL ETGARAGAEA AADCGFTASA EAAPPTDPES VDHTPLWRVP KPKGKAFVDF QNDVAASDVE LAHREGFRAV ELLKRYTTLG MATDQGKTSN LAGLSIMAEL TGKDIPSVGT TVFRPPFTPV AIGAFAGHHR GKEFRATRHV PSHAWAEENG AVFVETGLWL RPAYFPRAGE TDWLDTVVRE VETVRARVGI CDVTTLGKID IQGRDALAFI ERVCANPFAT LPVGKARYAV LLREDGFILD DGTIARMGET HYVMTASTAN AARVMQHLEF CRQWLWPELD VQLASVSEQW AQYAVAGPRA RDTLRRIVDP GFDLSNEAFP FLACADVTVG GIPARLFRIS FSGELAYELA VPAAYGDAAW RAIMQAGLPY GITAYGSEAL SVMRIEKGHA AGAEINGQTT ARDLGLGGML AKKKDYIGRL MKERPALVDP DRPILVGFRP VDPSARLRAG AHFLSLDAAP SLEADEGVMT SVAYSPSLKS WIGIGLIRRG PERHGERVRA YDPVRGAEIE VEICSPVFVD PKEEKLRV
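
The gene and protein sequences can be compared by translating the gene sequence codon by codon. The following is a minimal sketch, not the method used to produce this record. It assumes the listed gene sequence is already the coding strand, which is plausible because it begins with ATG even though the gene lies on the minus strand. It also relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table built in TCAG order; the amino acid assignments are those of
# the standard code, which translation table 11 shares.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0 and stop at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence that is passed in."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three and last three blocks of the gene sequence in this record.
    first_blocks = "ATGACCACGC TCGCCCTTCA GCGCGCGGAG"
    last_blocks = "CCCAAGGAGG AGAAGCTGCG TGTCTGA"
    print(translate(first_blocks))  # compare with the start of the protein
    print(translate(last_blocks))   # compare with the end of the protein
```

Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field; the call is omitted here to avoid repeating the sequence.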