Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_1652 |
Symbol | |
ID | 5833890 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 1845761 |
End bp | 1848790 |
Gene Length | 3030 bp |
Protein Length | 1009 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641367450 |
Product | sarcosine oxidase alpha subunit family protein |
Protein accession | YP_001639122 |
Protein GI | 163851079 |
COG category | [E] Amino acid transport and metabolism [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase) [COG0492] Thioredoxin reductase |
TIGRFAM ID | [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.90362 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACGC TTGCCCTTCA GCGCGCCGAG GCCTCGGCTT CCGCGACGTC AGCCGACCGG CCGTTCCGCA CCGCCACCGG CGGCCTGATC GACCGGAACC GGCCGCGCGA CTTCACCTTC GACGGGCGGC GCCTCACCGG CTGCCACGGC GACACGCTGG CCTCGGCCCT GCTTGCCAAC GGCGTGCGCC TCGTCGGCCG CTCGTTCAAG TATCACCGGC CGCGCGGCAT CCTCTCGGCC GGCTCGGAGG AGCCGAACGC GCTGGTGGAG CTGCGCTCGG GCGCCCGGCG CGAGCCCAAC ACCCGCGCCA CCATGGCCGA GCTCTACGAG GGGTTGGAGG CGACGAGCCA GAACCGCTGG CCCACGCTCG CCGTGGATGC GCTCTCGGTC AACGCGCTGC TCTCACCGGT CTTCGCGGCG GGCTTCTACT ACAAGACCTT CATGTGGCCG GCCGGCTTCT GGGAGAAGCT GTACGAGCCG ATGATCCGGC GCGCGGCTGG GCTCGGTCGG GCCGCCGACG CGCCCGATCC CGACACCTAC GACCACGCCC ACGCCCATTG CGACGTGCTC GTCATCGGCG GCGGCCCGGC CGGCCTGTCG GCGGCGCTCG CCGCCGGCCG CTCCGGGGCG CGGGTCATCC TGGTCGACGA GGATTTCGCC ACCGGCGGTC GGCTGCTCGC CGAGCGGCGC GAAATCGGCG GCGCGAGCGG GTCCGAGTGG GCTGCGCGCG CGGTGGCCGA GTTGGAGAGC CTGCCCGAGG TGCGCATCCT GTCGCGCACC ACCCTGTTCG GCGTCTACGA TCACGGCGCC TACGGAGCGG TCGAGCGCGT CTCCGACCAT CTCGCGGTGC CGCCCCCCCA CACCCCGCGC CAGCGGCTGT GGCGGATCGT AGCGCGGCGC GCGGTGCTCG CGGCCGGCGC GATCGAGCGC CCGCACGTCT TCGGCGGCAA CGACCGCCCC GGCGTGATGC TGGCCGGCGC GGTGCGGACC TACCTCAACC GTTACGGCGT GCTGCCGGGG CGCCGCTTGG CCGTGTTCAC GTCGAGCGAC GACGGCTGGC GCACCGCCAC CGATATCCTG GCCGCAGGCG GCGGGCTTGC GGCGGTGATC GACACCCGCG CTGCCGTTCC CCCCGCCCTG CGCCGGATGG CGGAGGCCGC CGGGGCGCGG GTGGTGGCCG GTGGATACGT CGCCGGCACC AAGGGGCATC TGGGCTTGAG CGCGATCCAG GTCGTGGACG GCGACAACAG CATCGAGACG ATTCCCTGCG ACGGCCTCGC CATGGCCAAT GGCTGGAACC CGGTCGTCCA CCTCGATTCC CACCTGTCGC GTCGGCCGGT CTGGAACGCG GCGATCCACG CCTTCGTGCC GGGCACCCTC CCCTCCGGCA TGCAGGCGGC CGGCGCCGCC GCCGGGCGCT TCACCCTGGC CGAATGCCTG GAGACCGGCG CCCAGGCCGG CGCCGAGGCC GCGAGCGATT GCGGCTTCAC CGCCTCGGCC GAGGCCACTC CGCCGACCGA TCCGGAAAGC GTGGACCACA CCCCGCTCTG GCGCGCTCCG AAGCCCCGCG GAAAAGCCTT CGTCGATTTT CAGAACGACG TGGCGGCCTC CGACATCGAA CTCGCCCACC GCGAGGGGTT TCGCGCGGTC GAGCTGTTGA AGCGCTACAC CACCCTCGGC ATGGCGACCG ACCAGGGCAA GACCTCGAAC CTCGCCGGCC TCTCGATCAT GGCCGAGCTG ACCGGCAAGG ACATCCCGAG CGTCGCCACC ACGGTGTTCC GCCCGCCCTT CACCCCCGTC GCCATCGGCG CGTTTGCCGG CCATCACCGC GGCAAGGAGT TCCGCGCCAC CCGCCACGTC CCCTCCCATG CCTGGGCGGA GGAGAACGGC GCGGTCTTCG TGGAGACCGG CCTGTGGCTG CGCCCGGCCT ATTTCCCGCG GGCAAGCGAG ATCGACTGGC TCGACACGGT GGTACGGGAG GTCGAGACCG TGCGCGCCCG CGTCGGGATC TGCGACGTCA CCACGCTCGG CAAGATCGAC ATCCAGGGCC GCGACGCGCT GGCCTTCATC GAGCGGGTCT GCGCCAACCC CTTCGCGACG TTGCCCGTCG GCAAGGCGCG CTACGCCGTG CTGCTGCGTG AGGACGGCTT CATCCTGGAC GACGGCACAA TCGCGCGGAT GGGCGAGACC CACTACGTCA TGACCGCCTC GACGGCGAAC GCCCCGCGGG TGATGCAGCA TCTCGAATTT TGCCGGCAAT GGTTGTGGCC GGAACTCGAC GTGCAGCTCG CCTCGGTCAG CGAGCAATGG GCGCAATACG CGGTCGCCGG CCCGCGCGCC CGCGACACCC TGCGCCGCAT CGTCGATCCG GGTTTCGATC TCTCCAACGA GGCCTTCCCG TTTCTCGCCT GCGCCGACGT CACCGTCGGC GGCGGCATCC CGGCGCGGCT GTTCCGGATC TCGTTCTCGG GCGAAGTCGC CTACGAGCTG GCGGTGCCGG CCGCCTACGG CGACGCGGCG TGGCGGGCGG TCATGCAGGC CGGCCTGCCC TACGGCATCA CCGCCTACGG CTCGGAGGCG CTCTCGGTCA TGCGCATCGA GAAGGGTCAC GCGGCCGGGG CCGAGATCAA CGGCCAGACC ACCGCCCGCG ACCTCGGGCT CGGCGGGATG CTCGCCAAGA AGAAGGACTA TATCGGCCGC CTGATGAAGG AGCGCCCGGC GCTGGTCGAT CCGGACCGGC CGATCCTCGT GGGCTTTCGC CCCGTCGATC CGAGCGCGCG GCTCCGGGCG GGGGCGCATT TCCTGAGCCT CGACGCCGCG CCGAGCCTGG AGGCCGACGA GGGCGTGATG ACCTCGGTGG CCTACTCGCC GAGCCTGAAG AGCTGGATCG GCATCGGCCT GATCCGGCGC GGGCCCGAGC GCCACGGCGA GCGGGTGCGC GCCTACGATC CGGTGCGCGG CGCCGAGATC GAGGTCGAGA TCTGTTCGCC GGTCTTCGTG GACCCCAAGG AGGAGAAGCT GCGTGTCTGA
|
Protein sequence | MTTLALQRAE ASASATSADR PFRTATGGLI DRNRPRDFTF DGRRLTGCHG DTLASALLAN GVRLVGRSFK YHRPRGILSA GSEEPNALVE LRSGARREPN TRATMAELYE GLEATSQNRW PTLAVDALSV NALLSPVFAA GFYYKTFMWP AGFWEKLYEP MIRRAAGLGR AADAPDPDTY DHAHAHCDVL VIGGGPAGLS AALAAGRSGA RVILVDEDFA TGGRLLAERR EIGGASGSEW AARAVAELES LPEVRILSRT TLFGVYDHGA YGAVERVSDH LAVPPPHTPR QRLWRIVARR AVLAAGAIER PHVFGGNDRP GVMLAGAVRT YLNRYGVLPG RRLAVFTSSD DGWRTATDIL AAGGGLAAVI DTRAAVPPAL RRMAEAAGAR VVAGGYVAGT KGHLGLSAIQ VVDGDNSIET IPCDGLAMAN GWNPVVHLDS HLSRRPVWNA AIHAFVPGTL PSGMQAAGAA AGRFTLAECL ETGAQAGAEA ASDCGFTASA EATPPTDPES VDHTPLWRAP KPRGKAFVDF QNDVAASDIE LAHREGFRAV ELLKRYTTLG MATDQGKTSN LAGLSIMAEL TGKDIPSVAT TVFRPPFTPV AIGAFAGHHR GKEFRATRHV PSHAWAEENG AVFVETGLWL RPAYFPRASE IDWLDTVVRE VETVRARVGI CDVTTLGKID IQGRDALAFI ERVCANPFAT LPVGKARYAV LLREDGFILD DGTIARMGET HYVMTASTAN APRVMQHLEF CRQWLWPELD VQLASVSEQW AQYAVAGPRA RDTLRRIVDP GFDLSNEAFP FLACADVTVG GGIPARLFRI SFSGEVAYEL AVPAAYGDAA WRAVMQAGLP YGITAYGSEA LSVMRIEKGH AAGAEINGQT TARDLGLGGM LAKKKDYIGR LMKERPALVD PDRPILVGFR PVDPSARLRA GAHFLSLDAA PSLEADEGVM TSVAYSPSLK SWIGIGLIRR GPERHGERVR AYDPVRGAEI EVEICSPVFV DPKEEKLRV
|
| |