Gene Mext_1652 details

Gene Information

Locus tag: Mext_1652
Symbol:
ID: 5833890
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 1845761
End bp: 1848790
Gene Length: 3030 bp
Protein Length: 1009 aa
Translation table: 11
GC content: 72%
IMG OID: 641367450
Product: sarcosine oxidase alpha subunit family protein
Protein accession: YP_001639122
Protein GI: 163851079
COG category: [E] Amino acid transport and metabolism
              [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0404] Glycine cleavage system T protein (aminomethyltransferase)
        [COG0492] Thioredoxin reductase
TIGRFAM ID: [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form
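
The coordinate and length fields in this record agree with each other, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the coding sequence ends with a single stop codon; the record states neither, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1845761, 1848790)
print(length)                  # 3030
print(protein_length(length))  # 1009
```

Both values match the Gene Length and Protein Length fields listed for Mext_1652.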


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value: 0.90362
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCACGC TTGCCCTTCA GCGCGCCGAG GCCTCGGCTT CCGCGACGTC AGCCGACCGG 
CCGTTCCGCA CCGCCACCGG CGGCCTGATC GACCGGAACC GGCCGCGCGA CTTCACCTTC
GACGGGCGGC GCCTCACCGG CTGCCACGGC GACACGCTGG CCTCGGCCCT GCTTGCCAAC
GGCGTGCGCC TCGTCGGCCG CTCGTTCAAG TATCACCGGC CGCGCGGCAT CCTCTCGGCC
GGCTCGGAGG AGCCGAACGC GCTGGTGGAG CTGCGCTCGG GCGCCCGGCG CGAGCCCAAC
ACCCGCGCCA CCATGGCCGA GCTCTACGAG GGGTTGGAGG CGACGAGCCA GAACCGCTGG
CCCACGCTCG CCGTGGATGC GCTCTCGGTC AACGCGCTGC TCTCACCGGT CTTCGCGGCG
GGCTTCTACT ACAAGACCTT CATGTGGCCG GCCGGCTTCT GGGAGAAGCT GTACGAGCCG
ATGATCCGGC GCGCGGCTGG GCTCGGTCGG GCCGCCGACG CGCCCGATCC CGACACCTAC
GACCACGCCC ACGCCCATTG CGACGTGCTC GTCATCGGCG GCGGCCCGGC CGGCCTGTCG
GCGGCGCTCG CCGCCGGCCG CTCCGGGGCG CGGGTCATCC TGGTCGACGA GGATTTCGCC
ACCGGCGGTC GGCTGCTCGC CGAGCGGCGC GAAATCGGCG GCGCGAGCGG GTCCGAGTGG
GCTGCGCGCG CGGTGGCCGA GTTGGAGAGC CTGCCCGAGG TGCGCATCCT GTCGCGCACC
ACCCTGTTCG GCGTCTACGA TCACGGCGCC TACGGAGCGG TCGAGCGCGT CTCCGACCAT
CTCGCGGTGC CGCCCCCCCA CACCCCGCGC CAGCGGCTGT GGCGGATCGT AGCGCGGCGC
GCGGTGCTCG CGGCCGGCGC GATCGAGCGC CCGCACGTCT TCGGCGGCAA CGACCGCCCC
GGCGTGATGC TGGCCGGCGC GGTGCGGACC TACCTCAACC GTTACGGCGT GCTGCCGGGG
CGCCGCTTGG CCGTGTTCAC GTCGAGCGAC GACGGCTGGC GCACCGCCAC CGATATCCTG
GCCGCAGGCG GCGGGCTTGC GGCGGTGATC GACACCCGCG CTGCCGTTCC CCCCGCCCTG
CGCCGGATGG CGGAGGCCGC CGGGGCGCGG GTGGTGGCCG GTGGATACGT CGCCGGCACC
AAGGGGCATC TGGGCTTGAG CGCGATCCAG GTCGTGGACG GCGACAACAG CATCGAGACG
ATTCCCTGCG ACGGCCTCGC CATGGCCAAT GGCTGGAACC CGGTCGTCCA CCTCGATTCC
CACCTGTCGC GTCGGCCGGT CTGGAACGCG GCGATCCACG CCTTCGTGCC GGGCACCCTC
CCCTCCGGCA TGCAGGCGGC CGGCGCCGCC GCCGGGCGCT TCACCCTGGC CGAATGCCTG
GAGACCGGCG CCCAGGCCGG CGCCGAGGCC GCGAGCGATT GCGGCTTCAC CGCCTCGGCC
GAGGCCACTC CGCCGACCGA TCCGGAAAGC GTGGACCACA CCCCGCTCTG GCGCGCTCCG
AAGCCCCGCG GAAAAGCCTT CGTCGATTTT CAGAACGACG TGGCGGCCTC CGACATCGAA
CTCGCCCACC GCGAGGGGTT TCGCGCGGTC GAGCTGTTGA AGCGCTACAC CACCCTCGGC
ATGGCGACCG ACCAGGGCAA GACCTCGAAC CTCGCCGGCC TCTCGATCAT GGCCGAGCTG
ACCGGCAAGG ACATCCCGAG CGTCGCCACC ACGGTGTTCC GCCCGCCCTT CACCCCCGTC
GCCATCGGCG CGTTTGCCGG CCATCACCGC GGCAAGGAGT TCCGCGCCAC CCGCCACGTC
CCCTCCCATG CCTGGGCGGA GGAGAACGGC GCGGTCTTCG TGGAGACCGG CCTGTGGCTG
CGCCCGGCCT ATTTCCCGCG GGCAAGCGAG ATCGACTGGC TCGACACGGT GGTACGGGAG
GTCGAGACCG TGCGCGCCCG CGTCGGGATC TGCGACGTCA CCACGCTCGG CAAGATCGAC
ATCCAGGGCC GCGACGCGCT GGCCTTCATC GAGCGGGTCT GCGCCAACCC CTTCGCGACG
TTGCCCGTCG GCAAGGCGCG CTACGCCGTG CTGCTGCGTG AGGACGGCTT CATCCTGGAC
GACGGCACAA TCGCGCGGAT GGGCGAGACC CACTACGTCA TGACCGCCTC GACGGCGAAC
GCCCCGCGGG TGATGCAGCA TCTCGAATTT TGCCGGCAAT GGTTGTGGCC GGAACTCGAC
GTGCAGCTCG CCTCGGTCAG CGAGCAATGG GCGCAATACG CGGTCGCCGG CCCGCGCGCC
CGCGACACCC TGCGCCGCAT CGTCGATCCG GGTTTCGATC TCTCCAACGA GGCCTTCCCG
TTTCTCGCCT GCGCCGACGT CACCGTCGGC GGCGGCATCC CGGCGCGGCT GTTCCGGATC
TCGTTCTCGG GCGAAGTCGC CTACGAGCTG GCGGTGCCGG CCGCCTACGG CGACGCGGCG
TGGCGGGCGG TCATGCAGGC CGGCCTGCCC TACGGCATCA CCGCCTACGG CTCGGAGGCG
CTCTCGGTCA TGCGCATCGA GAAGGGTCAC GCGGCCGGGG CCGAGATCAA CGGCCAGACC
ACCGCCCGCG ACCTCGGGCT CGGCGGGATG CTCGCCAAGA AGAAGGACTA TATCGGCCGC
CTGATGAAGG AGCGCCCGGC GCTGGTCGAT CCGGACCGGC CGATCCTCGT GGGCTTTCGC
CCCGTCGATC CGAGCGCGCG GCTCCGGGCG GGGGCGCATT TCCTGAGCCT CGACGCCGCG
CCGAGCCTGG AGGCCGACGA GGGCGTGATG ACCTCGGTGG CCTACTCGCC GAGCCTGAAG
AGCTGGATCG GCATCGGCCT GATCCGGCGC GGGCCCGAGC GCCACGGCGA GCGGGTGCGC
GCCTACGATC CGGTGCGCGG CGCCGAGATC GAGGTCGAGA TCTGTTCGCC GGTCTTCGTG
GACCCCAAGG AGGAGAAGCT GCGTGTCTGA
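
The GC content field can in principle be recomputed from the gene sequence as displayed. The sketch below assumes GC content means the fraction of G and C bases over the whole sequence, and that the display format is whitespace-separated blocks of ten bases; how the source database rounds the value is not documented here. The helper names are illustrative.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed row of the gene sequence; paste all rows to check the full gene.
first_row = "ATGACCACGC TTGCCCTTCA GCGCGCCGAG GCCTCGGCTT CCGCGACGTC AGCCGACCGG"
seq = clean_sequence(first_row)
print(len(seq))                    # 60
print(f"{gc_fraction(seq):.0%}")   # GC share of this row only
```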
 
Protein sequence
MTTLALQRAE ASASATSADR PFRTATGGLI DRNRPRDFTF DGRRLTGCHG DTLASALLAN 
GVRLVGRSFK YHRPRGILSA GSEEPNALVE LRSGARREPN TRATMAELYE GLEATSQNRW
PTLAVDALSV NALLSPVFAA GFYYKTFMWP AGFWEKLYEP MIRRAAGLGR AADAPDPDTY
DHAHAHCDVL VIGGGPAGLS AALAAGRSGA RVILVDEDFA TGGRLLAERR EIGGASGSEW
AARAVAELES LPEVRILSRT TLFGVYDHGA YGAVERVSDH LAVPPPHTPR QRLWRIVARR
AVLAAGAIER PHVFGGNDRP GVMLAGAVRT YLNRYGVLPG RRLAVFTSSD DGWRTATDIL
AAGGGLAAVI DTRAAVPPAL RRMAEAAGAR VVAGGYVAGT KGHLGLSAIQ VVDGDNSIET
IPCDGLAMAN GWNPVVHLDS HLSRRPVWNA AIHAFVPGTL PSGMQAAGAA AGRFTLAECL
ETGAQAGAEA ASDCGFTASA EATPPTDPES VDHTPLWRAP KPRGKAFVDF QNDVAASDIE
LAHREGFRAV ELLKRYTTLG MATDQGKTSN LAGLSIMAEL TGKDIPSVAT TVFRPPFTPV
AIGAFAGHHR GKEFRATRHV PSHAWAEENG AVFVETGLWL RPAYFPRASE IDWLDTVVRE
VETVRARVGI CDVTTLGKID IQGRDALAFI ERVCANPFAT LPVGKARYAV LLREDGFILD
DGTIARMGET HYVMTASTAN APRVMQHLEF CRQWLWPELD VQLASVSEQW AQYAVAGPRA
RDTLRRIVDP GFDLSNEAFP FLACADVTVG GGIPARLFRI SFSGEVAYEL AVPAAYGDAA
WRAVMQAGLP YGITAYGSEA LSVMRIEKGH AAGAEINGQT TARDLGLGGM LAKKKDYIGR
LMKERPALVD PDRPILVGFR PVDPSARLRA GAHFLSLDAA PSLEADEGVM TSVAYSPSLK
SWIGIGLIRR GPERHGERVR AYDPVRGAEI EVEICSPVFV DPKEEKLRV
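
The protein sequence corresponds to a translation of the gene sequence under translation table 11. That table uses the same codon-to-amino-acid assignments as the standard genetic code and differs in which start codons it permits. The sketch below builds the standard assignments and translates the first and last displayed rows of the gene sequence. It does not handle alternative start codons, which is acceptable here only because this gene begins with ATG; the function and variable names are illustrative.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_row = "ATGACCACGC TTGCCCTTCA GCGCGCCGAG GCCTCGGCTT CCGCGACGTC AGCCGACCGG"
last_row = "GACCCCAAGG AGGAGAAGCT GCGTGTCTGA"
print(translate(first_row))  # MTTLALQRAEASASATSADR
print(translate(last_row))   # DPKEEKLRV
```

The two outputs match the first 20 and last 9 residues of the protein sequence shown above.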