Gene Mext_3738 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3738 
Symbol 
ID5832221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4140140 
End bp4143127 
Gene Length2988 bp 
Protein Length995 aa 
Translation table11 
GC content70% 
IMG OID641369528 
Productsarcosine oxidase alpha subunit family protein 
Protein accessionYP_001641183 
Protein GI163853140 
COG category[E] Amino acid transport and metabolism
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0404] Glycine cleavage system T protein (aminomethyltransferase)
[COG0492] Thioredoxin reductase 
TIGRFAM ID[TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.161816 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.187034 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACAGC CGTTCAGATT GTCCCGCGGC GGACGCATCG ACCGCACCCG CCCCATCGTC 
TTCGAATTCA ACGGCAAGCC GGTCCACGGA TTCGCCGGCG ACACCGTCGC CTCGGCGCTG
CTGGCCAACG GCATCCACCT CGTCGGGCGC TCGTTCAAGT ACCACCGCCC CCGCGGCATC
CTGAGCCATG GCCCCGACGA GCCGAGCGCG CTGCTCTCGG TCGATCGTGG GCCCGGCCGG
ATCGACCCGA ACAACCGCGC CTCCGTGGTC GAGGCGCGCT CGGGCCTGCG CACGACCTCG
CAGAACCATT GGCCGTCGCT CGAATTCGAC GTCGGCGCCG TCAATGATTT GCTGTCGCCG
GTCTTCGTGG CGGGCTTCTA CTACAAGACT TTCATGTGGC CCCGGAAGTT CTGGGACCGG
GTCTATGAGC CGTTCATCCG CGCCGCCGCC GGTCTCGGAA AGGCGCCGAC GGTGGCCGAT
CCCGACCGCT ACGCCAACCG CCACGCCCAT TGCGATGTGC TGATCGTCGG CGCCGGCCCG
GCGGGGCTTG CTGCGGCGCT CGCCGCGGCG CGTACCGGCA AGCGGGTGAT CCTCGCCGAC
GAGGGCGCGG AGCCCGGCGG CACGCTCCTG CACGACACGA CCTCGCAGAT CGACGGTCGC
CCGGCGGCGG ACTGGCTCGC CGAAACGCTG GCCGAGCTCG ATGCCCGCGA GAACGTCATC
CTGCTGCCCC GCACCACCGC CTTCGGCTAT TACAACCACA ACCACGTGGC GATGACCGAG
CGCGTCACCG ACCACCTGTC GTCCGCCGCG GGCCAAGCGC CCCGCGAGCG CCTGTGGCAG
GTGCGGGCGG AGCAGGTCGT GCTCGCCGGT GGCTCCCACG AGCGCCCCCT CGTTTTCGCC
GACAACGACC GGCCGGGCAT CCTGCTCGCC GAGAGCGTGC GGGTCTTCCT CAACCGTTAC
GGCGTGGCGC CGGGCCGCAA GCTCGTCTTC GCCACGAGCG GCGCCTCCGC CTACCAAGCC
GCGCTCGATG CGCGTGCGGC GGGCCTCGAC GTCACCCTCG TCGATCTGCG CCTAGAAGCG
GATTGCGGAC CGGAGTTGGC ACGCCTGCGC AGCGCCGGGG TCGACGTATT GACCGGCCAC
ACCGTGGTCG GATCGAAGGG CCGGAAGCGC GTCACGGGTC TCATCGTGGC GCCTGTCGGG
AGCGACGGCC GGTGCGGCGG CCGTCGCATT CTCCCTTGCG ACTGCGTCGG CATGTCCGGC
GGCTGGACGC CCGCCGTCCA CCTGTTCTCG CAGTCCCGCG GCAAGCTCGC CTACGATGAG
GGCATCGATG CCTTCGTGCC GAGCCGCTCG GCGCAAGACG AGCGCTCGGC GGGCGCGGCC
CGCGGCAGCT ACGACCTCGC CGCCTGCCTC GCGGAGGGCT TCGCCGCCGG TGCCGCCGCG
GCTGGTTCCG ACGCACGGCA GGACTTCAGG GCGACGGAGA CGCTGACCGG TTTCCAGCCG
GTGCGGATCA TGCCCACCGA CGCGAACCCG ACCAAGGTCC GCGCCTTCGT CGACTACCAG
AACGACGTCA CCGCCAAGGA CATCAAGCTC GCGGTGCGCG AGGGCTTCCA GTCGATCGAG
CACGTCAAGC GCTACACCAC GACCGGCATG GCGACCGACC AGGGCAAGAC CTCGAACATG
AACGCGCTCG GCATCGTCGC CGGGCAGCTC GACAAGGCGC TGCCCGCCGT CGGCACCACG
ACCTTCCGGC CGCCCTACAC CCCGGTGACC TTCGGCGCGC TGGTGGGCCC GGCCCGCCAC
GCCCTGTTCG ATCCGATCCG CACCACTCCG ATCCACGAAT GGGCCGAGGC CCACGGCGCC
CTGTTCGAGA ACGTCGCCCT GTGGCGGCGC GCCTGGTACT TCCCGAAGGC GGGCGAGGAT
CTGCACGCCG CGGTCGCCCG CGAGTGCAAG GCGGTGCGCG AGGGCGTCGG CATCTTCGAC
GCCTCGACGC TGGGCAAAAT CGAGATCGTC GGCCGGGATG CGGCCGAGTT CATGAACCGC
CTCTACATCA ACCCCTGGAC CAAGCTCGCC CCCGGGCGCT GCCGCTACGG GCTGATGCTG
AAGGAGGACG GCTACATCCT CGATGACGGC GTCGTCGCCC GCGTGTCGGA CACCTGCTTC
CACGTCACCA CCACCACCGG CGGCGCCGCC CGCGTGCTCG GCCACATGGA GGATTATCTC
CAGACCGAGT GGCCGGAGCT TGAAGTGTTC CTGACCTCGA CCACCGAGCA ATGGGCGGTG
ATCGCGCTCC AGGGCCCGAA GGCCCGCGCC GTGATCGCGC CGCTCGTCGA CGGCATCGAT
CTGTCGCCGG ACGCCTTCCC GCATATGGCG ATGCGCTCAG GCACGATCTG CGGCGTGCCG
ACCCGGCTGT TCCGGGTGTC GTTCACCGGT GAACTCGGCT TCGAGATCAA CGTGCCCGCC
GACCACGCCC GCGCGGTCTG GGAGGCGGTG TTCGAGGCGG GCCGGGCCCA CGGCATCACG
CCCTACGGCA CCGAGACGAT GCACGTGCTG CGCGCCGAGA AGGGCTACAT CATCGTCGGC
CAGGAGACCG ACGGCACGGT GACCCCGGAC GATGTCGGCA TGGCCGGCAT GATCCCGAAG
GCCAAGGGAG ACTTCGTCGG CAAGCGCTCG CTGGCGCGCC CCGACGTCGT TGCCACCGGC
CGCAAGCAGC TCGTCGGCCT CATGACCGAT GACCCTAAGC TCGTCCTCGA CGAGGGCGCG
CAGATCGTCA CGGATACCCA TCAGCCGATC CCGATGCGCA TGCTCGGCCA CGTCACGTCG
AGCTACTGGA GCGCCAATTG CGGCCGCTCC ATCGCGCTGG CCCTGGTCGA GGGCGGACGC
GAGCGGATGA ACGGCCATCT CTTCGTCACC ACGCCGGACG GGTTCACCCG CGTCACCGTC
TGCGAGCCGG TCTTCTTCGA CGTCCAGGGG GAGCGCATCA ATGCTTGA
 
Protein sequence
MAQPFRLSRG GRIDRTRPIV FEFNGKPVHG FAGDTVASAL LANGIHLVGR SFKYHRPRGI 
LSHGPDEPSA LLSVDRGPGR IDPNNRASVV EARSGLRTTS QNHWPSLEFD VGAVNDLLSP
VFVAGFYYKT FMWPRKFWDR VYEPFIRAAA GLGKAPTVAD PDRYANRHAH CDVLIVGAGP
AGLAAALAAA RTGKRVILAD EGAEPGGTLL HDTTSQIDGR PAADWLAETL AELDARENVI
LLPRTTAFGY YNHNHVAMTE RVTDHLSSAA GQAPRERLWQ VRAEQVVLAG GSHERPLVFA
DNDRPGILLA ESVRVFLNRY GVAPGRKLVF ATSGASAYQA ALDARAAGLD VTLVDLRLEA
DCGPELARLR SAGVDVLTGH TVVGSKGRKR VTGLIVAPVG SDGRCGGRRI LPCDCVGMSG
GWTPAVHLFS QSRGKLAYDE GIDAFVPSRS AQDERSAGAA RGSYDLAACL AEGFAAGAAA
AGSDARQDFR ATETLTGFQP VRIMPTDANP TKVRAFVDYQ NDVTAKDIKL AVREGFQSIE
HVKRYTTTGM ATDQGKTSNM NALGIVAGQL DKALPAVGTT TFRPPYTPVT FGALVGPARH
ALFDPIRTTP IHEWAEAHGA LFENVALWRR AWYFPKAGED LHAAVARECK AVREGVGIFD
ASTLGKIEIV GRDAAEFMNR LYINPWTKLA PGRCRYGLML KEDGYILDDG VVARVSDTCF
HVTTTTGGAA RVLGHMEDYL QTEWPELEVF LTSTTEQWAV IALQGPKARA VIAPLVDGID
LSPDAFPHMA MRSGTICGVP TRLFRVSFTG ELGFEINVPA DHARAVWEAV FEAGRAHGIT
PYGTETMHVL RAEKGYIIVG QETDGTVTPD DVGMAGMIPK AKGDFVGKRS LARPDVVATG
RKQLVGLMTD DPKLVLDEGA QIVTDTHQPI PMRMLGHVTS SYWSANCGRS IALALVEGGR
ERMNGHLFVT TPDGFTRVTV CEPVFFDVQG ERINA