Gene Information |
Locus tag | Mext_3738 |
Symbol | |
ID | 5832221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 4140140 |
End bp | 4143127 |
Gene Length | 2988 bp |
Protein Length | 995 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641369528 |
Product | sarcosine oxidase alpha subunit family protein |
Protein accession | YP_001641183 |
Protein GI | 163853140 |
COG category | [E] Amino acid transport and metabolism [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase) [COG0492] Thioredoxin reductase |
TIGRFAM ID | [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
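The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the record lists the gene as unspliced). The function names `span_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4140140
END_BP = 4143127
GENE_LENGTH_BP = 2988
PROTEIN_LENGTH_AA = 995


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming its last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the fields agree: the span from 4140140 to 4143127 covers 2988 bp, which is 996 codons, or 995 residues once the stop codon is excluded.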
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.161816 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.187034 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACAGC CGTTCAGATT GTCCCGCGGC GGACGCATCG ACCGCACCCG CCCCATCGTC TTCGAATTCA ACGGCAAGCC GGTCCACGGA TTCGCCGGCG ACACCGTCGC CTCGGCGCTG CTGGCCAACG GCATCCACCT CGTCGGGCGC TCGTTCAAGT ACCACCGCCC CCGCGGCATC CTGAGCCATG GCCCCGACGA GCCGAGCGCG CTGCTCTCGG TCGATCGTGG GCCCGGCCGG ATCGACCCGA ACAACCGCGC CTCCGTGGTC GAGGCGCGCT CGGGCCTGCG CACGACCTCG CAGAACCATT GGCCGTCGCT CGAATTCGAC GTCGGCGCCG TCAATGATTT GCTGTCGCCG GTCTTCGTGG CGGGCTTCTA CTACAAGACT TTCATGTGGC CCCGGAAGTT CTGGGACCGG GTCTATGAGC CGTTCATCCG CGCCGCCGCC GGTCTCGGAA AGGCGCCGAC GGTGGCCGAT CCCGACCGCT ACGCCAACCG CCACGCCCAT TGCGATGTGC TGATCGTCGG CGCCGGCCCG GCGGGGCTTG CTGCGGCGCT CGCCGCGGCG CGTACCGGCA AGCGGGTGAT CCTCGCCGAC GAGGGCGCGG AGCCCGGCGG CACGCTCCTG CACGACACGA CCTCGCAGAT CGACGGTCGC CCGGCGGCGG ACTGGCTCGC CGAAACGCTG GCCGAGCTCG ATGCCCGCGA GAACGTCATC CTGCTGCCCC GCACCACCGC CTTCGGCTAT TACAACCACA ACCACGTGGC GATGACCGAG CGCGTCACCG ACCACCTGTC GTCCGCCGCG GGCCAAGCGC CCCGCGAGCG CCTGTGGCAG GTGCGGGCGG AGCAGGTCGT GCTCGCCGGT GGCTCCCACG AGCGCCCCCT CGTTTTCGCC GACAACGACC GGCCGGGCAT CCTGCTCGCC GAGAGCGTGC GGGTCTTCCT CAACCGTTAC GGCGTGGCGC CGGGCCGCAA GCTCGTCTTC GCCACGAGCG GCGCCTCCGC CTACCAAGCC GCGCTCGATG CGCGTGCGGC GGGCCTCGAC GTCACCCTCG TCGATCTGCG CCTAGAAGCG GATTGCGGAC CGGAGTTGGC ACGCCTGCGC AGCGCCGGGG TCGACGTATT GACCGGCCAC ACCGTGGTCG GATCGAAGGG CCGGAAGCGC GTCACGGGTC TCATCGTGGC GCCTGTCGGG AGCGACGGCC GGTGCGGCGG CCGTCGCATT CTCCCTTGCG ACTGCGTCGG CATGTCCGGC GGCTGGACGC CCGCCGTCCA CCTGTTCTCG CAGTCCCGCG GCAAGCTCGC CTACGATGAG GGCATCGATG CCTTCGTGCC GAGCCGCTCG GCGCAAGACG AGCGCTCGGC GGGCGCGGCC CGCGGCAGCT ACGACCTCGC CGCCTGCCTC GCGGAGGGCT TCGCCGCCGG TGCCGCCGCG GCTGGTTCCG ACGCACGGCA GGACTTCAGG GCGACGGAGA CGCTGACCGG TTTCCAGCCG GTGCGGATCA TGCCCACCGA CGCGAACCCG ACCAAGGTCC GCGCCTTCGT CGACTACCAG AACGACGTCA CCGCCAAGGA CATCAAGCTC GCGGTGCGCG AGGGCTTCCA GTCGATCGAG CACGTCAAGC GCTACACCAC GACCGGCATG GCGACCGACC AGGGCAAGAC CTCGAACATG AACGCGCTCG GCATCGTCGC CGGGCAGCTC GACAAGGCGC TGCCCGCCGT CGGCACCACG ACCTTCCGGC CGCCCTACAC CCCGGTGACC TTCGGCGCGC TGGTGGGCCC GGCCCGCCAC 
GCCCTGTTCG ATCCGATCCG CACCACTCCG ATCCACGAAT GGGCCGAGGC CCACGGCGCC CTGTTCGAGA ACGTCGCCCT GTGGCGGCGC GCCTGGTACT TCCCGAAGGC GGGCGAGGAT CTGCACGCCG CGGTCGCCCG CGAGTGCAAG GCGGTGCGCG AGGGCGTCGG CATCTTCGAC GCCTCGACGC TGGGCAAAAT CGAGATCGTC GGCCGGGATG CGGCCGAGTT CATGAACCGC CTCTACATCA ACCCCTGGAC CAAGCTCGCC CCCGGGCGCT GCCGCTACGG GCTGATGCTG AAGGAGGACG GCTACATCCT CGATGACGGC GTCGTCGCCC GCGTGTCGGA CACCTGCTTC CACGTCACCA CCACCACCGG CGGCGCCGCC CGCGTGCTCG GCCACATGGA GGATTATCTC CAGACCGAGT GGCCGGAGCT TGAAGTGTTC CTGACCTCGA CCACCGAGCA ATGGGCGGTG ATCGCGCTCC AGGGCCCGAA GGCCCGCGCC GTGATCGCGC CGCTCGTCGA CGGCATCGAT CTGTCGCCGG ACGCCTTCCC GCATATGGCG ATGCGCTCAG GCACGATCTG CGGCGTGCCG ACCCGGCTGT TCCGGGTGTC GTTCACCGGT GAACTCGGCT TCGAGATCAA CGTGCCCGCC GACCACGCCC GCGCGGTCTG GGAGGCGGTG TTCGAGGCGG GCCGGGCCCA CGGCATCACG CCCTACGGCA CCGAGACGAT GCACGTGCTG CGCGCCGAGA AGGGCTACAT CATCGTCGGC CAGGAGACCG ACGGCACGGT GACCCCGGAC GATGTCGGCA TGGCCGGCAT GATCCCGAAG GCCAAGGGAG ACTTCGTCGG CAAGCGCTCG CTGGCGCGCC CCGACGTCGT TGCCACCGGC CGCAAGCAGC TCGTCGGCCT CATGACCGAT GACCCTAAGC TCGTCCTCGA CGAGGGCGCG CAGATCGTCA CGGATACCCA TCAGCCGATC CCGATGCGCA TGCTCGGCCA CGTCACGTCG AGCTACTGGA GCGCCAATTG CGGCCGCTCC ATCGCGCTGG CCCTGGTCGA GGGCGGACGC GAGCGGATGA ACGGCCATCT CTTCGTCACC ACGCCGGACG GGTTCACCCG CGTCACCGTC TGCGAGCCGG TCTTCTTCGA CGTCCAGGGG GAGCGCATCA ATGCTTGA
|
Protein sequence | MAQPFRLSRG GRIDRTRPIV FEFNGKPVHG FAGDTVASAL LANGIHLVGR SFKYHRPRGI LSHGPDEPSA LLSVDRGPGR IDPNNRASVV EARSGLRTTS QNHWPSLEFD VGAVNDLLSP VFVAGFYYKT FMWPRKFWDR VYEPFIRAAA GLGKAPTVAD PDRYANRHAH CDVLIVGAGP AGLAAALAAA RTGKRVILAD EGAEPGGTLL HDTTSQIDGR PAADWLAETL AELDARENVI LLPRTTAFGY YNHNHVAMTE RVTDHLSSAA GQAPRERLWQ VRAEQVVLAG GSHERPLVFA DNDRPGILLA ESVRVFLNRY GVAPGRKLVF ATSGASAYQA ALDARAAGLD VTLVDLRLEA DCGPELARLR SAGVDVLTGH TVVGSKGRKR VTGLIVAPVG SDGRCGGRRI LPCDCVGMSG GWTPAVHLFS QSRGKLAYDE GIDAFVPSRS AQDERSAGAA RGSYDLAACL AEGFAAGAAA AGSDARQDFR ATETLTGFQP VRIMPTDANP TKVRAFVDYQ NDVTAKDIKL AVREGFQSIE HVKRYTTTGM ATDQGKTSNM NALGIVAGQL DKALPAVGTT TFRPPYTPVT FGALVGPARH ALFDPIRTTP IHEWAEAHGA LFENVALWRR AWYFPKAGED LHAAVARECK AVREGVGIFD ASTLGKIEIV GRDAAEFMNR LYINPWTKLA PGRCRYGLML KEDGYILDDG VVARVSDTCF HVTTTTGGAA RVLGHMEDYL QTEWPELEVF LTSTTEQWAV IALQGPKARA VIAPLVDGID LSPDAFPHMA MRSGTICGVP TRLFRVSFTG ELGFEINVPA DHARAVWEAV FEAGRAHGIT PYGTETMHVL RAEKGYIIVG QETDGTVTPD DVGMAGMIPK AKGDFVGKRS LARPDVVATG RKQLVGLMTD DPKLVLDEGA QIVTDTHQPI PMRMLGHVTS SYWSANCGRS IALALVEGGR ERMNGHLFVT TPDGFTRVTV CEPVFFDVQG ERINA
|