Gene Information |
Locus tag | Mpop_3993 |
Symbol | |
ID | 6312049 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium populi BJ001 |
Kingdom | Bacteria |
Replicon accession | NC_010725 |
Strand | + |
Start bp | 4258864 |
End bp | 4261851 |
Gene Length | 2988 bp |
Protein Length | 995 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 642652691 |
Product | sarcosine oxidase, alpha subunit family |
Protein accession | YP_001926650 |
Protein GI | 188583205 |
COG category | [E] Amino acid transport and metabolism [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase) [COG0492] Thioredoxin reductase |
TIGRFAM ID | [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
|
|
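The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record, but the record itself does not state them. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 4258864
END_BP = 4261851
GENE_LENGTH_BP = 2988
PROTEIN_LENGTH_AA = 995


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming the CDS ends with a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print("span length (bp):", span_length(START_BP, END_BP))
    print("expected protein length (aa):", expected_protein_length(GENE_LENGTH_BP))
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields listed in the table.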
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACAGC CGTTCAGATT GTCCCGCGGC GGACGCATCG ACCGCAGCCG CCCCATCGTC CTCACGTTCA ACGGCAAGCC CGTCCACGGC TATGCCGGCG ACACCGTCGC CTCGGCGCTC TTGGCCAACG GCATCCACCT CGTCGGGCGC TCGTTCAAGT ATCACCGCCC CCGCGGCATC CTGAGCCACG GCCCCGACGA GCCGAGCGCG CTGCTCTCCG TCGATCGCGG GCCCGGCCGG ATCGACCCGA ACAACCGCGC CTCCGTGGTC GAGGCGCGCT CGGGGCTCCG CACGACCTCG CAGAACCACT GGCCGTCGCT CGAATTCGAT GTCGGCGCGG TCAACGACCT CTTGTCGCCG GTCTTCGTGG CGGGCTTCTA CTACAAGACC TTCATGTGGC CCCGGACGTT CTGGGACCGG GTCTACGAGC CGTTCATCCG CGCCGCCGCC GGTCTCGGCA AGGCGCCGAC GGTGGCCGAT CCCGACCGCT ACGCCAACCG CCACGCCCAT TGCGACGTGC TGATCGTCGG CGCCGGCCCG GCGGGTCTCG CCGCCGCGCT CGCCGCGGCC CGCACCGGCA AGCGGGTGAT CCTCGCCGAC GAGGGCGCGG AGCCCGGCGG CGCGCTCCTG CACGACACGA CGTCCGAGAT CGACGGACGC CCGGCGGCCG ACTGGCTCGC CGAGACGCTG ACCGAGCTCG ATGCCCGCGA GAACGTCATC CTGCTGCCCC GGACCACCGC CTTCGGTTAC TACAACCACA ACCACGTGGC GATGACCGAG CGCGTCACCG ACCACCTGAC CTCCACCGCC GGCCAAGCGC CCCGCGAGCG CCTGTGGCAG GTGCGGGCGG AACAGGTCGT GCTCGCCGGC GGCGCCCATG AGCGCCCCCT CGTCTTCGCC GACAACGACC GGCCGGGCAT CCTGCTCGCC GAAAGCGTGC GGGTCTTCCT CAACCGCTAC GGCGTGGCGC CGGGCCGCCG GCTCGTCTTC GCCACGAGCG GCGCCTCCGC CTACCGGGCC GCACTCGATG CGCGCGCGGC GGGCCTCGAC GTCACCCTCG TCGACCTGCG CCTGGAAGCG GATTGCGGGC CGGAGCTGGC GCAGTTGCGC GCGGCCGGGG CCGACGTGTT GACCGGCCAC ACCGTGGTCG GATCGAAGGG CCGCAAGCGC GTCACGGGTC TCATCGTGGC GCCTGTCGGG AGCGACGGCC GGTGCGGCGG CCGTCGCCTT CTCGCCTGCG ATTGCGTCGG CATGTCCGGC GGCTGGACGC CCGCCGTCCA CCTGTTCTCG CAGTCCCGCG GCAAGCTCGC CTACGACGAG GCGATCGATG CCTTCGTACC GAGCCGCTCG GCGCAAGAGG AGCGCTCGGC GGGTGCGGCC CGCGGCACCT ACGACCTCGC CGCCTGCCTT GGCGACGGCT TCGCCGCCGG CGCTGCCGCC GCGGGCTCCG AGGCGCGTCA GGACTTCAAG GCGACAGCGA CGCTCACAGG CTTCCAGCCG GTGCGGATCA TGCCGACCGA CGCCGACCCG ACCAAGGTCC GCGCCTTCGT CGACTACCAG AACGACGTCA CCGCCAAGGA CATCAAGCTC GCGGTCCGCG AGGGCTTCCA GTCGATCGAG CACGTCAAGC GCTACACCAC GACCGGCATG GCGACCGATC AGGGCAAGAC CTCGAACATG AACGCGCTCG GCATCGTCGC CGGGCAGCTC GACAAGGCGC TGCCGGCGGT GGGCACCACC ACCTTCCGGC CGCCCTACAC CCCGGTGACC TTCGGCGCCC TGGTGGGCCC CGCCCGCCAC 
GCCCTGTTCG ATCCGATCCG CACCACCCCG ATCCACGAAT GGGCGGAGGC GCACGGCGCC CTGTTCGAGA ACGTCGCCCT GTGGCGGCGG GCCTGGTACT TCCCCAAGGA CGGCGAGGAC CTGCACGCGG CGGTCGCCCG CGAGTGCAAG GCGGTGCGCG AGGGCGTCGG CATCTTCGAC GCCTCGACGC TGGGCAAGAT CGAGATCGTC GGCCGGGACG CGGCCGAGTT CATGAACCGC CTCTACATCA ACCCCTGGAC CAAGCTCGCC CCCGGACGCT GCCGCTACGG GCTGATGCTC AAGGAGGACG GCTACATCCT CGACGACGGC GTCGTCGCCC GCGTCTCGGA CACCTGCTTC CACGTCACCA CCACGACCGG CGGCGCCGCC CGCGTGCTGG CCCATATGGA GGACTACCTC CAGACCGAAT GGCCGGAGCT CGAGGTGTTC CTGACCTCGA CCACCGAGCA ATGGGCGGTG ATCGCGCTCC AGGGCCCGAA GGCCCGCGAC GTGATCGCGC CGCTGGTCGA CGGCATCGAC CTCTCGCCGG AGGCCTTTCC GCACATGGCG ATGCGCTCGG GCACGATCTG CGGCGTGCCG ACCCGGCTGT TCCGGGTCTC GTTCACCGGC GAGCTCGGCT TCGAGATCAA CGTCCCCTCC GACCACGCCC GCGCGGTCTG GGAGGCGGTG TTCGAGGCGG GCCGGGCTCA CGGCATCACG CCCTACGGCA CCGAGACGAT GCACGTGCTG CGCGCCGAGA AGGGCTACAT CATCGTCGGC CAGGAAACCG ACGGCACGGT GACCCCGGAC GATGTCGGCA TGGCCGGGAT GATCCCGAAG GCCAAGGGGG ACTTCGTCGG CAAGCGCTCG CTGGCGCGCC CCGACGTGGT GGCGTCGGGC CGCAAGCAGC TCGTCGGGCT CGTGACCGAC GATCCCAAGC TCGTCCTCGA CGAGGGCGCG CAGATCGTAC CAGACACCGG TCAGCCGATC CCGATGCGCA TGCTCGGCCA CGTCACGTCG AGCTACTGGA GTGCCAATTG CGGACGCTCC ATCGCGCTCG CCCTGGTCGA GGGCGGCCGC GCGCGGATGA ACGGGCATCT CCACGTCACC ACGCCGGACG GCTTTACCCG CGTCACCGTC TGCGAACCGG TCTTCTTCGA CGTCAAGGGG GAGCGCATCC ATGCTTGA
|
Protein sequence | MAQPFRLSRG GRIDRSRPIV LTFNGKPVHG YAGDTVASAL LANGIHLVGR SFKYHRPRGI LSHGPDEPSA LLSVDRGPGR IDPNNRASVV EARSGLRTTS QNHWPSLEFD VGAVNDLLSP VFVAGFYYKT FMWPRTFWDR VYEPFIRAAA GLGKAPTVAD PDRYANRHAH CDVLIVGAGP AGLAAALAAA RTGKRVILAD EGAEPGGALL HDTTSEIDGR PAADWLAETL TELDARENVI LLPRTTAFGY YNHNHVAMTE RVTDHLTSTA GQAPRERLWQ VRAEQVVLAG GAHERPLVFA DNDRPGILLA ESVRVFLNRY GVAPGRRLVF ATSGASAYRA ALDARAAGLD VTLVDLRLEA DCGPELAQLR AAGADVLTGH TVVGSKGRKR VTGLIVAPVG SDGRCGGRRL LACDCVGMSG GWTPAVHLFS QSRGKLAYDE AIDAFVPSRS AQEERSAGAA RGTYDLAACL GDGFAAGAAA AGSEARQDFK ATATLTGFQP VRIMPTDADP TKVRAFVDYQ NDVTAKDIKL AVREGFQSIE HVKRYTTTGM ATDQGKTSNM NALGIVAGQL DKALPAVGTT TFRPPYTPVT FGALVGPARH ALFDPIRTTP IHEWAEAHGA LFENVALWRR AWYFPKDGED LHAAVARECK AVREGVGIFD ASTLGKIEIV GRDAAEFMNR LYINPWTKLA PGRCRYGLML KEDGYILDDG VVARVSDTCF HVTTTTGGAA RVLAHMEDYL QTEWPELEVF LTSTTEQWAV IALQGPKARD VIAPLVDGID LSPEAFPHMA MRSGTICGVP TRLFRVSFTG ELGFEINVPS DHARAVWEAV FEAGRAHGIT PYGTETMHVL RAEKGYIIVG QETDGTVTPD DVGMAGMIPK AKGDFVGKRS LARPDVVASG RKQLVGLVTD DPKLVLDEGA QIVPDTGQPI PMRMLGHVTS SYWSANCGRS IALALVEGGR ARMNGHLHVT TPDGFTRVTV CEPVFFDVKG ERIHA
|
| |
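The sequences above are printed in space-separated blocks of 10 characters, so they need cleaning before any analysis. The sketch below strips that formatting, computes a GC fraction, and translates a CDS codon by codon. It is a minimal illustration, not the database's own method. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code, and it ignores alternative start codons (this gene starts with ATG). All names are hypothetical, and only the first 30 bases of the gene are used as sample input.

```python
# Codon table built from the conventional TCAG ordering.
# Assumption: table 11 assigns amino acids as in the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# First three 10-base blocks of the gene sequence, and the first
# 10-residue block of the protein sequence, copied from the record.
GENE_PREFIX = "ATGGCACAGC CGTTCAGATT GTCCCGCGGC"
PROTEIN_PREFIX = "MAQPFRLSRG"


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks; upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    print(translate(GENE_PREFIX) == PROTEIN_PREFIX)
    print(round(gc_fraction(GENE_PREFIX), 3))
```

Applied to the full gene sequence, the same functions could be used to compare the result against the Protein sequence and GC content fields. That comparison is not performed here.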