Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_2750 |
Symbol | |
ID | 5323620 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 2867642 |
End bp | 2870635 |
Gene Length | 2994 bp |
Protein Length | 997 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640791695 |
Product | sarcosine oxidase alpha subunit family protein |
Protein accession | YP_001328415 |
Protein GI | 150397948 |
COG category | [E] Amino acid transport and metabolism [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase) [COG0492] Thioredoxin reductase |
TIGRFAM ID | [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGGGG TCAACCGCAT TTCAGGCGCG GGCCGCCTGA CACCGGCGCG CACCGCCCGC TTCACCTTCG ACGGCCGGAC GCTGACGGCG CTCGAGGGCG ACACCATCGC CTCGGCGCTC ATTGCCAACG ACATTCATCT CGTCGGCCGT TCGTTCAAAT ATCACCGTCC GCGCGGCATT CTTTCCGCAG GCGCCGAGGA GCCGAATGCT TTGCTCGACG TTTCTCGCGA TGCCGCCCGC CGGCAGCCGA ACGTCCGCGC CACCGTGCAG GAGGTCTTCG ACGGCATGAG AGTGTCGTCG CAGAACCGTT GGCCTTCGCT TGCCTTCGAC GTCGGCGGTT TCAACGATCT CTTGTCGCCC TTCTTCGCAG CGGGATTTTA TTACAAGACC TTCATGTGGC CGAAAGCCGC CTGGCATAAG CTCTATGAGC CCTTTATCCG TCGGGCCGCC GGTCTCGGTG TAGCGCCGAC GGAGACAGAC CCTGACCATT ATGCAAGCCG CTATGTCCAT TGCGACGTGC TTGTGGTCGG CGCCGGCGCC GCGGGGCTTG CGGCGGCACT TGCGGCGGCT AATGCCGGCG CGAAGGTGAT CCTGTGCGAC GAACAGCCGG CTGTCGGCGG TGCCCTGCAT TACGACAGCG GCAGCGAGAT CGACGGCAGG GCGGGCTATG ACTGGGCGCT GGCGACGGGC AAGGCGCTGG CGGCAATGGA CAATGTCACG CTGCTGACCC GCACGACGGC CTTCGGATAT TACAACCACA ATTTCGTAGG CCTCGTAGAG CGCGTGACGG ATCACCTGCC CGCGCCCGAC AAGGCGCTCC CGCGTGAGCG GCTGTGGCAG GTGCGCGCGA AAAAGGTCAT TCTCGCCAAC GGCGCCATCG AGCGCCACAT GGTCTTCCCC AACAATGATC GTCCCGGGAT CATGCTGGCC TCGGCCGGCC GCACCTATCT CAACCATTTC GGCGTCGCCG TCGGCAAAAA GGTGGGCATC TATGCAGCCC ATGATTCCGC CTATGAGGCG GCCTTCGACC TCAGAAAGGC CGGCATCGAC ATCCCTGCTA TCGTCGATTG CCGCGAAAAG CCGGGCGACA TGGTGCTTGC GGAGGCGCGA AGCCTCGGCA TCGAAGTGCT GAGCGGTCAA TCGGTCGTCA ACACATCCGG CAAGCTGCGC ATCTCCTCCA TCAGCGTCGC TCGGAACGGC GGCGGAGCGG CACGCAAGAT CGCGGTCGAC GCACTGCTGG TCTCTGCCGG TTGGATACCT TCGGTGCATC TCTTCTCGCA GTCGCGCGGC AAGGTGACCT TCGATGCGGC GACGGAGCGG TTCCTGCCGG GAACCTATGC GCAAGAATGC CTCTCCGTTG GCGCCTGCAA CGGCACGGAC GACCTGCAGG CGACGATCGA CGAGGCGCTT GCCGCCGGTG AACTGGCAGC CCGTGCAGCG GGTGCGGAAG GTGGCGTGCA GGTTGCGCTC TCCGGCCGCA ATGCCTTCGA ATGGACGGGC GGCATGATCG GCGCAGCGGA AGGCGCGGGG CAGGATACGA CGGTCAAGGC CTTCATCGAC TTCCAGCACG ACGTCTGCGC GAAGGATATC CGCCTGGCGG TGCGAGAAGG GATGCATTCG ATCGAGCACA TCAAGCGGTT CACGACCAAC GGGATGGCAT CCGACCAGGG TAAGCTCTCG AACATGCATG GCCTTGCGAT CGCCGCCGAA GCGCTGGGCA AGGAAATCCC TCAGGTGGGG CTCACGACCT TCCGCCAGCC CTACACGCCG GTGACCTTCG GGACCATCGT CAGCCACTCG CGCGGAAATC TCTTCGATCC CGCCAGAAAG ACGCCGATCC ATGCGTGGGA GGAGGCGCAT GGCGCCGAGT TCGAGGACGT CGGCAACTGG AAACGCGCCT GGTTCTATCC GAAGGCGGGC GAGAACATGC ACGAGGCGGT CGCGCGCGAA TGCAAAACCG TTCGCGACGT GGCCGGAGTC TTCGATGCAT CGACCCTCGG AAAAATCGAG GTGGTCGGCC CTGATGCAGC CGCGTTCCTG AACCTCATGT ATACCAATGC CTGGGACAAT CTGAAGCCTG GCCGCTGCCG CTACGGCATC ATGCTTCGCG ATGACGGCTT CGTCTATGAC GACGGCGTCG TCGGCCGTCT GGCCGATGAC CGCTTCCATG TGACGACGAC GACCGGCGGC GCACCGCGGG TGCTGCACCA CATGGAGGAC TATCTTCAGA CGGAGTTCCC GCATCTGAAG GTGTGGCTGA CTTCGACGAC CGAGCAATGG GCCGTTATCG CCGTACAGGG GCCGAGGGCG CGCGAGATCA TCGCGCCGCT TGTCGAAGGC ATCGATCTAT CGAAAGAGGC CTTCCCGCAT ATGAGTGTTG CGGAAGGGAG CATTTGCGGT GTTCCGACCC GGCTCTTCCG AATGTCGTTC ACCGGCGAGC TCGGTTTCGA AGTCAACGTT CCCGCCGATT TCGGGCAGGC CGTGTGGGAA GCGATCTGGG CAAGGGCCGA GCCGATGGGA GCCTGCGCCT ACGGCACGGA GACAATGCAC GTTCTGCGCG CCGAGAAGGG ATACATCATC GTCGGCCAGG ACACGGACGG CACTCTTACG CCTGAAGATG CCGGCCTCTC CTGGGCAGTT TCGAAGAAAA AGCCGGATTT CGTCGGCATT CGCGGAATGA AGCGGCCGGA TCTGGTCAAG GAAGGCCGCA AGCAGCTTGT CGGGCTGCTT GCGAAGGACC CGCAGGTGGT GCTGGAGGAA GGGGCGCAGA TCGTTGCCGA TCCGAACCAG CCGAAGCCGA TGACCATGCT TGGCCATGTG ACCTCGTCCT ACTGGTCGCC GAACTGCGGC CGTTCGATCG CGCTGGCGGT GGTCGCCGGC GGCCGCGCGC GCCACGGGCA GACGCTCTAT GTGCCGATGG CCGACCGGAC GATCGCCGTC GAGGTAAGCG ACATGGTGTT TTTTGACAAG GAAGGAGGTC GCCTCCATGG CTGA
|
Protein sequence | MTGVNRISGA GRLTPARTAR FTFDGRTLTA LEGDTIASAL IANDIHLVGR SFKYHRPRGI LSAGAEEPNA LLDVSRDAAR RQPNVRATVQ EVFDGMRVSS QNRWPSLAFD VGGFNDLLSP FFAAGFYYKT FMWPKAAWHK LYEPFIRRAA GLGVAPTETD PDHYASRYVH CDVLVVGAGA AGLAAALAAA NAGAKVILCD EQPAVGGALH YDSGSEIDGR AGYDWALATG KALAAMDNVT LLTRTTAFGY YNHNFVGLVE RVTDHLPAPD KALPRERLWQ VRAKKVILAN GAIERHMVFP NNDRPGIMLA SAGRTYLNHF GVAVGKKVGI YAAHDSAYEA AFDLRKAGID IPAIVDCREK PGDMVLAEAR SLGIEVLSGQ SVVNTSGKLR ISSISVARNG GGAARKIAVD ALLVSAGWIP SVHLFSQSRG KVTFDAATER FLPGTYAQEC LSVGACNGTD DLQATIDEAL AAGELAARAA GAEGGVQVAL SGRNAFEWTG GMIGAAEGAG QDTTVKAFID FQHDVCAKDI RLAVREGMHS IEHIKRFTTN GMASDQGKLS NMHGLAIAAE ALGKEIPQVG LTTFRQPYTP VTFGTIVSHS RGNLFDPARK TPIHAWEEAH GAEFEDVGNW KRAWFYPKAG ENMHEAVARE CKTVRDVAGV FDASTLGKIE VVGPDAAAFL NLMYTNAWDN LKPGRCRYGI MLRDDGFVYD DGVVGRLADD RFHVTTTTGG APRVLHHMED YLQTEFPHLK VWLTSTTEQW AVIAVQGPRA REIIAPLVEG IDLSKEAFPH MSVAEGSICG VPTRLFRMSF TGELGFEVNV PADFGQAVWE AIWARAEPMG ACAYGTETMH VLRAEKGYII VGQDTDGTLT PEDAGLSWAV SKKKPDFVGI RGMKRPDLVK EGRKQLVGLL AKDPQVVLEE GAQIVADPNQ PKPMTMLGHV TSSYWSPNCG RSIALAVVAG GRARHGQTLY VPMADRTIAV EVSDMVFFDK EGGRLHG
|
| |