Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_3287 |
Symbol | |
ID | 5324171 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 3478053 |
End bp | 3481016 |
Gene Length | 2964 bp |
Protein Length | 987 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640792239 |
Product | sarcosine oxidase alpha subunit family protein |
Protein accession | YP_001328944 |
Protein GI | 150398477 |
COG category | [E] Amino acid transport and metabolism [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase) [COG0492] Thioredoxin reductase |
TIGRFAM ID | [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTCCT ATCGCCTTCC GAACCTTGGT CTTGTCAGCC GAGACACACC CGTCTCCTTT ACCTTCGACG GAAAGCCGAT GCAGGGCCTT CAGGGCGACA CGCTCGCCTC GGCGCTGCTC GCCAACGGAC GGATGCTCGT CGGCCGCAGC TTCAAATATC ACCGGCCTCG TGGAATTTTG ACCGCGGGAG CCGCCGAACC GAACGCACTC GTCACCATTG GCCATGGCGG CCGGACCGAG CCGAATACGC GCGCGACGAT GCAGGAGCTC TACGAGGGTC TCGAGGCACA GAGCCAGAAC CGCTGGCCCT CGCTCGATTT CGACCTGGGT GCATTGAACG GTATCCTGTC GCCCTTTCTC GGCGCCGGCT TCTACTACAA GACTTTCATG TGGCCGGCGC CGCTCTGGGA GAAGCTCTAC GAGCCGATCA TCCGCAAGGC GGCCGGCCTC GGCAAGGCAA GCTACGAGGC AGACCCCGAC GCCTATGAGA AGAGCTGGGC GCATTGCGAC CTGCTCGTCA TCGGCGCCGG CCCGACGGGA CTTGCCGCGG CGCTTACCGC CGGCCGCGCC GGTGCCCGGG TCATTCTCCT GGATGAGGGC TCGCTCCCCG GCGGGTCGCT GCTGTTCGAG ACGGCGATGA TCGACGGCAA AGCGGCCGCT CAATTCGCCC GTGACACAAG CGATGAATTG CGCTCGATGC CGAATGTCCG GTTCATGATG CGCACCACCG CCTTCGGTTG GTACGACGGC AATGTTTTCG GGGCCGTCGA ACGGGTACAG AAACATGTGC GGGAGCCGGT GCCCTCCCTG CCGGTCGAGC GGCTATGGCG CATCGCCACC AAAAAGGCCC TGCTTGCGAG CGGCGCGGAA GAACGCCCGC TCGTCTTCGG CGGCAACGAT CGTCCGGGCG TTATGATGGC AAGTGCGATG CGCGCCTACC TCAATCGATA CGGAGTCGCT CCGGGCCGGG CGACCGCGAT CTTCACCACC AACGACAGCG GCTATACCCT TGCACGCGAT CTAGAGGCGG CGGGCGTCGA CGTTGCTGCC ATAGTCGACA GCCGCCCGGC CGCAGGCGTG GACTATCGCG GCAAAGCGCG CCTGATCCGG GAAGCTGTCG TTTGCGGCGT AACGGGCCGC AAGGCAATCT CTGCGATCGA GGTCCATCGC GGCGATCGGA CGGAGTCGAT CGCGGTCGAT GCGCTCGCAA TGGCGGGTGG TTTCGACCCG ATCATCCATC TTGCCTGCCA CCGGGGCGGC AGGCCCGTCT GGTCGGCGGA AAAAACCGCC TTTCTCGCAC CTGGGAGCTT GAACGGCCTC GAGGTTGCCG GCGGCGCAGC AGCGACTTCG GGGCTCGCGG CCTGCCTCGG AGAAGGCATT GCCCAAGCTC AGGCTGCCCT CGAGGGCGTC GGCCTGCGGT GCCCGCCGAT GGACCTTCCG AAGGTCGAGG GAGACGACGC TGCATATTCT TCAAATCCAC TGTGGTCGAT CCCTGGCGTC AAGGACAAAG CCTTCATCGA TTTCCAGAAC GACGTTCATC TCAAGGATAT CGGGCTGGCC GTCCGCGAAG GCTATGGACA TGTCGAGCTT GCCAAACGAT ACACCACCAC CGGCATGGCG ACGGACCAGG GCAAGCTTTC CAACGTGAAT GCAATCGGAC TGATCGCCAA AGCACGCGGC GTCTCGCCTG CCGAGGTCGG GACGACGACG TTCCGCCCTT TCTATACGCC GGTGTCCTTC GGTGCGCTGA CCGGCGCACA TGCGGGACAT CATTTCCAGC CGGTCCGCAA GTCCCCCCTC CATGACTGGG CGAAAAAGCA CGGCGCCGTC TTCGTCGAGA CGGGTCTCTG GTATCGCTCC GCCTGGTTTC CGAAAAGCGG TGAACGGAGC TGGCGGGAGA GCGTCGAGCG AGAAGTGCTG AACGTTCGCA AGAATGCCGG ACTTTGCGAC GTCTCGATGC TCGGCAAGAT CGAGTTATCC GGAAGCGACG CCGCCGAATT CCTCAACCGC GTATATTGCA ACGCCTTCCT CAAACTGCCG GTCGGAAAGG CCCGCTACGG GCTCATGCTG CGCGAAGACG GCTTCATTTA CGACGACGGC ACGACGAGTC GCCTCGCCGA GAATCGCTTT TTCATGACGA CGACCACCGC CTATGCGGCC GGCGTCATGA ACCATCTCGA GTTCTGCGCG CAGGTCCTCT GGCCCGAACT CGACGTCCGC CTCGCCTCCG TCACCGACCA ATGGGCGCAG ATGGCTGTTG CCGGACCAAA GGCGCGCATG ATCCTGCAGA AGATCGTCGA CGACGACATA TCCGACGCAG CCTTCCCGTT TCTCGCAGCG AAGGAGGTCT CCCTGTTCGG CGGCGCCCTT CACGGCTGCC TGTTTCGAAT TTCATTCTCC GGTGAGCTCG CCTACGAGAT AGCGGTGCCG GCCGGCTACG GCGAAAGCGT TGCCGACGCG CTCCTGGACG CGGGGAAGGA CCACGGTATC ATGCCCTATG GCGTCGAGAC GCTTAGCGTC CTGCGCATCG AAAAGGGCCA TGTGACGCAC AACGAGATCA ACGGCACGGT CGTTCCGGCC GATCTGGGCT TCGGTAAAAT GGTGTCGGCC ACCAAGCTGG ATTTCATCGG CAAGGCGATG CTCCAGCGCG AGGGGCTGGC CGCGTCCGGC AGGCCGCAAC TCGTGGGCGT CGTGCCGATC GATCCGAAGC ACTCTTTCCG CAGCGGTTCG CATATTCTCG CCAAGGGAGC GGAAGCCACG CTCGAGAACG ACGAGGGCTA TGTAACGTCG AGCGCCTACT CCCCGCATGT CGGATCGACC ATTGCTCTGG CACTCGTCCA CAACGGGCAG AGCCGCCACG GCGAAGAGGT GCTGGTATGG AGTGGCCTTC ACGGAGAATC CACGCCTGCG CGTCTGTGCC ACCCGGTTTT CTTCGACCCT CAGAACGAGA GGCTCCATGT CTGA
|
Protein sequence | MSSYRLPNLG LVSRDTPVSF TFDGKPMQGL QGDTLASALL ANGRMLVGRS FKYHRPRGIL TAGAAEPNAL VTIGHGGRTE PNTRATMQEL YEGLEAQSQN RWPSLDFDLG ALNGILSPFL GAGFYYKTFM WPAPLWEKLY EPIIRKAAGL GKASYEADPD AYEKSWAHCD LLVIGAGPTG LAAALTAGRA GARVILLDEG SLPGGSLLFE TAMIDGKAAA QFARDTSDEL RSMPNVRFMM RTTAFGWYDG NVFGAVERVQ KHVREPVPSL PVERLWRIAT KKALLASGAE ERPLVFGGND RPGVMMASAM RAYLNRYGVA PGRATAIFTT NDSGYTLARD LEAAGVDVAA IVDSRPAAGV DYRGKARLIR EAVVCGVTGR KAISAIEVHR GDRTESIAVD ALAMAGGFDP IIHLACHRGG RPVWSAEKTA FLAPGSLNGL EVAGGAAATS GLAACLGEGI AQAQAALEGV GLRCPPMDLP KVEGDDAAYS SNPLWSIPGV KDKAFIDFQN DVHLKDIGLA VREGYGHVEL AKRYTTTGMA TDQGKLSNVN AIGLIAKARG VSPAEVGTTT FRPFYTPVSF GALTGAHAGH HFQPVRKSPL HDWAKKHGAV FVETGLWYRS AWFPKSGERS WRESVEREVL NVRKNAGLCD VSMLGKIELS GSDAAEFLNR VYCNAFLKLP VGKARYGLML REDGFIYDDG TTSRLAENRF FMTTTTAYAA GVMNHLEFCA QVLWPELDVR LASVTDQWAQ MAVAGPKARM ILQKIVDDDI SDAAFPFLAA KEVSLFGGAL HGCLFRISFS GELAYEIAVP AGYGESVADA LLDAGKDHGI MPYGVETLSV LRIEKGHVTH NEINGTVVPA DLGFGKMVSA TKLDFIGKAM LQREGLAASG RPQLVGVVPI DPKHSFRSGS HILAKGAEAT LENDEGYVTS SAYSPHVGST IALALVHNGQ SRHGEEVLVW SGLHGESTPA RLCHPVFFDP QNERLHV
|
| |