Gene Smed_2750 details


Gene Information

Locus tag: Smed_2750
Symbol:
ID: 5323620
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 2867642
End bp: 2870635
Gene Length: 2994 bp
Protein Length: 997 aa
Translation table: 11
GC content: 64%
IMG OID: 640791695
Product: sarcosine oxidase alpha subunit family protein
Protein accession: YP_001328415
Protein GI: 150397948
COG category: [E] Amino acid transport and metabolism
              [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0404] Glycine cleavage system T protein (aminomethyltransferase)
        [COG0492] Thioredoxin reductase
TIGRFAM ID: [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form
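
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the protein length excludes a single terminal stop codon.

```python
# Values copied from the record above.
start_bp = 2867642
end_bp = 2870635
gene_length_bp = 2994
protein_length_aa = 997

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted
# in the reported protein length.
total_codons = span // 3
coding_codons = total_codons - 1

print("span (bp):", span)
print("span divisible by 3:", span % 3 == 0)
print("coding codons:", coding_codons)
print("matches gene length:", span == gene_length_bp)
print("matches protein length:", coding_codons == protein_length_aa)
```

Under those assumptions the span equals the listed gene length, and the codon count minus the stop codon equals the listed protein length.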


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGGGG TCAACCGCAT TTCAGGCGCG GGCCGCCTGA CACCGGCGCG CACCGCCCGC 
TTCACCTTCG ACGGCCGGAC GCTGACGGCG CTCGAGGGCG ACACCATCGC CTCGGCGCTC
ATTGCCAACG ACATTCATCT CGTCGGCCGT TCGTTCAAAT ATCACCGTCC GCGCGGCATT
CTTTCCGCAG GCGCCGAGGA GCCGAATGCT TTGCTCGACG TTTCTCGCGA TGCCGCCCGC
CGGCAGCCGA ACGTCCGCGC CACCGTGCAG GAGGTCTTCG ACGGCATGAG AGTGTCGTCG
CAGAACCGTT GGCCTTCGCT TGCCTTCGAC GTCGGCGGTT TCAACGATCT CTTGTCGCCC
TTCTTCGCAG CGGGATTTTA TTACAAGACC TTCATGTGGC CGAAAGCCGC CTGGCATAAG
CTCTATGAGC CCTTTATCCG TCGGGCCGCC GGTCTCGGTG TAGCGCCGAC GGAGACAGAC
CCTGACCATT ATGCAAGCCG CTATGTCCAT TGCGACGTGC TTGTGGTCGG CGCCGGCGCC
GCGGGGCTTG CGGCGGCACT TGCGGCGGCT AATGCCGGCG CGAAGGTGAT CCTGTGCGAC
GAACAGCCGG CTGTCGGCGG TGCCCTGCAT TACGACAGCG GCAGCGAGAT CGACGGCAGG
GCGGGCTATG ACTGGGCGCT GGCGACGGGC AAGGCGCTGG CGGCAATGGA CAATGTCACG
CTGCTGACCC GCACGACGGC CTTCGGATAT TACAACCACA ATTTCGTAGG CCTCGTAGAG
CGCGTGACGG ATCACCTGCC CGCGCCCGAC AAGGCGCTCC CGCGTGAGCG GCTGTGGCAG
GTGCGCGCGA AAAAGGTCAT TCTCGCCAAC GGCGCCATCG AGCGCCACAT GGTCTTCCCC
AACAATGATC GTCCCGGGAT CATGCTGGCC TCGGCCGGCC GCACCTATCT CAACCATTTC
GGCGTCGCCG TCGGCAAAAA GGTGGGCATC TATGCAGCCC ATGATTCCGC CTATGAGGCG
GCCTTCGACC TCAGAAAGGC CGGCATCGAC ATCCCTGCTA TCGTCGATTG CCGCGAAAAG
CCGGGCGACA TGGTGCTTGC GGAGGCGCGA AGCCTCGGCA TCGAAGTGCT GAGCGGTCAA
TCGGTCGTCA ACACATCCGG CAAGCTGCGC ATCTCCTCCA TCAGCGTCGC TCGGAACGGC
GGCGGAGCGG CACGCAAGAT CGCGGTCGAC GCACTGCTGG TCTCTGCCGG TTGGATACCT
TCGGTGCATC TCTTCTCGCA GTCGCGCGGC AAGGTGACCT TCGATGCGGC GACGGAGCGG
TTCCTGCCGG GAACCTATGC GCAAGAATGC CTCTCCGTTG GCGCCTGCAA CGGCACGGAC
GACCTGCAGG CGACGATCGA CGAGGCGCTT GCCGCCGGTG AACTGGCAGC CCGTGCAGCG
GGTGCGGAAG GTGGCGTGCA GGTTGCGCTC TCCGGCCGCA ATGCCTTCGA ATGGACGGGC
GGCATGATCG GCGCAGCGGA AGGCGCGGGG CAGGATACGA CGGTCAAGGC CTTCATCGAC
TTCCAGCACG ACGTCTGCGC GAAGGATATC CGCCTGGCGG TGCGAGAAGG GATGCATTCG
ATCGAGCACA TCAAGCGGTT CACGACCAAC GGGATGGCAT CCGACCAGGG TAAGCTCTCG
AACATGCATG GCCTTGCGAT CGCCGCCGAA GCGCTGGGCA AGGAAATCCC TCAGGTGGGG
CTCACGACCT TCCGCCAGCC CTACACGCCG GTGACCTTCG GGACCATCGT CAGCCACTCG
CGCGGAAATC TCTTCGATCC CGCCAGAAAG ACGCCGATCC ATGCGTGGGA GGAGGCGCAT
GGCGCCGAGT TCGAGGACGT CGGCAACTGG AAACGCGCCT GGTTCTATCC GAAGGCGGGC
GAGAACATGC ACGAGGCGGT CGCGCGCGAA TGCAAAACCG TTCGCGACGT GGCCGGAGTC
TTCGATGCAT CGACCCTCGG AAAAATCGAG GTGGTCGGCC CTGATGCAGC CGCGTTCCTG
AACCTCATGT ATACCAATGC CTGGGACAAT CTGAAGCCTG GCCGCTGCCG CTACGGCATC
ATGCTTCGCG ATGACGGCTT CGTCTATGAC GACGGCGTCG TCGGCCGTCT GGCCGATGAC
CGCTTCCATG TGACGACGAC GACCGGCGGC GCACCGCGGG TGCTGCACCA CATGGAGGAC
TATCTTCAGA CGGAGTTCCC GCATCTGAAG GTGTGGCTGA CTTCGACGAC CGAGCAATGG
GCCGTTATCG CCGTACAGGG GCCGAGGGCG CGCGAGATCA TCGCGCCGCT TGTCGAAGGC
ATCGATCTAT CGAAAGAGGC CTTCCCGCAT ATGAGTGTTG CGGAAGGGAG CATTTGCGGT
GTTCCGACCC GGCTCTTCCG AATGTCGTTC ACCGGCGAGC TCGGTTTCGA AGTCAACGTT
CCCGCCGATT TCGGGCAGGC CGTGTGGGAA GCGATCTGGG CAAGGGCCGA GCCGATGGGA
GCCTGCGCCT ACGGCACGGA GACAATGCAC GTTCTGCGCG CCGAGAAGGG ATACATCATC
GTCGGCCAGG ACACGGACGG CACTCTTACG CCTGAAGATG CCGGCCTCTC CTGGGCAGTT
TCGAAGAAAA AGCCGGATTT CGTCGGCATT CGCGGAATGA AGCGGCCGGA TCTGGTCAAG
GAAGGCCGCA AGCAGCTTGT CGGGCTGCTT GCGAAGGACC CGCAGGTGGT GCTGGAGGAA
GGGGCGCAGA TCGTTGCCGA TCCGAACCAG CCGAAGCCGA TGACCATGCT TGGCCATGTG
ACCTCGTCCT ACTGGTCGCC GAACTGCGGC CGTTCGATCG CGCTGGCGGT GGTCGCCGGC
GGCCGCGCGC GCCACGGGCA GACGCTCTAT GTGCCGATGG CCGACCGGAC GATCGCCGTC
GAGGTAAGCG ACATGGTGTT TTTTGACAAG GAAGGAGGTC GCCTCCATGG CTGA
 
Protein sequence
MTGVNRISGA GRLTPARTAR FTFDGRTLTA LEGDTIASAL IANDIHLVGR SFKYHRPRGI 
LSAGAEEPNA LLDVSRDAAR RQPNVRATVQ EVFDGMRVSS QNRWPSLAFD VGGFNDLLSP
FFAAGFYYKT FMWPKAAWHK LYEPFIRRAA GLGVAPTETD PDHYASRYVH CDVLVVGAGA
AGLAAALAAA NAGAKVILCD EQPAVGGALH YDSGSEIDGR AGYDWALATG KALAAMDNVT
LLTRTTAFGY YNHNFVGLVE RVTDHLPAPD KALPRERLWQ VRAKKVILAN GAIERHMVFP
NNDRPGIMLA SAGRTYLNHF GVAVGKKVGI YAAHDSAYEA AFDLRKAGID IPAIVDCREK
PGDMVLAEAR SLGIEVLSGQ SVVNTSGKLR ISSISVARNG GGAARKIAVD ALLVSAGWIP
SVHLFSQSRG KVTFDAATER FLPGTYAQEC LSVGACNGTD DLQATIDEAL AAGELAARAA
GAEGGVQVAL SGRNAFEWTG GMIGAAEGAG QDTTVKAFID FQHDVCAKDI RLAVREGMHS
IEHIKRFTTN GMASDQGKLS NMHGLAIAAE ALGKEIPQVG LTTFRQPYTP VTFGTIVSHS
RGNLFDPARK TPIHAWEEAH GAEFEDVGNW KRAWFYPKAG ENMHEAVARE CKTVRDVAGV
FDASTLGKIE VVGPDAAAFL NLMYTNAWDN LKPGRCRYGI MLRDDGFVYD DGVVGRLADD
RFHVTTTTGG APRVLHHMED YLQTEFPHLK VWLTSTTEQW AVIAVQGPRA REIIAPLVEG
IDLSKEAFPH MSVAEGSICG VPTRLFRMSF TGELGFEVNV PADFGQAVWE AIWARAEPMG
ACAYGTETMH VLRAEKGYII VGQDTDGTLT PEDAGLSWAV SKKKPDFVGI RGMKRPDLVK
EGRKQLVGLL AKDPQVVLEE GAQIVADPNQ PKPMTMLGHV TSSYWSPNCG RSIALAVVAG
GRARHGQTLY VPMADRTIAV EVSDMVFFDK EGGRLHG
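
The sequences above are printed in 10-character blocks, so they need to be cleaned before use. The following is a minimal sketch, not part of the original record, of how one might strip the spacing, compute GC content, and translate the reading frame. The helper names (`clean_sequence`, `gc_percent`, `translate`) are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, which this sketch does not model).

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# Standard amino-acid assignments in NCBI base order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate from position 0 in frame, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence listed above.
first_line = (
    "ATGACCGGGG TCAACCGCAT TTCAGGCGCG GGCCGCCTGA CACCGGCGCG CACCGCCCGC"
)
dna = clean_sequence(first_line)
print(len(dna), translate(dna[:30]))
print(round(gc_percent(dna), 1))
```

The first ten codons translate to MTGVNRISGA, which agrees with the start of the listed protein sequence. Running the same functions on the full gene sequence would be one way to compare against the GC content and protein length reported in the record.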