Gene Smed_3287 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3287 
Symbol 
ID5324171 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp3478053 
End bp3481016 
Gene Length2964 bp 
Protein Length987 aa 
Translation table11 
GC content64% 
IMG OID640792239 
Productsarcosine oxidase alpha subunit family protein 
Protein accessionYP_001328944 
Protein GI150398477 
COG category[E] Amino acid transport and metabolism
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0404] Glycine cleavage system T protein (aminomethyltransferase)
[COG0492] Thioredoxin reductase 
TIGRFAM ID[TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTCCT ATCGCCTTCC GAACCTTGGT CTTGTCAGCC GAGACACACC CGTCTCCTTT 
ACCTTCGACG GAAAGCCGAT GCAGGGCCTT CAGGGCGACA CGCTCGCCTC GGCGCTGCTC
GCCAACGGAC GGATGCTCGT CGGCCGCAGC TTCAAATATC ACCGGCCTCG TGGAATTTTG
ACCGCGGGAG CCGCCGAACC GAACGCACTC GTCACCATTG GCCATGGCGG CCGGACCGAG
CCGAATACGC GCGCGACGAT GCAGGAGCTC TACGAGGGTC TCGAGGCACA GAGCCAGAAC
CGCTGGCCCT CGCTCGATTT CGACCTGGGT GCATTGAACG GTATCCTGTC GCCCTTTCTC
GGCGCCGGCT TCTACTACAA GACTTTCATG TGGCCGGCGC CGCTCTGGGA GAAGCTCTAC
GAGCCGATCA TCCGCAAGGC GGCCGGCCTC GGCAAGGCAA GCTACGAGGC AGACCCCGAC
GCCTATGAGA AGAGCTGGGC GCATTGCGAC CTGCTCGTCA TCGGCGCCGG CCCGACGGGA
CTTGCCGCGG CGCTTACCGC CGGCCGCGCC GGTGCCCGGG TCATTCTCCT GGATGAGGGC
TCGCTCCCCG GCGGGTCGCT GCTGTTCGAG ACGGCGATGA TCGACGGCAA AGCGGCCGCT
CAATTCGCCC GTGACACAAG CGATGAATTG CGCTCGATGC CGAATGTCCG GTTCATGATG
CGCACCACCG CCTTCGGTTG GTACGACGGC AATGTTTTCG GGGCCGTCGA ACGGGTACAG
AAACATGTGC GGGAGCCGGT GCCCTCCCTG CCGGTCGAGC GGCTATGGCG CATCGCCACC
AAAAAGGCCC TGCTTGCGAG CGGCGCGGAA GAACGCCCGC TCGTCTTCGG CGGCAACGAT
CGTCCGGGCG TTATGATGGC AAGTGCGATG CGCGCCTACC TCAATCGATA CGGAGTCGCT
CCGGGCCGGG CGACCGCGAT CTTCACCACC AACGACAGCG GCTATACCCT TGCACGCGAT
CTAGAGGCGG CGGGCGTCGA CGTTGCTGCC ATAGTCGACA GCCGCCCGGC CGCAGGCGTG
GACTATCGCG GCAAAGCGCG CCTGATCCGG GAAGCTGTCG TTTGCGGCGT AACGGGCCGC
AAGGCAATCT CTGCGATCGA GGTCCATCGC GGCGATCGGA CGGAGTCGAT CGCGGTCGAT
GCGCTCGCAA TGGCGGGTGG TTTCGACCCG ATCATCCATC TTGCCTGCCA CCGGGGCGGC
AGGCCCGTCT GGTCGGCGGA AAAAACCGCC TTTCTCGCAC CTGGGAGCTT GAACGGCCTC
GAGGTTGCCG GCGGCGCAGC AGCGACTTCG GGGCTCGCGG CCTGCCTCGG AGAAGGCATT
GCCCAAGCTC AGGCTGCCCT CGAGGGCGTC GGCCTGCGGT GCCCGCCGAT GGACCTTCCG
AAGGTCGAGG GAGACGACGC TGCATATTCT TCAAATCCAC TGTGGTCGAT CCCTGGCGTC
AAGGACAAAG CCTTCATCGA TTTCCAGAAC GACGTTCATC TCAAGGATAT CGGGCTGGCC
GTCCGCGAAG GCTATGGACA TGTCGAGCTT GCCAAACGAT ACACCACCAC CGGCATGGCG
ACGGACCAGG GCAAGCTTTC CAACGTGAAT GCAATCGGAC TGATCGCCAA AGCACGCGGC
GTCTCGCCTG CCGAGGTCGG GACGACGACG TTCCGCCCTT TCTATACGCC GGTGTCCTTC
GGTGCGCTGA CCGGCGCACA TGCGGGACAT CATTTCCAGC CGGTCCGCAA GTCCCCCCTC
CATGACTGGG CGAAAAAGCA CGGCGCCGTC TTCGTCGAGA CGGGTCTCTG GTATCGCTCC
GCCTGGTTTC CGAAAAGCGG TGAACGGAGC TGGCGGGAGA GCGTCGAGCG AGAAGTGCTG
AACGTTCGCA AGAATGCCGG ACTTTGCGAC GTCTCGATGC TCGGCAAGAT CGAGTTATCC
GGAAGCGACG CCGCCGAATT CCTCAACCGC GTATATTGCA ACGCCTTCCT CAAACTGCCG
GTCGGAAAGG CCCGCTACGG GCTCATGCTG CGCGAAGACG GCTTCATTTA CGACGACGGC
ACGACGAGTC GCCTCGCCGA GAATCGCTTT TTCATGACGA CGACCACCGC CTATGCGGCC
GGCGTCATGA ACCATCTCGA GTTCTGCGCG CAGGTCCTCT GGCCCGAACT CGACGTCCGC
CTCGCCTCCG TCACCGACCA ATGGGCGCAG ATGGCTGTTG CCGGACCAAA GGCGCGCATG
ATCCTGCAGA AGATCGTCGA CGACGACATA TCCGACGCAG CCTTCCCGTT TCTCGCAGCG
AAGGAGGTCT CCCTGTTCGG CGGCGCCCTT CACGGCTGCC TGTTTCGAAT TTCATTCTCC
GGTGAGCTCG CCTACGAGAT AGCGGTGCCG GCCGGCTACG GCGAAAGCGT TGCCGACGCG
CTCCTGGACG CGGGGAAGGA CCACGGTATC ATGCCCTATG GCGTCGAGAC GCTTAGCGTC
CTGCGCATCG AAAAGGGCCA TGTGACGCAC AACGAGATCA ACGGCACGGT CGTTCCGGCC
GATCTGGGCT TCGGTAAAAT GGTGTCGGCC ACCAAGCTGG ATTTCATCGG CAAGGCGATG
CTCCAGCGCG AGGGGCTGGC CGCGTCCGGC AGGCCGCAAC TCGTGGGCGT CGTGCCGATC
GATCCGAAGC ACTCTTTCCG CAGCGGTTCG CATATTCTCG CCAAGGGAGC GGAAGCCACG
CTCGAGAACG ACGAGGGCTA TGTAACGTCG AGCGCCTACT CCCCGCATGT CGGATCGACC
ATTGCTCTGG CACTCGTCCA CAACGGGCAG AGCCGCCACG GCGAAGAGGT GCTGGTATGG
AGTGGCCTTC ACGGAGAATC CACGCCTGCG CGTCTGTGCC ACCCGGTTTT CTTCGACCCT
CAGAACGAGA GGCTCCATGT CTGA
 
Protein sequence
MSSYRLPNLG LVSRDTPVSF TFDGKPMQGL QGDTLASALL ANGRMLVGRS FKYHRPRGIL 
TAGAAEPNAL VTIGHGGRTE PNTRATMQEL YEGLEAQSQN RWPSLDFDLG ALNGILSPFL
GAGFYYKTFM WPAPLWEKLY EPIIRKAAGL GKASYEADPD AYEKSWAHCD LLVIGAGPTG
LAAALTAGRA GARVILLDEG SLPGGSLLFE TAMIDGKAAA QFARDTSDEL RSMPNVRFMM
RTTAFGWYDG NVFGAVERVQ KHVREPVPSL PVERLWRIAT KKALLASGAE ERPLVFGGND
RPGVMMASAM RAYLNRYGVA PGRATAIFTT NDSGYTLARD LEAAGVDVAA IVDSRPAAGV
DYRGKARLIR EAVVCGVTGR KAISAIEVHR GDRTESIAVD ALAMAGGFDP IIHLACHRGG
RPVWSAEKTA FLAPGSLNGL EVAGGAAATS GLAACLGEGI AQAQAALEGV GLRCPPMDLP
KVEGDDAAYS SNPLWSIPGV KDKAFIDFQN DVHLKDIGLA VREGYGHVEL AKRYTTTGMA
TDQGKLSNVN AIGLIAKARG VSPAEVGTTT FRPFYTPVSF GALTGAHAGH HFQPVRKSPL
HDWAKKHGAV FVETGLWYRS AWFPKSGERS WRESVEREVL NVRKNAGLCD VSMLGKIELS
GSDAAEFLNR VYCNAFLKLP VGKARYGLML REDGFIYDDG TTSRLAENRF FMTTTTAYAA
GVMNHLEFCA QVLWPELDVR LASVTDQWAQ MAVAGPKARM ILQKIVDDDI SDAAFPFLAA
KEVSLFGGAL HGCLFRISFS GELAYEIAVP AGYGESVADA LLDAGKDHGI MPYGVETLSV
LRIEKGHVTH NEINGTVVPA DLGFGKMVSA TKLDFIGKAM LQREGLAASG RPQLVGVVPI
DPKHSFRSGS HILAKGAEAT LENDEGYVTS SAYSPHVGST IALALVHNGQ SRHGEEVLVW
SGLHGESTPA RLCHPVFFDP QNERLHV