Gene Rleg_3658 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3658 
Symbol 
ID8014504 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3702709 
End bp3705702 
Gene Length2994 bp 
Protein Length997 aa 
Translation table11 
GC content64% 
IMG OID644826221 
Productsarcosine oxidase, alpha subunit family 
Protein accessionYP_002977440 
Protein GI241206344 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0404] Glycine cleavage system T protein (aminomethyltransferase) 
TIGRFAM ID[TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.259851 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.147827 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGCG TGAACCGTAT CGCAGGCAAG GGGCGCCTGA CACCGGCCAG GACCGCGCGC 
TTCAGCTTCG ACGGCAAGAG TTATACGGCG CTCGAAGGTG ATACCGTCGC CTCGGCGCTG
ATCGCCAACG GCGTGCATCT CATCGGCCGT TCGTTCAAAT ATCACCGGGC CCGCGGTATT
CTGTCGGCAG GGGCCGAAGA GCCGAACGCG CTGATCGACG TTTCCCGCGA TACGGCGCGC
AAGCAGCCGA ACGTGCGCGC CACCGTGCAG GAAGTCTTCG ACGGCATGAT CGTCAGCTCG
CAAAACCGCT GGCCCTCGCT TGCCTTCGAC GTCGGTGCGG TCAACAACCT GATGTCGCCG
TTCTTTGCCG CTGGTTTCTA CTACAAGACC TTCATGTGGC CGCGCGCCGC CTGGAAACAC
GTCTACGAAC CGCTCATCCG TCGTGCCGCC GGCCTCGGCG TCGCACCGAC GGAGGAGGAT
CCGGACCATT ATGCCAGCCG CTATGCCCAT TGCGACGTGC TGGTGGTCGG CGCCGGTGTT
GCCGGGCTTT CGGCGGCGCT GGCTGCGGCC GAGACCGGCG CCCGCGTGAT CCTTTGCGAC
GAGCAGGCGG AAGCGGGCGG GGCATTGCGC TACGATGCCG GCGTGAGGAT CGACGGACAG
GACGGCAATA GCTGGGCACA GAAGGCCGTG GCGCGACTGA AGGCGATGGA TAATGTCGAA
GTGCTCACCC GCACGACCGC CTTCGGCTAC TACAACCACA ATTTCGTCGG TCTTGCCGAG
CGCGTCACCG ATCATATTGC CAAGCCTTCC CGCGATCTGC CGCGCGAGCG GCTTTGGCAG
GTCCGGGCCA AGCGGGTGAT CCTTGCCACC GGCGCGATCG AGCGGCACAT GGTGTTTCCG
AACAACGACC GGCCGGGCAT CATGCTCGCC TCGGCGGGGC GGATGTATCT CAATCATTAC
GGCGTTGCGG TCGGGGCCAA AGTCGGCATC TACACGGCGC ACGACTCGGC CTATGAGGCG
GCTTTCGACC TGAAACGATC CGGTGTTTCG ATCGCCGCCA TCGTCGATTG CAGGCAGACG
CCGGGAGCGG CGGTGCTGGA GGAGGCGCGT ACGCTCGGCA TCGATGTTCT GGCCGGCCAA
TCGGTCGTCA ACACCTCGGG ACGGCTGCGC ATCTCGTCGA TGACGGTGGC GCGCAACGGC
GGCGGTTCGC CGCGCAAGAT CGCGGTCGAT GCGCTGCTGG TTTCGGCCGG CTGGACGCCG
TCCGTCCATC TGTTCTCGCA GTCGCGCGGC AAAGTGGCCT TCGATGCCGA AAGCCAGCGT
TTCCTGCCCG GCTCCTATGC GCAGGACTGC CTCTCGGTCG GCGCGTGCAA CGGCACCGAC
GACCTGCAGC GGACGATCGA GGAGTCGCTT GCCGCCGGCG AACTGATGGC GCAGGCCACC
GGCAGAAGCA GTGGCGAGAA GATCGCAATT TCGGCCGAAC AGGCCTATGA CTGGACGGGC
GGCATGATTG GTGCTGCCGA AGGCGCCGGG CCGAAGACCA ACGCCAAGGC CTTTATCGAT
TTCCAGCATG ACGTCTGCGC CAAGGATATC CGCCTTGCCG TGCGCGAGGG CATGCATTCG
ATCGAGCATA TCAAGCGCTT CACGACCAAC GGCATGGCCT CGGACCAGGG CAAGCTCTCC
AACATGCATG GCCTGGCGAT TGCCGCTGAA ATGCTCGGCA AGGAAATCCC GCAGGTCGGC
CTCACCACCT TCCGTGCGCC CTATACGCCG GTTACCTACG GTACGCTGAT CGGTCATTCG
CGCGGAGAGC TGTTCGATCC GACGCGCAAG ACGCCGCTGC ATGCCTGGGA AGAAGCCCAT
GGCGCGGTCT TCGAGGATGT CGGCAACTGG AAGCGTGCCT GGTTCTATCC GCAGGCCGGC
GAGACCATGC ATCAGGCGGT GGCTCGCGAA TGCCGGACGG CACGCGAGGC GGCCGGTATC
TTCGACGCCT CGACACTCGG CAAGATCGAG GTGGTGGGGC CGGATGCGGC GGAATTTCTC
AACCTCATCT ACACCAATGC CTGGGACACG CTGAAGCCCG GCAAGGCCCG CTACGGTATC
ATGACCCGCG AGGACGGTTT CGTTTATGAC GACGGCGTTG TCGGACGCCT GGCGGACGAC
CGTTTCCATG TGACGACGAC CACCGGCGGC GCGCCGCGTG TTCTCCATCA CATGGAAGAT
TACCTGCAGA CGGAATTCCC GCATCTGAAG GTATGGCTGA CTTCGGTGAC CGAACAATGG
GCTGTCATCG CCGTGCAGGG ACCGAAGGCG CGCGAGATCG TCGCGCCGCT GGTCGAAGGG
CTCGATCTTT CGAACGAGGC CTTCCCGCAT ATGAGCGTTG CCGAGTGCAC GGTTTGCGGC
GTGCCGGCGC GGCTCTTCCG CGTCTCTTTC ACCGGTGAAA CCGGCTTCGA AATCAATGTG
CCGGCCGATT ACGGCCAGTC GGTTCTCGAA GCGGTCTGGG CCAATGCTGA GCCGCTCGGC
GCCTGCGTCT ACGGCACGGA GACGATGCAC GTTCTTCGCG CCGAGAAGGG TTACATCATC
GTCGGGCAGG ATACCGATGG GACGGTGACC CCCGATGACG CCGGACTTTC CTGGGCGGTT
TCGAAGAAAA AGACGGATTT CGTCGGCATC CGCGGGTTGA AGCGGCCGGA TCTCGTCAAG
GACGGGCGCA AACAGCTCGT CGGCCTCGTC ACCAAGGACC CGAAGCTGGT GCTCGAAGAA
GGCGCGCAGA TCGTCGCGAG CCCGAACGAG CCGAAGCCGA TGACCATGCT CGGCCACGTC
ACATCAGCCT ATTGGTCGGA CAATTGCGGC AGGTCGATTG CCTTCGCGCT CGTCGCCGGC
GGCCGGGCGC GGATGGGCGA AACGCTCTAT GTGCCGATGC CGGACCGGAC GATCGCCGTA
GACGTGACGG ATCTGGTATT CTTTGACAAG GAAGGGGGGC GCATCCATGG CTGA
 
Protein sequence
MSGVNRIAGK GRLTPARTAR FSFDGKSYTA LEGDTVASAL IANGVHLIGR SFKYHRARGI 
LSAGAEEPNA LIDVSRDTAR KQPNVRATVQ EVFDGMIVSS QNRWPSLAFD VGAVNNLMSP
FFAAGFYYKT FMWPRAAWKH VYEPLIRRAA GLGVAPTEED PDHYASRYAH CDVLVVGAGV
AGLSAALAAA ETGARVILCD EQAEAGGALR YDAGVRIDGQ DGNSWAQKAV ARLKAMDNVE
VLTRTTAFGY YNHNFVGLAE RVTDHIAKPS RDLPRERLWQ VRAKRVILAT GAIERHMVFP
NNDRPGIMLA SAGRMYLNHY GVAVGAKVGI YTAHDSAYEA AFDLKRSGVS IAAIVDCRQT
PGAAVLEEAR TLGIDVLAGQ SVVNTSGRLR ISSMTVARNG GGSPRKIAVD ALLVSAGWTP
SVHLFSQSRG KVAFDAESQR FLPGSYAQDC LSVGACNGTD DLQRTIEESL AAGELMAQAT
GRSSGEKIAI SAEQAYDWTG GMIGAAEGAG PKTNAKAFID FQHDVCAKDI RLAVREGMHS
IEHIKRFTTN GMASDQGKLS NMHGLAIAAE MLGKEIPQVG LTTFRAPYTP VTYGTLIGHS
RGELFDPTRK TPLHAWEEAH GAVFEDVGNW KRAWFYPQAG ETMHQAVARE CRTAREAAGI
FDASTLGKIE VVGPDAAEFL NLIYTNAWDT LKPGKARYGI MTREDGFVYD DGVVGRLADD
RFHVTTTTGG APRVLHHMED YLQTEFPHLK VWLTSVTEQW AVIAVQGPKA REIVAPLVEG
LDLSNEAFPH MSVAECTVCG VPARLFRVSF TGETGFEINV PADYGQSVLE AVWANAEPLG
ACVYGTETMH VLRAEKGYII VGQDTDGTVT PDDAGLSWAV SKKKTDFVGI RGLKRPDLVK
DGRKQLVGLV TKDPKLVLEE GAQIVASPNE PKPMTMLGHV TSAYWSDNCG RSIAFALVAG
GRARMGETLY VPMPDRTIAV DVTDLVFFDK EGGRIHG