Gene Rleg2_5633 details

Gene Information

Locus tag: Rleg2_5633
Symbol:
ID: 6977024
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011366
Strand:
Start bp: 19715
End bp: 22669
Gene Length: 2955 bp
Protein Length: 984 aa
Translation table: 11
GC content: 66%
IMG OID: 643393090
Product: sarcosine oxidase, alpha subunit family
Protein accession: YP_002277908
Protein GI: 209546018
COG category: [E] Amino acid transport and metabolism
              [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0404] Glycine cleavage system T protein (aminomethyltransferase)
        [COG0492] Thioredoxin reductase
TIGRFAM ID: [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form
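
The Start bp, End bp, Gene Length and Protein Length values above are mutually consistent, which a short sketch can confirm. The function names are illustrative and not part of any database API. The sketch assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(19715, 22669)
print(length)                  # 2955
print(protein_length(length))  # 984
```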


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGAGCT TCCGTCTTTC CTCCGGCGGT CGCATCGACC GCGCCACGAC ACTCGGCTTC 
ACTTTCGACG GCAAGGCGCT CGCAGGCCAT CCGGGCGATA CGCTCGCCTC GGCGCTGCTT
GCCAACGGCG TCCAGCTTGT CGGCCGCAGT TTCAAATATC ACCGCCCGCG CGGCATCCTG
ACGGCAGGTG CCGCCGAACC GAACGCGTTG GTGACGACAG GCAGCGGCGG GCGCACCGAG
CCGAATACGC GCGCGACGAT GATCGAGCTT TACGAGGGGC TGGCGGCTAA GAGCCAGAAC
CGCTGGCCGT CGCTCGGTTT CGATGTCGGC GCCGTCAACG GCTTTCTCTC GCCTTTCCTC
AGCGCCGGTT TCTACTACAA GACCTTCATG TGGCCGGCGC CCCTCTGGGA AAAGCTCTAT
GAGCCGGTGA TCCGCAAGGC GGCGGGCCTC GGCAGAGCCT CCTACGAGGC GGATCCCGAT
ATTTATGAGA AATGCTGGGC GCATTGCGAT CTGCTGGTGA TCGGCGCCGG CCCTGCCGGC
CTTGCCGCGG CACTGACCGC CGGCCGCGCC GGCGCGCGCG TCATCCTCGC CGACGAAGGC
CCGGAACTCG GCGGCAGCCT GCTTTCCCGC GGCACTGTCG AGGACAAGCC GGCGGACGCG
CTTCTCAACG AGCTGCTGGC AGAACTCGAA GAGCTGGAAA ATGTGCGCCG CCTGCCGCGC
ATGACAGTCT TCGGCTGGTA CGACGACAAT GTCTTCGGCG CCGTCGAACG CGTGCAGAAA
CATGTGGCGC TGCCCGACCC GGAGCGCCCG GTCGAACGCT TGTGGCGCAT CGTCGCCCGC
CAGGCGATCC TGGCGACAGG GGCCGAGGAA CGGCCGCTCG TCTTCGGCGG CAACGATATA
CCAGGCGTAA TGATGGCGGG CGCCATGCGC AGCTATCTCA ACCGACAGGC TGTGGCGCCC
GGCAAAAGCA CTGTCATCTT CACCACCAAC GATGCCGGTT ACCGCACTGC CGCCGATTTG
GAAGCCGCCG GCCTCGCCGT CGCGGCGATC GTCGACAGCC GCGCCGATGC CGGCAAAGGC
TGGCAGGGCC GCGCCGAAGT GTTGAAGGGT GCCCGGGTGA TCGATGCGTT CGGCGGCCAG
CGCCTGCGCG GCGTCAGCGT TGAAACCACC GGCGGTATCC GCCGCATCGA AGCCGATGCG
CTTGCCATGT CGGGCGGCTG GAGCCCGATC ATCCATCTTG CCTGCCATCG CGGCGGCAAG
CCGCAATGGT CGGACGAGGC GGCGGCGTTT CTGGCGCCGG CCGCACAGGA CGGTCTTGCC
GTTGCCGGTT CGGCTGCCGG GCTTGCGCAA ACCACGGCCT GCCTTGGCGA TGGCGCCGCA
AAGGCTGCGG CCGCCCTGAA GGCGATCGGC TTTGCAGCGG CGGAACCGGC CTTTGCGGTT
GCCCCGGAAG CGGCCCAAGG CTCAAAGCCG CTCTGGTCCG TCAAGGGCTC GAAGGGCAAG
GCTTTCGTCG ACTATCAGAA CGACGTGCAT CTCAAGGACC TCGGCCTTGC CGTGCGCGAA
GGCTACGGTC ATGTCGAGCT TGCCAAGCGT TATACAACAT CAGGCATGGC GACCGACCAG
GGCAAGCTTT CCAACATCAA CGCCATCGGC ATTCTCGCTG AGGCGCGCGG CGTGTCGCCC
GCCGAGGTGG GAACGACGAC CTTCCGGCCT TTCTATACGC CGGTCTCCTT TGGTGCTTTG
ACCGGGGCAT CACGCGGCAA GGATTTCCAG CCGGCGCGCA AATCGCCGCT GCACGGCTGG
GCTAAGAAGA ACGGCGCGGT CTTCGTTGAA ACCGGCCTCT GGTACCGTTC CTCCTGGTTC
CCGCGCGCAG GTGAGGCGAC ATGGCGCGAC AGCGTCGACC GCGAAGTGCT GAACATCCGC
AAAAACGCCG GCCTCTGCGA CGTCTCGACC CTCGGTAAGA TCGAAATCTT CGGCCGCGAT
GCCGCGACCT TCCTCGACCG GATCTATTGC AACGGCTTCG CCAAGCTTGC TCTGGGCAAG
GCGCGTTATG GCATCATGCT GCGCGAAGAC GGCTTCATCT ATGACGACGG CACCACCAGC
CGCTTCGGCG ACGAGCATTT CTTCATGACG ACGACGACGG CGCTTGCCGC CGGTGTGCTG
ACCCATCTCG AATTCTGCGC CCAGACGCTC TGGCCGGAGC TCGACGTCTG CTTCGCCTCC
TCGACCGATC AATGGGCGCA GATGGCCATT GCCGGGCCGA AGTCCCGCGC CATCCTCGCC
GAGATCGTCG ACGAGGATTT GTCCGATGCG GCCTTCCCCT TCATGAGCGC CCGCAAGGTT
TCGCTCTTCG GCGGCCGTCT CGAAGGCCGG CTCTTCCGGA TCTCCTTCTC CGGCGAACTC
GCCTACGAGC TGGCCGTGCC GGCTGGCTAC GGTGAGAGCG TTGCCGACGC CGTCATGGCA
TCAGGCGAAA AACACGGCAT CTGCGCCTAT GGCGCCGAAG CGCTCGGCGT CCTGCGCATC
GAAAAGGGCC ACGTCACCCA TGCCGAGATC AACGGCACGG TCACCCCGGG CGATCTCGGT
TTCGGCCGCA TGGTTTCATC GACCAAGCCG GATTTCATCG GCAAGGCGAT GCTCGCCCGC
GAAGGTCTGC AGGATCCCGA GCGCCCACGC CTCGTCGGCG TCAAGCCGCT CAATCCGGCA
AGCGGCTTCC GCACCGGCTC GCATATTCTC GCCGAGGGCG CCGCGGCGAC GCTCGAAAAC
GACCAGGGCT ACATCTCCTC AAGCGCCTTT TCGCCGACGC TCGGCCATAC GATCGGCCTA
GCGCTGGTCA GGCGCGGGCC GGAGCGCATC GGCGAAAAGG TGACGGTCTG GAACGGCCTG
CGCAATGAAT TCACCGATGC GGTGCTCTGC CATCCCGTCT TCATCGATCC CGAAAACGAG
AAGCTCCATG CCTGA
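
The gene sequence is printed in blocks of 10 bases, so it needs to be cleaned before any computation. The following is a minimal sketch of how one might strip the spacing and compute a GC percentage comparable to the GC content field. The helper names are hypothetical, and only the first sequence line is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence as shown on the page (6 blocks of 10 bases).
first_line = "ATGACGAGCT TCCGTCTTTC CTCCGGCGGT CGCATCGACC GCGCCACGAC ACTCGGCTTC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Pasting the full gene sequence into `clean_sequence` should give a string of 2955 bases, matching the Gene Length field.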
 
Protein sequence
MTSFRLSSGG RIDRATTLGF TFDGKALAGH PGDTLASALL ANGVQLVGRS FKYHRPRGIL 
TAGAAEPNAL VTTGSGGRTE PNTRATMIEL YEGLAAKSQN RWPSLGFDVG AVNGFLSPFL
SAGFYYKTFM WPAPLWEKLY EPVIRKAAGL GRASYEADPD IYEKCWAHCD LLVIGAGPAG
LAAALTAGRA GARVILADEG PELGGSLLSR GTVEDKPADA LLNELLAELE ELENVRRLPR
MTVFGWYDDN VFGAVERVQK HVALPDPERP VERLWRIVAR QAILATGAEE RPLVFGGNDI
PGVMMAGAMR SYLNRQAVAP GKSTVIFTTN DAGYRTAADL EAAGLAVAAI VDSRADAGKG
WQGRAEVLKG ARVIDAFGGQ RLRGVSVETT GGIRRIEADA LAMSGGWSPI IHLACHRGGK
PQWSDEAAAF LAPAAQDGLA VAGSAAGLAQ TTACLGDGAA KAAAALKAIG FAAAEPAFAV
APEAAQGSKP LWSVKGSKGK AFVDYQNDVH LKDLGLAVRE GYGHVELAKR YTTSGMATDQ
GKLSNINAIG ILAEARGVSP AEVGTTTFRP FYTPVSFGAL TGASRGKDFQ PARKSPLHGW
AKKNGAVFVE TGLWYRSSWF PRAGEATWRD SVDREVLNIR KNAGLCDVST LGKIEIFGRD
AATFLDRIYC NGFAKLALGK ARYGIMLRED GFIYDDGTTS RFGDEHFFMT TTTALAAGVL
THLEFCAQTL WPELDVCFAS STDQWAQMAI AGPKSRAILA EIVDEDLSDA AFPFMSARKV
SLFGGRLEGR LFRISFSGEL AYELAVPAGY GESVADAVMA SGEKHGICAY GAEALGVLRI
EKGHVTHAEI NGTVTPGDLG FGRMVSSTKP DFIGKAMLAR EGLQDPERPR LVGVKPLNPA
SGFRTGSHIL AEGAAATLEN DQGYISSSAF SPTLGHTIGL ALVRRGPERI GEKVTVWNGL
RNEFTDAVLC HPVFIDPENE KLHA
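
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is one way to do this, not the database's own procedure. It builds the codon table from the NCBI base order `TCAG`. Translation table 11 assigns the same amino acids to codons as the standard table and differs only in the permitted start codons, which does not matter here because the gene starts with ATG. The function name `translate` is an illustrative choice.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (first, second, third base over TCAG).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 20 bases of the gene (a trailing partial codon is ignored).
print(translate("ATGACGAGCT TCCGTCTTTC"))  # MTSFRL
# Last 15 bases of the gene, which are in frame and end in the TGA stop codon.
print(translate("AAGCTCCATG CCTGA"))       # KLHA
```

Both outputs agree with the start (MTSFRL) and end (KLHA) of the protein sequence listed above.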