Gene Information |
Locus tag | Rleg2_5633 |
Symbol | |
ID | 6977024 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011366 |
Strand | - |
Start bp | 19715 |
End bp | 22669 |
Gene Length | 2955 bp |
Protein Length | 984 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643393090 |
Product | sarcosine oxidase, alpha subunit family |
Protein accession | YP_002277908 |
Protein GI | 209546018 |
COG category | [E] Amino acid transport and metabolism; [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase); [COG0492] Thioredoxin reductase |
TIGRFAM ID | [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
|
|
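The coordinate and length fields in the Gene Information table can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive (the record does not state the convention) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 19715
end_bp = 22669

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 2955 984
```

Under these assumptions both results agree with the Gene Length (2955 bp) and Protein Length (984 aa) fields.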
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAGCT TCCGTCTTTC CTCCGGCGGT CGCATCGACC GCGCCACGAC ACTCGGCTTC ACTTTCGACG GCAAGGCGCT CGCAGGCCAT CCGGGCGATA CGCTCGCCTC GGCGCTGCTT GCCAACGGCG TCCAGCTTGT CGGCCGCAGT TTCAAATATC ACCGCCCGCG CGGCATCCTG ACGGCAGGTG CCGCCGAACC GAACGCGTTG GTGACGACAG GCAGCGGCGG GCGCACCGAG CCGAATACGC GCGCGACGAT GATCGAGCTT TACGAGGGGC TGGCGGCTAA GAGCCAGAAC CGCTGGCCGT CGCTCGGTTT CGATGTCGGC GCCGTCAACG GCTTTCTCTC GCCTTTCCTC AGCGCCGGTT TCTACTACAA GACCTTCATG TGGCCGGCGC CCCTCTGGGA AAAGCTCTAT GAGCCGGTGA TCCGCAAGGC GGCGGGCCTC GGCAGAGCCT CCTACGAGGC GGATCCCGAT ATTTATGAGA AATGCTGGGC GCATTGCGAT CTGCTGGTGA TCGGCGCCGG CCCTGCCGGC CTTGCCGCGG CACTGACCGC CGGCCGCGCC GGCGCGCGCG TCATCCTCGC CGACGAAGGC CCGGAACTCG GCGGCAGCCT GCTTTCCCGC GGCACTGTCG AGGACAAGCC GGCGGACGCG CTTCTCAACG AGCTGCTGGC AGAACTCGAA GAGCTGGAAA ATGTGCGCCG CCTGCCGCGC ATGACAGTCT TCGGCTGGTA CGACGACAAT GTCTTCGGCG CCGTCGAACG CGTGCAGAAA CATGTGGCGC TGCCCGACCC GGAGCGCCCG GTCGAACGCT TGTGGCGCAT CGTCGCCCGC CAGGCGATCC TGGCGACAGG GGCCGAGGAA CGGCCGCTCG TCTTCGGCGG CAACGATATA CCAGGCGTAA TGATGGCGGG CGCCATGCGC AGCTATCTCA ACCGACAGGC TGTGGCGCCC GGCAAAAGCA CTGTCATCTT CACCACCAAC GATGCCGGTT ACCGCACTGC CGCCGATTTG GAAGCCGCCG GCCTCGCCGT CGCGGCGATC GTCGACAGCC GCGCCGATGC CGGCAAAGGC TGGCAGGGCC GCGCCGAAGT GTTGAAGGGT GCCCGGGTGA TCGATGCGTT CGGCGGCCAG CGCCTGCGCG GCGTCAGCGT TGAAACCACC GGCGGTATCC GCCGCATCGA AGCCGATGCG CTTGCCATGT CGGGCGGCTG GAGCCCGATC ATCCATCTTG CCTGCCATCG CGGCGGCAAG CCGCAATGGT CGGACGAGGC GGCGGCGTTT CTGGCGCCGG CCGCACAGGA CGGTCTTGCC GTTGCCGGTT CGGCTGCCGG GCTTGCGCAA ACCACGGCCT GCCTTGGCGA TGGCGCCGCA AAGGCTGCGG CCGCCCTGAA GGCGATCGGC TTTGCAGCGG CGGAACCGGC CTTTGCGGTT GCCCCGGAAG CGGCCCAAGG CTCAAAGCCG CTCTGGTCCG TCAAGGGCTC GAAGGGCAAG GCTTTCGTCG ACTATCAGAA CGACGTGCAT CTCAAGGACC TCGGCCTTGC CGTGCGCGAA GGCTACGGTC ATGTCGAGCT TGCCAAGCGT TATACAACAT CAGGCATGGC GACCGACCAG GGCAAGCTTT CCAACATCAA CGCCATCGGC ATTCTCGCTG AGGCGCGCGG CGTGTCGCCC GCCGAGGTGG GAACGACGAC CTTCCGGCCT TTCTATACGC CGGTCTCCTT TGGTGCTTTG ACCGGGGCAT CACGCGGCAA GGATTTCCAG CCGGCGCGCA AATCGCCGCT GCACGGCTGG 
GCTAAGAAGA ACGGCGCGGT CTTCGTTGAA ACCGGCCTCT GGTACCGTTC CTCCTGGTTC CCGCGCGCAG GTGAGGCGAC ATGGCGCGAC AGCGTCGACC GCGAAGTGCT GAACATCCGC AAAAACGCCG GCCTCTGCGA CGTCTCGACC CTCGGTAAGA TCGAAATCTT CGGCCGCGAT GCCGCGACCT TCCTCGACCG GATCTATTGC AACGGCTTCG CCAAGCTTGC TCTGGGCAAG GCGCGTTATG GCATCATGCT GCGCGAAGAC GGCTTCATCT ATGACGACGG CACCACCAGC CGCTTCGGCG ACGAGCATTT CTTCATGACG ACGACGACGG CGCTTGCCGC CGGTGTGCTG ACCCATCTCG AATTCTGCGC CCAGACGCTC TGGCCGGAGC TCGACGTCTG CTTCGCCTCC TCGACCGATC AATGGGCGCA GATGGCCATT GCCGGGCCGA AGTCCCGCGC CATCCTCGCC GAGATCGTCG ACGAGGATTT GTCCGATGCG GCCTTCCCCT TCATGAGCGC CCGCAAGGTT TCGCTCTTCG GCGGCCGTCT CGAAGGCCGG CTCTTCCGGA TCTCCTTCTC CGGCGAACTC GCCTACGAGC TGGCCGTGCC GGCTGGCTAC GGTGAGAGCG TTGCCGACGC CGTCATGGCA TCAGGCGAAA AACACGGCAT CTGCGCCTAT GGCGCCGAAG CGCTCGGCGT CCTGCGCATC GAAAAGGGCC ACGTCACCCA TGCCGAGATC AACGGCACGG TCACCCCGGG CGATCTCGGT TTCGGCCGCA TGGTTTCATC GACCAAGCCG GATTTCATCG GCAAGGCGAT GCTCGCCCGC GAAGGTCTGC AGGATCCCGA GCGCCCACGC CTCGTCGGCG TCAAGCCGCT CAATCCGGCA AGCGGCTTCC GCACCGGCTC GCATATTCTC GCCGAGGGCG CCGCGGCGAC GCTCGAAAAC GACCAGGGCT ACATCTCCTC AAGCGCCTTT TCGCCGACGC TCGGCCATAC GATCGGCCTA GCGCTGGTCA GGCGCGGGCC GGAGCGCATC GGCGAAAAGG TGACGGTCTG GAACGGCCTG CGCAATGAAT TCACCGATGC GGTGCTCTGC CATCCCGTCT TCATCGATCC CGAAAACGAG AAGCTCCATG CCTGA
|
Protein sequence | MTSFRLSSGG RIDRATTLGF TFDGKALAGH PGDTLASALL ANGVQLVGRS FKYHRPRGIL TAGAAEPNAL VTTGSGGRTE PNTRATMIEL YEGLAAKSQN RWPSLGFDVG AVNGFLSPFL SAGFYYKTFM WPAPLWEKLY EPVIRKAAGL GRASYEADPD IYEKCWAHCD LLVIGAGPAG LAAALTAGRA GARVILADEG PELGGSLLSR GTVEDKPADA LLNELLAELE ELENVRRLPR MTVFGWYDDN VFGAVERVQK HVALPDPERP VERLWRIVAR QAILATGAEE RPLVFGGNDI PGVMMAGAMR SYLNRQAVAP GKSTVIFTTN DAGYRTAADL EAAGLAVAAI VDSRADAGKG WQGRAEVLKG ARVIDAFGGQ RLRGVSVETT GGIRRIEADA LAMSGGWSPI IHLACHRGGK PQWSDEAAAF LAPAAQDGLA VAGSAAGLAQ TTACLGDGAA KAAAALKAIG FAAAEPAFAV APEAAQGSKP LWSVKGSKGK AFVDYQNDVH LKDLGLAVRE GYGHVELAKR YTTSGMATDQ GKLSNINAIG ILAEARGVSP AEVGTTTFRP FYTPVSFGAL TGASRGKDFQ PARKSPLHGW AKKNGAVFVE TGLWYRSSWF PRAGEATWRD SVDREVLNIR KNAGLCDVST LGKIEIFGRD AATFLDRIYC NGFAKLALGK ARYGIMLRED GFIYDDGTTS RFGDEHFFMT TTTALAAGVL THLEFCAQTL WPELDVCFAS STDQWAQMAI AGPKSRAILA EIVDEDLSDA AFPFMSARKV SLFGGRLEGR LFRISFSGEL AYELAVPAGY GESVADAVMA SGEKHGICAY GAEALGVLRI EKGHVTHAEI NGTVTPGDLG FGRMVSSTKP DFIGKAMLAR EGLQDPERPR LVGVKPLNPA SGFRTGSHIL AEGAAATLEN DQGYISSSAF SPTLGHTIGL ALVRRGPERI GEKVTVWNGL RNEFTDAVLC HPVFIDPENE KLHA
|
| |
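The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of plain Python. This is a minimal illustration, not the method the database itself uses. It builds the amino acid assignments of translation table 11 (identical to the standard code for elongation), strips the spacing used in the record, and translates up to the first stop codon. It assumes the sequence is listed in coding orientation, which fits the ATG at its start and the TGA at its end even though the gene lies on the minus strand. Alternative start codons allowed by table 11 are not handled, and the helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Codon order used by NCBI genetic code tables: first base varies slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
prefix = "ATGACGAGCT TCCGTCTTTC CTCCGGCGGT"
print(translate(prefix))  # MTSFRLSSGG
```

The translated prefix matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_percent` in the same way would allow a comparison against the Protein sequence and GC content fields.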