Gene Rleg_7231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_7231 
Symbol 
ID8022937 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012858 
Strand
Start bp655430 
End bp658387 
Gene Length2958 bp 
Protein Length985 aa 
Translation table11 
GC content65% 
IMG OID644834063 
Productsarcosine oxidase, alpha subunit family 
Protein accessionYP_002985197 
Protein GI241667113 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0404] Glycine cleavage system T protein (aminomethyltransferase) 
TIGRFAM ID[TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0118066 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.098262 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAGCT TCCGTCTTTC CTCCGGCGGC CGCATCGACC GCGCCACGAC GCTCGGTTTT 
ACTTTCGACG GCAGGCCACT CGAAGGCCAT CCGGGCGATA CACTCGCCTC GGCGCTGCTT
GCCAACGGCG TCCAGCTCGT CGGCCGCAGC TTCAAATATC ACCGCCCGCG CGGCATCCTG
ACGGCGGGCG CTGCCGAACC GAACGCGCTG GTAACGACGG GCAGCGGCGG GCGCACCGAG
CCGAATACGC GCGCCACGAT GATCGAGCTC TACCAGGGGC TGACGGCAAA GAGCCAGAAC
CGCTGGCCTT CGCTCGGCTT CGATGTCGGC GCGGTCAACG GGCTGCTGTC GCCCTTCCTC
AGTGCCGGCT TCTACTACAA GACCTTCATG TGGCCGGCGG CCCTCTGGGA AAAGCTCTAC
GAGCCGGTGA TCCGCAAGGC GGCGGGCCTC GGCAGGGCTT CCTACGAGGC CGATCCCGAT
AGCTACGAGA AATGCTGGGC GCATTGCGAT CTGCTGGTGA TCGGTGCCGG CCCGACCGGT
CTTGCCGCAG CGCTGACCGC CGGCCGCGCC GGTGCGCGCG TCATCCTCGC CGACGAGGGG
TCCGAACTTG GCGGCAGCCT GCTTTCCGAC AGCGGCACTA TTGACGGCAA GCCGGCGGAT
GCGCTTCTCG GCGAGCTGCT TGCCGAACTG GCGAGCCTGG AGAACGTGCG CCGCCTGCCG
CGCATGACGG TGTTCGGCTG GTATGACGAC AATGTCTTTG GCGCCGTCGA ACGGGTGCAG
AAGCATGTGG CGATGCCGGA TCCGGAACGC CCCGTGGAAC GGTTGTGGCG CATCGTCGCC
CGCCAGGCGA TCCTTGCAAC CGGCGTCGAG GAGCGTCCGC TGGTCTTCGG CGGCAACGAT
ATTCCCGGTG TGATGATGGC AGGCGCGATG CGCAGCTATC TCAATCGGCA GGCGGTCGCG
CCCGGCAGAA GCACGGTCAT CTTCACCACC AACGATGCCG GATATCGAAC CGCAGCCGAT
CTGGAGGCCG CCGGCCTCAC CGTCGCGGCG ATCGTCGACA GCCGCGCGGA TGCGGCCGCG
AGCTGGCAGG GCCGCGCCGA AGTGCTGAAG GGCGCCAGGG TCATCGATGC GCTTGGCGGC
AAGCGCCTGC GCGGCGTCAG CGTCGAAACC GCAACCGGCA CCCGCCGGAT CGAAGCCGAT
GCGCTAGCCA TGTCGGGCGG CTGGAGCCCG ATCATTCATC TTGCCTGCCA TCGCGGTGCG
AAACCGCAAT GGTCGGACGA AGCCTCGGCG TTTCTGGCGC CTGTCCAGCA GGAAGGTCTG
ACCGTTGCCG GCTCGGCTGC CGGCCTTGCG CAAACCGCGA CCTGCCTCGG CGATGGCGCC
GCAAAGGCAG TTGCCGCGTT GAAGGCGATC GGTTTCGCGG CGGCGGAACC GGCCTTTGCC
TTGGCGTCCG AGGCGGCTGC ACAGGCAAAG CCGCTCTGGT CGGTCAAGGG CTCGAAGGGC
AAGGCCTTCG TCGATTACCA GAACGACGTG CATCTCAAGG ATCTCGGCCT CGCCGTCCGC
GAAGGCTACG GTCATGTCGA GCTTGCCAAG CGCTACACCA CATCAGGCAT GGCGACCGAT
CAGGGCAAGC TTTCCAACAT CAACGCCATC GGCATCCTTG CCGAGGCGCG CGGCGTGTCG
CCCGCCGATG TGGGAACGAC GACTTTCCGG CCCTTCTATA CGCCGGTTTC CTTCGGCGCC
TTGACTGGGG CGTCGCGCGG CAAACATTTC CAGCCGGCGC GCAAATCGCC GCTGCATGGC
TGGGCTAAGA AGAATGGCGC GATCTTCGTC GAAACCGGCC TCTGGTACCG GTCCGCCTGG
TTTCCGCGCG CCGGAGAGAC GACATGGCGT GAAAGCGTCG ACCGCGAGGT GCTGAACATC
CGGAAGAATG CCGGCCTCTG CGACGTCTCG ACCCTCGGCA AGATCGAAAT ATCAGGGAGG
GATGCCGCGA CCTTCCTCGA TCGCATCTAT TGCAACGGCT TCGCCAAGCT CGCTGTCGGC
AAGGCGCGTT ACGGCATCAT GCTGCGTGAG GACGGCTTCA TCTATGATGA CGGCACCACC
AGCCGGTTCA GTGACGAGCA TTTCTTCATG ACGACGACGA CCGCACTTGC CGCCGGCGTG
TTGACCCATC TCGAATTCTG CGCCCAGACG CTCTGGCCGG AACTCGACGT CTGCTTCGCC
TCCTCGACCG ATCAATGGGC GCAGATGGCC GTTGCCGGGC CGAAGTCGCG CGCCATTCTT
CAGGAGATCG TCGACGAGGA TTTGTCGGAT GCAGCCTTCC CCTTCATGAG CGCGCGCAAG
GTCTCCCTGT TCGGCGGCCA ACTTGAAGGG CGGCTCTTCC GGATTTCCTT CTCGGGCGAA
CTCGCCTACG AGCTGGCGGT GCCCGCCGGC TACGGCGAAG GCGTTGCGGA TGCGATCATG
GCGGCGGGCG AGAAACATGG CATCTGCGCC TATGGCGCCG AAGCGCTCGG CGTCATGCGC
ATCGAAAAGG GCCATGTCAC CCATGCCGAA ATCAACGGCA CGGTGACGCC TGGCGACCTC
GGTTTCGGCC GCATGGTTTC ATCAACCAAG CCGGATTTCA TCGGCAAGGC ATTGCTTGCC
CGCGAAGGGC TGCAAGATCC CGACCGGCAG CGCCTTGTCG GCGTCAAGCC GCTTAATCCG
GCGACCGGTT TCCGCACCGG CTCGCATATT CTCGCCGACG GCGTTGCGGC GACCCTCGAA
AACGACCAAG GCTACGTCAC TTCAAGCGCC TTTTCGCCAG GGCTCGGCCA TACGATCGGC
CTGGCGCTCG TCAGGCGTGG GCCGGAGCGC ATCGGTGAAA AGGTGACGGT CTGGAACGGC
CTGCGCAACG AATTCACCGA TGCGGTGCTC TGCCATCCCG TCTTCATCGA TCCCGAAAAC
GAGAAGCTCC ATGCGTGA
 
Protein sequence
MTSFRLSSGG RIDRATTLGF TFDGRPLEGH PGDTLASALL ANGVQLVGRS FKYHRPRGIL 
TAGAAEPNAL VTTGSGGRTE PNTRATMIEL YQGLTAKSQN RWPSLGFDVG AVNGLLSPFL
SAGFYYKTFM WPAALWEKLY EPVIRKAAGL GRASYEADPD SYEKCWAHCD LLVIGAGPTG
LAAALTAGRA GARVILADEG SELGGSLLSD SGTIDGKPAD ALLGELLAEL ASLENVRRLP
RMTVFGWYDD NVFGAVERVQ KHVAMPDPER PVERLWRIVA RQAILATGVE ERPLVFGGND
IPGVMMAGAM RSYLNRQAVA PGRSTVIFTT NDAGYRTAAD LEAAGLTVAA IVDSRADAAA
SWQGRAEVLK GARVIDALGG KRLRGVSVET ATGTRRIEAD ALAMSGGWSP IIHLACHRGA
KPQWSDEASA FLAPVQQEGL TVAGSAAGLA QTATCLGDGA AKAVAALKAI GFAAAEPAFA
LASEAAAQAK PLWSVKGSKG KAFVDYQNDV HLKDLGLAVR EGYGHVELAK RYTTSGMATD
QGKLSNINAI GILAEARGVS PADVGTTTFR PFYTPVSFGA LTGASRGKHF QPARKSPLHG
WAKKNGAIFV ETGLWYRSAW FPRAGETTWR ESVDREVLNI RKNAGLCDVS TLGKIEISGR
DAATFLDRIY CNGFAKLAVG KARYGIMLRE DGFIYDDGTT SRFSDEHFFM TTTTALAAGV
LTHLEFCAQT LWPELDVCFA SSTDQWAQMA VAGPKSRAIL QEIVDEDLSD AAFPFMSARK
VSLFGGQLEG RLFRISFSGE LAYELAVPAG YGEGVADAIM AAGEKHGICA YGAEALGVMR
IEKGHVTHAE INGTVTPGDL GFGRMVSSTK PDFIGKALLA REGLQDPDRQ RLVGVKPLNP
ATGFRTGSHI LADGVAATLE NDQGYVTSSA FSPGLGHTIG LALVRRGPER IGEKVTVWNG
LRNEFTDAVL CHPVFIDPEN EKLHA