Gene Rleg_3661 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3661 
Symbol 
ID8014507 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3706454 
End bp3707707 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content63% 
IMG OID644826224 
Productsarcosine oxidase, beta subunit family 
Protein accessionYP_002977443 
Protein GI241206347 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0665] Glycine/D-amino acid oxidases (deaminating) 
TIGRFAM ID[TIGR01373] sarcosine oxidase, beta subunit family, heterotetrameric form 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.616865 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.13922 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGAAAT ATTCGGTTTT TGCCGTGGCA CGGGAGGCCC TTCGCGGCCA CAAGGGCTGG 
GAGAAGCAGT GGACTTCGCC TGAGCCGCGC GCCGAATACG ACGTCGTCAT CATTGGCGGC
GGCGGCCACG GGCTGGGCGC TGCCTACTAT CTCGCCAAGG AGCACGGCAT CACCAATGTG
GCGGTGATCG AGAAGGGCTG GCTCGGCGGC GGCAATACCG GGCGCAACAC CACCATCATC
CGCTCGAACT ATCTCTACGA AGAGAGCATG CACATTTACG AGCATTCGAT GAAGCTCTGG
GAAGGGCTTT CCCAGGAGCT CAACTACAAT GTGATGTATT CGCCGCGCGG CGTGATGATG
CTTTCGCACA ATATTCACGA CCAGCAGTCC TTCAAGCGGC ATATCCATGC CAACCGGCTC
TACGGCATCG ACAATGAATG GCTGACGCCG GAGCAGGCGA GGGCCTATTG TCCGCCGCTC
GATATCTCGG CCAGCGCGCG CTACCCGATC AACGGCGCAG CACTGCAGCG GCGCGGCGGC
ACGGCCAGGC ACGACGCGGT CGCCTGGGGT TATGCGCGGG CGGCCTCAGA CCGCGGGGTG
CACATCATCC AGAATTGTGA AGTGACCGGC ATCCGGCGCG GTCCGGACGG ACAGGTGACC
GGGGTCGAGA CCTCGCGTGG CTTCATCGGC GCCAGAAAGA TCGCCGTCTC GGCGGCCGGC
CATACGACAA CAATCATGCA GATGGCTGGT GTGCGCGTGC CGCTGCAATC GAGCCCTCTG
CAGGCGCTGG TCTCCGAGCC GCTGAAGCCG ATCTTTCCCT GCGTCGTCAT GTCGAACACG
GTGCATGCCT ATATCTCCCA GTCCGACAAG GGAGAGCTGG TCATCGGCGC CGGCACCGAC
CAGTATAATT CCTATTCCCA GACCGGCGGG CTGCAGATCA TCACGCACAC GCTCGACGCT
ATCTGCGAGC TCTTCCCGAT GTTCCGGCGC GTCAAGATGA TGCGGCAATG GGGCGGCATC
GTCGACAATA CGCCGGATCG TTCGGCGATC CAGTCGAAGA CGCCGGTTCC CGGGCTTTAC
GTCAATTGCG GCTGGGGCAC CGGCGGCTTC AAGGCGACGC CGGGCTCGGC CAATCTCTTC
GCGCATCTGA TTGCCCGCGA CGAGCCGCAC AAATTCAATG CCGGGCTGAC GCTGGATCGT
TTCCGCAGTG GCCGGCTGAT CGACGAGGCG GCGGCGGCGG CGGTGGCACA CTGA
 
Protein sequence
MRKYSVFAVA REALRGHKGW EKQWTSPEPR AEYDVVIIGG GGHGLGAAYY LAKEHGITNV 
AVIEKGWLGG GNTGRNTTII RSNYLYEESM HIYEHSMKLW EGLSQELNYN VMYSPRGVMM
LSHNIHDQQS FKRHIHANRL YGIDNEWLTP EQARAYCPPL DISASARYPI NGAALQRRGG
TARHDAVAWG YARAASDRGV HIIQNCEVTG IRRGPDGQVT GVETSRGFIG ARKIAVSAAG
HTTTIMQMAG VRVPLQSSPL QALVSEPLKP IFPCVVMSNT VHAYISQSDK GELVIGAGTD
QYNSYSQTGG LQIITHTLDA ICELFPMFRR VKMMRQWGGI VDNTPDRSAI QSKTPVPGLY
VNCGWGTGGF KATPGSANLF AHLIARDEPH KFNAGLTLDR FRSGRLIDEA AAAAVAH