Gene Rleg_5687 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5687 
Symbol 
ID8016650 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012853 
Strand
Start bp268611 
End bp270044 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content61% 
IMG OID644827840 
Productdihydropyrimidinase 
Protein accessionYP_002979040 
Protein GI241518412 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type
[TIGR02033] D-hydantoinase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0293921 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.131902 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCCCG CCAAGCCTTA TGACCTCGTG ATCCGTCGTG GCCGTGTGGT ACTGCCGGAT 
GCGACCAGAC AGATCGATAT CGGCGTTCGC GACGGCGCGA TTGCGGCGCT CGGGCCGGAT
CTGCCCGAGG GGAAACATGA AGTCGTGGCT GAGGGGCGCA TCGTGCTGCC CGGCGGCGTC
GATAGCCATT GCCACATGGA TCAGCAGCCC TGGGAAGGGA AGGCGACAGC GGACGATTTC
AACACCGGCA CGCTGTCGGC GATGTGCGGC GGCACGACAA CTGTCGTGCC TTTCGCCATG
CAGATGCGTG GCCAGTCGCT ACGCGACATC GTCGAGGATT ATCACGAGCG CGCCCGTTCG
AAAGCGCGTA TCGACTATGG TTTTCACCTG ATCGTCGGCG ATCCATCAGC CGAAGTGTTG
CGCGACGAGA TCCCTCAGCT GATTGCCGAG GGCTGCACTT CGATCAAGAT CTACCTGACC
TATGACGGGC TGAAGCTCGA CGATTATGAG GTGTTGAATG TGCTCGACCT TGCGCGTGCT
CAAGGCGCGA TGGTCATGGT TCACGCGGAA AACGATGCCT GCATTCGCTG GTTGACCGAA
AAGTTCATCG CCTCGCGCAA GACTGAGCTG CGCTATCACG AAAAGGCGCA TTCGGCGATC
GGCGACCGTG AAGCGACCTT CCGGGCGATC AGCCTTTCGG AGCTGATCGA GACGCCGATT
CTCGTCAGCC ATGTCGCTGC GGGCGGCGCC GTCGAAGAAA TCCGCCGCGC CAAGGCGCGG
GGGCTTCCGA TCTACGCCGA AACCTGCCCC CAATATCTTT TCCTTTCGGC CGAGGATATC
GACACCCATG ATCTCTCAGG CTCCAAATGC GTCTGCACGC CGCCGCCACG CGACAAGTCG
AACCAGCCTG CAATCTGGGC CGGTATCCTG GACGGCACGC TGGAGGTTTT TTCTTCGGAC
CATTCGCCGT GGCACTATGC GGATAAGATA GCGGGCGGGC CGGGGACACC GTTCCACCGT
ATTCCTAACG GTATTCCCGG TATCGAGACA CGGCTCGCCT TGCTCTTTTC TGCTGGCGTG
AACGGCGGCC TGATCTCGCT GCAGAAATTC GCTGACCTGA CCGCCGGCGC CCCGGCGCGG
CTGTTCGGTC TTCATCCGCG CAAGGGCAGG ATTGCCGTAG GCGCAGACGC CGATATCGCG
ATTTGGGATC CGGATCGCAG CATGACGATC ACCAATTCGC TCCTGCATCA CGCGACTGAC
TACACGCCTT ATGAAGGACA GGTCGTCAAG GGCTGGCCCA TCATGACGAT CTCGCGGGGC
GATATCGTCT GGGACGACGG AAGAATCATG GCCGAGCCCG GGCGCGGCCA GTTCATTGCC
CGACAGCGAC CTTTCCCACC GCAGCAAGGT CTTTCGAAGG TCCTCGCATC ATGA
 
Protein sequence
MTPAKPYDLV IRRGRVVLPD ATRQIDIGVR DGAIAALGPD LPEGKHEVVA EGRIVLPGGV 
DSHCHMDQQP WEGKATADDF NTGTLSAMCG GTTTVVPFAM QMRGQSLRDI VEDYHERARS
KARIDYGFHL IVGDPSAEVL RDEIPQLIAE GCTSIKIYLT YDGLKLDDYE VLNVLDLARA
QGAMVMVHAE NDACIRWLTE KFIASRKTEL RYHEKAHSAI GDREATFRAI SLSELIETPI
LVSHVAAGGA VEEIRRAKAR GLPIYAETCP QYLFLSAEDI DTHDLSGSKC VCTPPPRDKS
NQPAIWAGIL DGTLEVFSSD HSPWHYADKI AGGPGTPFHR IPNGIPGIET RLALLFSAGV
NGGLISLQKF ADLTAGAPAR LFGLHPRKGR IAVGADADIA IWDPDRSMTI TNSLLHHATD
YTPYEGQVVK GWPIMTISRG DIVWDDGRIM AEPGRGQFIA RQRPFPPQQG LSKVLAS