Gene Information |
Locus tag | Rleg_5687 |
Symbol | |
ID | 8016650 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012853 |
Strand | - |
Start bp | 268611 |
End bp | 270044 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644827840 |
Product | dihydropyrimidinase |
Protein accession | YP_002979040 |
Protein GI | 241518412 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0044] Dihydroorotase and related cyclic amidohydrolases |
TIGRFAM ID | [TIGR00857] dihydroorotase, multifunctional complex type; [TIGR02033] D-hydantoinase |
|
|
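The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `protein_length_from_cds` are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 268611
end_bp = 270044
reported_gene_length = 1434      # bp
reported_protein_length = 477    # aa


def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_from_cds(cds_length):
    """Residue count of an unspliced CDS, excluding the terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


gene_length = span_length(start_bp, end_bp)
protein_length = protein_length_from_cds(gene_length)

# Both derived values agree with the reported fields.
print(gene_length == reported_gene_length)
print(protein_length == reported_protein_length)
```

Under these assumptions the interval spans 1434 bp, which is 478 codons: 477 residues plus one stop codon.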
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0293921 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.131902 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCCCG CCAAGCCTTA TGACCTCGTG ATCCGTCGTG GCCGTGTGGT ACTGCCGGAT GCGACCAGAC AGATCGATAT CGGCGTTCGC GACGGCGCGA TTGCGGCGCT CGGGCCGGAT CTGCCCGAGG GGAAACATGA AGTCGTGGCT GAGGGGCGCA TCGTGCTGCC CGGCGGCGTC GATAGCCATT GCCACATGGA TCAGCAGCCC TGGGAAGGGA AGGCGACAGC GGACGATTTC AACACCGGCA CGCTGTCGGC GATGTGCGGC GGCACGACAA CTGTCGTGCC TTTCGCCATG CAGATGCGTG GCCAGTCGCT ACGCGACATC GTCGAGGATT ATCACGAGCG CGCCCGTTCG AAAGCGCGTA TCGACTATGG TTTTCACCTG ATCGTCGGCG ATCCATCAGC CGAAGTGTTG CGCGACGAGA TCCCTCAGCT GATTGCCGAG GGCTGCACTT CGATCAAGAT CTACCTGACC TATGACGGGC TGAAGCTCGA CGATTATGAG GTGTTGAATG TGCTCGACCT TGCGCGTGCT CAAGGCGCGA TGGTCATGGT TCACGCGGAA AACGATGCCT GCATTCGCTG GTTGACCGAA AAGTTCATCG CCTCGCGCAA GACTGAGCTG CGCTATCACG AAAAGGCGCA TTCGGCGATC GGCGACCGTG AAGCGACCTT CCGGGCGATC AGCCTTTCGG AGCTGATCGA GACGCCGATT CTCGTCAGCC ATGTCGCTGC GGGCGGCGCC GTCGAAGAAA TCCGCCGCGC CAAGGCGCGG GGGCTTCCGA TCTACGCCGA AACCTGCCCC CAATATCTTT TCCTTTCGGC CGAGGATATC GACACCCATG ATCTCTCAGG CTCCAAATGC GTCTGCACGC CGCCGCCACG CGACAAGTCG AACCAGCCTG CAATCTGGGC CGGTATCCTG GACGGCACGC TGGAGGTTTT TTCTTCGGAC CATTCGCCGT GGCACTATGC GGATAAGATA GCGGGCGGGC CGGGGACACC GTTCCACCGT ATTCCTAACG GTATTCCCGG TATCGAGACA CGGCTCGCCT TGCTCTTTTC TGCTGGCGTG AACGGCGGCC TGATCTCGCT GCAGAAATTC GCTGACCTGA CCGCCGGCGC CCCGGCGCGG CTGTTCGGTC TTCATCCGCG CAAGGGCAGG ATTGCCGTAG GCGCAGACGC CGATATCGCG ATTTGGGATC CGGATCGCAG CATGACGATC ACCAATTCGC TCCTGCATCA CGCGACTGAC TACACGCCTT ATGAAGGACA GGTCGTCAAG GGCTGGCCCA TCATGACGAT CTCGCGGGGC GATATCGTCT GGGACGACGG AAGAATCATG GCCGAGCCCG GGCGCGGCCA GTTCATTGCC CGACAGCGAC CTTTCCCACC GCAGCAAGGT CTTTCGAAGG TCCTCGCATC ATGA
|
Protein sequence | MTPAKPYDLV IRRGRVVLPD ATRQIDIGVR DGAIAALGPD LPEGKHEVVA EGRIVLPGGV DSHCHMDQQP WEGKATADDF NTGTLSAMCG GTTTVVPFAM QMRGQSLRDI VEDYHERARS KARIDYGFHL IVGDPSAEVL RDEIPQLIAE GCTSIKIYLT YDGLKLDDYE VLNVLDLARA QGAMVMVHAE NDACIRWLTE KFIASRKTEL RYHEKAHSAI GDREATFRAI SLSELIETPI LVSHVAAGGA VEEIRRAKAR GLPIYAETCP QYLFLSAEDI DTHDLSGSKC VCTPPPRDKS NQPAIWAGIL DGTLEVFSSD HSPWHYADKI AGGPGTPFHR IPNGIPGIET RLALLFSAGV NGGLISLQKF ADLTAGAPAR LFGLHPRKGR IAVGADADIA IWDPDRSMTI TNSLLHHATD YTPYEGQVVK GWPIMTISRG DIVWDDGRIM AEPGRGQFIA RQRPFPPQQG LSKVLAS
|
| |
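The Sequence fields are printed in space-separated blocks of 10 characters. As a minimal sketch, the helpers below strip that spacing, compute a GC fraction, and translate a CDS. They are not the database's own tooling, and the names `clean`, `gc_fraction` and `translate` are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares; table 11 differs in the set of permitted start codons, and this sketch does not handle alternative starts. Only short excerpts of the gene sequence are used here to keep the example compact.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (first, second, third base over TCAG).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks and last twelve bases of the gene sequence above.
first_blocks = "ATGACGCCCG CCAAGCCTTA TGACCTCGTG"
last_bases = "CTCGCATC ATGA"

print(translate(first_blocks))  # MTPAKPYDLV, the first block of the protein sequence
print(translate(last_bases))    # LAS, the last three residues before the TGA stop
print(gc_fraction(first_blocks))
```

Running the same functions on the full gene sequence would allow the reported GC content and protein sequence to be checked end to end.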