Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_0009 |
Symbol | |
ID | 8011259 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 7917 |
End bp | 8861 |
Gene Length | 945 bp |
Protein Length | 314 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644822600 |
Product | Inosine/uridine-preferring nucleoside hydrolase |
Protein accession | YP_002973860 |
Protein GI | 241202764 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG1957] Inosine-uridine nucleoside N-ribohydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.000140534 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCAAGCG CAAGAAAGAT CATCATCGAC ACGGATCCCG GCCAGGACGA CGCGGCCGCC ATCATGCTGG CCTTCGGCAG TCCCGATGAG CTGGAGGTGC TGGGGATCAC GACGGTCGCC GGCAACGTGC CGCTTTCGCT CACCAGCCGC AATGCGCGCA TCGTCTGCGA GCTTTGCGAA CGGACGGAGA CGAAGGTTTT CGCCGGCGCC GACGCACCGA TCGCCCGCAA GCTGGTGACG GCCGAACATG TGCACGGCAA GACCGGCCTC GACGGTCCGG AGCTGAACGA GCCGACGATG GCGCTGCAGC CTGGCCATGC CGTCGACTTC ATCATCGAGA CGTTGCGCCA TGAGCCTGAA GGCACGGTGA CGCTGTGCAC GCTCGGGCCG CTCACCAATA TCGGCATGGC CTTCCAGAAG GCGCCCGACA TCATCCCCCG CATCCGCGAA CTGGTGATGA TGGGCGGCGG CTTCTTCGAG GGCGGCAACA TCACGCCGGC GGCCGAATTC AACATCTATG TCGACCCCGA AGCCGCCGAT ATCGTCTTCC GCTCAGGCGT TCCGATCGTG ATGATGCCGC TAGATGTGAC GCATCAATTG CTGACCCGCA AGGACCGGGT GAAACGCATG GCCGAGATCG GCACGGCGCC GGCAAAGGCC ATGGTCGAGA TGCTCGAATT CTTCGAACGC TTCGACATCG AGAAATACGG TTCCGACGGC GGGCCGCTGC ACGACCCGAC CGTCGTCGCC TACCTGCTGA AGCCGGAGCT TTTCCAGGGC CGGGACTGCA ATGTCGAGAT CGAGGTCCAG TCCGAACTCA CCGTCGGCAT GACGGTCGTC GACTGGTGGC ATGTGACCGA GCGCAAGCGC AACGCCAAGG TTATGCGCCA TGTCGATGCG GATGGCTTTT TCGATCTGCT GATCGAACGC TTCGCCCGCA TCTGA
|
Protein sequence | MASARKIIID TDPGQDDAAA IMLAFGSPDE LEVLGITTVA GNVPLSLTSR NARIVCELCE RTETKVFAGA DAPIARKLVT AEHVHGKTGL DGPELNEPTM ALQPGHAVDF IIETLRHEPE GTVTLCTLGP LTNIGMAFQK APDIIPRIRE LVMMGGGFFE GGNITPAAEF NIYVDPEAAD IVFRSGVPIV MMPLDVTHQL LTRKDRVKRM AEIGTAPAKA MVEMLEFFER FDIEKYGSDG GPLHDPTVVA YLLKPELFQG RDCNVEIEVQ SELTVGMTVV DWWHVTERKR NAKVMRHVDA DGFFDLLIER FARI
|
| |