Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_6925 |
Symbol | |
ID | 8022953 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012858 |
Strand | + |
Start bp | 377892 |
End bp | 379565 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644833786 |
Product | urocanate hydratase |
Protein accession | YP_002984920 |
Protein GI | 241666836 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2987] Urocanate hydratase |
TIGRFAM ID | [TIGR01228] urocanate hydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.102745 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAATC CACGCCACAA TATCCGCGAA ATCCGCGCGC CCCGCGGCAA CGATCTCAAT GCCAAAAGCT GGATGACCGA AGCGCCGCTA CGCATGCTGA TGAACAATCT CGACCCCGAC GTCGCGGAAA ATCCGAACGA GCTCGTCGTC TATGGCGGCA TCGGGCGCGC CGCCCGCACC TGGGAGGATT TCGACCGCAT CGTCGCGACG CTGAAGACGC TGACGGAAGA AGAAACGCTG GTCGTGCAAT CCGGCAAGCC GGTCGGCGTG TTCCGCACCC ACAAGGATGC GCCGCGGGTG CTGATCGCCA ATTCCAACCT CGTGCCGCAT TGGGCGACCT GGGACCATTT CAACGAGTTG GATAAGAAGG GCCTTGCCAT GTACGGCCAG ATGACGGCCG GCTCGTGGAT CTATATCGGC ACCCAGGGCA TCGTGCAGGG CACCTACGAG ACCTTCGTCG AGGCCGGCCG CCAACACTAC GGCGGCAATC TCAAGGGCAA ATGGATCCTG ACCGGCGGTC TCGGCGGCAT GGGCGGCGCC CAGCCGCTCG CCGCCGTCAT GGCCGGCGCC TGCTGCCTTG CCGTCGAATG CAATCCCGAT TCGATCGATT TTCGCCTGCG CACCCGCTAT GTCGACGCCA AGGCCGAGAC GCTCGATGAA GCGCTCGAGA TGATCGACCG CTGGACCAAG GCCGGCGAGG CGAAATCCGT CGGTCTGCTC GGCAACGCCG CCGAAATCCT GCCGGAGATG GTCCGCCGCG GCATCCGCCC CGATATCGTC ACCGACCAGA CCTCGGCGCA CGACCCGATC AACGGCTACC TGCCGAAGGG CTGGACGATG GGCGAATGGA AGGCAAAGCG CGAAACCGAT CCGAAGGCCG TGGAAAAAGC GGCGCGCGCC TCGATGCGCG AGCATGTCGA AGCGATGATC GCCTTCTGGA ACGCCGGCGT GCCGACCCTC GATTATGGCA ACAATATCCG CCAGGTCGCC AAGGATGAAG GCCTCGAAAA CGCCTTCGCC TTCCCTGGCT TCGTGCCGGC CTATATCCGG CCGCTGTTTT GCCGCGGCAT CGGCCCCTTC CGCTGGGCGG CCCTTTCGGG CGATCCGGAG GATATCTACA AGACCGATGC CAAGGTGAAG GAGCTGCTGC CTGACAACAA GCACCTGCAT CATTGGCTCG ACATGGCCAG GGAGCGCATC GCCTTCCAGG GCCTGCCGGC GCGCATCTGC TGGGTGGGCT TGGGCGACCG CCACAAGCTC GGCCTCGCCT TCAACGAGAT GGTGAGAACA GGCGAGCTCT CCGCCCCGAT CGTCATCGGT CGCGATCATC TGGACTCCGG CTCCGTCGCT TCGCCGAACC GCGAGACCGA AGCGATGAAG GACGGTTCCG ACGCCGTCTC CGATTGGCCG CTGCTCAACG CCCTGCTCAA CACGGCGTCG GGCGCCACCT GGGTGTCGCT GCATCACGGC GGCGGCGTCG GCATGGGCTT CTCGCAGCAT TCCGGCATGG TCATTTGCGC CGATGGCACG GACGATGCCG CAAGACGCCT CGAGCGCGTG CTCTGGAACG ACCCGGCGAC CGGTGTCATG CGCCACGCCG ATGCCGGTTA CGAGATCGCC ATCGACTGCG CCAAGGAAAA GGGCCTGCGC CTGCCCGGCA TTCTCGGGAA CTGA
|
Protein sequence | MSNPRHNIRE IRAPRGNDLN AKSWMTEAPL RMLMNNLDPD VAENPNELVV YGGIGRAART WEDFDRIVAT LKTLTEEETL VVQSGKPVGV FRTHKDAPRV LIANSNLVPH WATWDHFNEL DKKGLAMYGQ MTAGSWIYIG TQGIVQGTYE TFVEAGRQHY GGNLKGKWIL TGGLGGMGGA QPLAAVMAGA CCLAVECNPD SIDFRLRTRY VDAKAETLDE ALEMIDRWTK AGEAKSVGLL GNAAEILPEM VRRGIRPDIV TDQTSAHDPI NGYLPKGWTM GEWKAKRETD PKAVEKAARA SMREHVEAMI AFWNAGVPTL DYGNNIRQVA KDEGLENAFA FPGFVPAYIR PLFCRGIGPF RWAALSGDPE DIYKTDAKVK ELLPDNKHLH HWLDMARERI AFQGLPARIC WVGLGDRHKL GLAFNEMVRT GELSAPIVIG RDHLDSGSVA SPNRETEAMK DGSDAVSDWP LLNALLNTAS GATWVSLHHG GGVGMGFSQH SGMVICADGT DDAARRLERV LWNDPATGVM RHADAGYEIA IDCAKEKGLR LPGILGN
|
| |