Gene Information |
Locus tag | Rleg2_5930 |
Symbol | |
ID | 6977317 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011366 |
Strand | + |
Start bp | 347855 |
End bp | 349528 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643393383 |
Product | urocanate hydratase |
Protein accession | YP_002278201 |
Protein GI | 209546311 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2987] Urocanate hydratase |
TIGRFAM ID | [TIGR01228] urocanate hydratase |
|
|
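The coordinate and length fields in the Gene Information table can be cross-checked against each other. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function name `check_record` is hypothetical and not part of any database tooling.

```python
def check_record(start_bp: int, end_bp: int, gene_length_bp: int, protein_length_aa: int) -> bool:
    """Return True if the coordinates, gene length, and protein length agree.

    Assumptions (not stated in the record itself):
      - start_bp and end_bp are 1-based, inclusive coordinates
      - the CDS includes exactly one terminal stop codon
    """
    span_bp = end_bp - start_bp + 1          # inclusive span on the replicon
    if span_bp != gene_length_bp:
        return False
    if span_bp % 3 != 0:                     # an unspliced CDS should be whole codons
        return False
    codons = span_bp // 3
    return codons - 1 == protein_length_aa   # subtract the stop codon


# Values copied from the Gene Information table for Rleg2_5930.
print(check_record(347855, 349528, 1674, 557))
```

For this record the span is 349528 - 347855 + 1 = 1674 bp, which is 558 codons, or 557 amino acids plus one stop codon, so the fields are mutually consistent under these assumptions.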
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.138114 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAATC CACGTCATAA CATCCGCGAA ATCCGCGCGC CCCGCGGCAA CGAGCTCAAT TCCAAGAGCT GGATGACCGA AGCGCCGCTG CGCATGCTGA TGAACAACCT CGATCCCGAC GTCGCCGAAA ATCCGAACGA GCTCGTCGTC TACGGCGGCA TCGGCCGCGC CGCCCGCACC TGGGAGGATT TCGACCGCAT CGTCGCGACG CTGAAGACGC TGACGGAAGA AGAAACGCTG CTGGTGCAGT CCGGCAAGCC GGTTGGCGTC TTCCGCACCC ACAAGGATGC GCCACGCGTG CTGATCGCCA ATTCCAACCT CGTCCCGCAT TGGGCGACAT GGGATCATTT CAATGAGCTG GATAAGAAGG GCCTTGCCAT GTACGGCCAG ATGACGGCCG GCTCGTGGAT CTATATCGGC ACGCAAGGGA TCGTGCAGGG CACCTACGAA ACCTTCGTCG AGGCCGGCCG CCAGCATTAC GGCGGCAATC TCAAGGGCAA ATGGATCCTG ACCGGCGGCC TCGGCGGCAT GGGCGGCGCC CAGCCTCTCG CCGCCGTCAT GGCCGGCGCC TGCTGCCTCG CCGTCGAATG CAATCCAGAC TCGATCGATT TCCGCCTGCG CACCCGCTAT GTCGACGCCA AGGCCGAGAC GCTCGACGAA GCGCTCGAAA TGATCGACCG CTGGACCAAA GCCGGTGAAG CCAAATCCGT CGGCCTGCTC GGCAATGCCG CCGAAATCCT GCCGGAAATG GTCCGCCGCG GCATCCGCCC CGACATGGTC ACCGACCAGA CCTCGGCGCA CGACCCGATC AACGGCTACC TGCCGAAGGG CTGGACGATG GCCGAGTGGA AGGCCAAGCG CGAAAGCGAT CCGAAGGCCG TGGAAAAGGC CGCACGCGCC TCGATGCGCG AGCATGTCGA AGCGATGATC GCCTTCTGGG ACGCCGGCAT TCCGACGCTC GACTACGGCA ACAATATCCG CCAGGTCGCC AAGGAAGAAG GCTTGGAAAA CGCCTTCGCC TTCCCGGGCT TCGTGCCGGC TTATATCCGT CCGCTGTTCT GCCGCGGCAT TGGCCCCTTC CGCTGGGCCG CCTTGTCGGG CGACCCGGAG GATATCTACA AGACCGATGC CAAGGTGAGG GAGCTGACCC CCGGCAATAC CCATCTGCAC AACTGGCTCG ATATGGCCAG GGAGCGCATC GCCTTCCAGG GCCTGCCGGC GCGCATCTGC TGGGTCGGCC TCGGCGACCG CCACCGCCTA GCTCTGGCCT TCAATGAAAT GGTCAGGAAC GGCGAGCTTT CCGCGCCGAT CGTCATCGGC CGCGACCATC TCGACTCCGG CTCCGTCGCC TCGCCGAACC GCGAAACCGA GGCGATGAAG GACGGCTCCG ACGCCGTCTC CGACTGGCCG CTGCTGAACG CCCTGCTCAA CACGGCGTCG GGCGCCACCT GGGTATCGCT GCATCATGGC GGCGGCGTCG GCATGGGCTT CTCCCAGCAT TCGGGTGTCG TTATTTGCGC CGACGGCAGC GACGATGCGG CCAAGCGTCT CGAGCGGGTG CTCTGGAACG ACCCGGCGAC CGGCGTCATG CGCCACGCCG ATGCCGGCTA CGACATCGCC CTCGACTGCG CCCGGGACAA GGGCCTGCGC CTGCCCGGCA TCCTGGGGAA CTGA
|
Protein sequence | MNNPRHNIRE IRAPRGNELN SKSWMTEAPL RMLMNNLDPD VAENPNELVV YGGIGRAART WEDFDRIVAT LKTLTEEETL LVQSGKPVGV FRTHKDAPRV LIANSNLVPH WATWDHFNEL DKKGLAMYGQ MTAGSWIYIG TQGIVQGTYE TFVEAGRQHY GGNLKGKWIL TGGLGGMGGA QPLAAVMAGA CCLAVECNPD SIDFRLRTRY VDAKAETLDE ALEMIDRWTK AGEAKSVGLL GNAAEILPEM VRRGIRPDMV TDQTSAHDPI NGYLPKGWTM AEWKAKRESD PKAVEKAARA SMREHVEAMI AFWDAGIPTL DYGNNIRQVA KEEGLENAFA FPGFVPAYIR PLFCRGIGPF RWAALSGDPE DIYKTDAKVR ELTPGNTHLH NWLDMARERI AFQGLPARIC WVGLGDRHRL ALAFNEMVRN GELSAPIVIG RDHLDSGSVA SPNRETEAMK DGSDAVSDWP LLNALLNTAS GATWVSLHHG GGVGMGFSQH SGVVICADGS DDAAKRLERV LWNDPATGVM RHADAGYDIA LDCARDKGLR LPGILGN
|
| |
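The gene and protein sequences above are related by translation table 11, and the GC content field is derived from the gene sequence. The following is a minimal pure-Python sketch of both calculations, not the database's own method. The helper names `clean`, `gc_percent`, and `translate` are hypothetical. It assumes the sequence is pasted in the 10-base blocks shown above, and it uses the amino acid assignments of the standard genetic code, which table 11 shares (table 11 differs in its permitted start codons, which this sketch does not model).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; table 11 uses
# the same codon-to-amino-acid assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate("ATGAACAATC CACGTCATAA CATCCGCGAA"))
```

Applied to the first three blocks of the gene sequence, `translate` yields `MNNPRHNIRE`, which matches the first block of the protein sequence. Passing the full gene sequence to `gc_percent` gives a value that can be compared against the GC content field.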