Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4656 |
Symbol | |
ID | 5318819 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 1165915 |
End bp | 1167594 |
Gene Length | 1680 bp |
Protein Length | 559 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640776454 |
Product | urocanate hydratase |
Protein accession | YP_001313386 |
Protein GI | 150376790 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2987] Urocanate hydratase |
TIGRFAM ID | [TIGR01228] urocanate hydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.312199 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.0628347 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACATGA ACAACCCGCG CCACAATATT CGCGAAGTAC GCAGCCCGCG AGGCACCGAG ATCAGCGCCA GGAGCTGGCT GACCGAAGCG CCGCTCAGAA TGCTGATGAA CAATCTCGAT CCTGATGTCG CCGAGAACCC GCACGAACTC GTCGTCTATG GCGGCATCGG CCGCGCTGCA CGCACCTGGG CCGATTTCGA CCGCATCGTC GCCTCGCTCA GGTATCTGAA CGAAGACGAG ACCCTGCTCG TCCAGTCTGG CAAGCCGGTC GGAGTCTTTC GCACGCACCG GGATGCGCCG CGGGTGTTGA TCGCCAACTC CAATCTCGTG CCGCACTGGG CGAACTGGGA TCATTTCAAT GAGCTGGATA AGAAGGGTCT CGCCATGTAT GGCCAGATGA CCGCCGGCTC GTGGATCTAT ATCGGCACTC AAGGGATCGT GCAGGGCACG TATGAGACCT TCGTCGAGGC GGGCCGCCAG CACTATGGCG GCAGCCTGAA AGGCAAGTGG ATCCTGACGG GCGGCCTCGG CGGTATGGGC GGAGCACAGC CCCTTGCGGC GGTCATGGCC GGGGCCTGCT GCCTCGCCGT CGAATGCAAC CCGGATTCGA TCGATTTCCG CCTCCGTACC CGCTACCTCG ACGAAAAGGC CGAGACGCTG GAGGAAGCCA TGGAAATGAT CGAGCGCTGG ACGAAAGCCG GCGAAGCCAA GTCCGTCGGT CTTCTCGGCA ATGCAGCTGA AATTCTGCCC CAAATGGTCC GCCGCGGCAT CCGCCCCGAT ATCGTCACCG ACCAGACCTC CGCCCACGAT CCGGTGAATG GCTACCTGCC GAAGGGCTGG ACGATCGCCG AATGGAAGGC CAAGCGCGAG AGCGACCCGA AGACGGTCGA GAAGGCCGCC CGCGCGTCGA TGCGCGACCA TGTCGAAGCA ATGCTTGCCT TCTGGGATTC CGGCATACCG ACGCTCGACT ACGGGAACAA CATCCGCCAG GTCGCCAAGG ACGAAGGTCT CGAGCGCGCC TTTGATTTCC CCGGTTTCGT GCCGGCCTAT ATCCGTCCGC TTTTCTGCCG GGGCATCGGT CCGTTCCGCT GGGCGGCTCT CTCGGGCGAT CCGGAAGATA TCTACAAGAC CGACCAGAAG GTGAAGGAGC TGCTGCCGGA TAACAAGCAC CTGCACAACT GGCTCGACAT GGCGCGCGAG CGCATCGCCT TTCAGGGCCT GCCCGCACGC ATCTGCTGGG TTGGCCTTGG CGATCGCCAC CGCCTCGGCC TCGCCTTCAA CGAGATGGTG CGCAGCGGCG AGCTGAAGGC GCCGATCGTC ATCGGGCGCG ACCATCTCGA CTCCGGCTCC GTCGCCTCGC CAAACCGCGA GACCGAAGCG ATGAAGGACG GGTCCGATGC CGTCTCCGAT TGGCCGCTCC TGAACGCTCT CCTCAACACC GCCTCCGGTG CCACCTGGGT ATCACTGCAC CACGGCGGCG GGGTCGGCAT GGGCTTCTCG CAGCATTCCG GAATGGTGAT CTGCTGCGAC GGTACGGACG ATGCGGCACG CCGCGTTGAG CGGGTGCTGT GGAACGACCC GGCAACCGGC GTCATGCGCC ACGCCGATGC CGGATATGAC ATCGCATTGG ACTGCGCCCG CGAAAAAGGC CTCCGCCTGC CGGGCATATT GGGCGAGTGA
|
Protein sequence | MNMNNPRHNI REVRSPRGTE ISARSWLTEA PLRMLMNNLD PDVAENPHEL VVYGGIGRAA RTWADFDRIV ASLRYLNEDE TLLVQSGKPV GVFRTHRDAP RVLIANSNLV PHWANWDHFN ELDKKGLAMY GQMTAGSWIY IGTQGIVQGT YETFVEAGRQ HYGGSLKGKW ILTGGLGGMG GAQPLAAVMA GACCLAVECN PDSIDFRLRT RYLDEKAETL EEAMEMIERW TKAGEAKSVG LLGNAAEILP QMVRRGIRPD IVTDQTSAHD PVNGYLPKGW TIAEWKAKRE SDPKTVEKAA RASMRDHVEA MLAFWDSGIP TLDYGNNIRQ VAKDEGLERA FDFPGFVPAY IRPLFCRGIG PFRWAALSGD PEDIYKTDQK VKELLPDNKH LHNWLDMARE RIAFQGLPAR ICWVGLGDRH RLGLAFNEMV RSGELKAPIV IGRDHLDSGS VASPNRETEA MKDGSDAVSD WPLLNALLNT ASGATWVSLH HGGGVGMGFS QHSGMVICCD GTDDAARRVE RVLWNDPATG VMRHADAGYD IALDCAREKG LRLPGILGE
|
| |