Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_1580 |
Symbol | |
ID | 4896240 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 1661399 |
End bp | 1663081 |
Gene Length | 1683 bp |
Protein Length | 560 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640112171 |
Product | urocanate hydratase |
Protein accession | YP_001043462 |
Protein GI | 126462348 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2987] Urocanate hydratase |
TIGRFAM ID | [TIGR01228] urocanate hydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.21091 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.126486 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAGCC GTCACAACCT GCGCGACATC TTCCCGCCCA CCGGCTCCAC CCTCTCGGCC CGAAACTGGC AGTCCGAGGC GGCGCTGCGG ATGCTGATGA ACAACCTCCA TCCCGATGTG GCCGAGACTC CGCACGAGCT GGTGGTCTAT GGCGGGATCG GCCGCGCCGC CCGCAGCTGG GAGGATTTCG ACCGGATCGT GGCGGCGCTC CGCGCGCTCG AGGCCGACGA GACGCTGCTC GTGCAGTCGG GCCGCCCGGT CGGCGTCTTC CGCACCCACG AGGATGCGCC GCGGGTGCTG ATCGCCAATT CCAACCTCGT GCCGCACTGG GCCACCTGGG AGCATTTCCA CGAGCTCGAC CGCAAGGGGC TCATGATGTA CGGCCAGATG ACGGCCGGCA GCTGGATCTA TATCGGCACG CAGGGCATCG TGCAGGGCAC CTACGAGACC TTCGCCGAGG TGGGGCGCCA GCAGTATGGC GGCGATCTCA CCGGCCGCTG GGTGCTGACG GCGGGTCTCG GCGGCATGGG CGGCGCGCAG CCGCTCGCCG CGGTGATGGC CGGCGCCTCC TGCCTCGCGG TCGAATGCGA CGAGAGCCGG ATCGACGCCC GGCTGCGCAC CCGCTATCTC GACGAGAAGG CGCAGACGCT CGACGAGGCG CTGGCGCTCA TCGCGCGCTG GACGGCCGCG GGCGAGGCGC GCTCGGTGGG GCTCCTCGGC AATGCGGCCG AGGTCGTCCC CGAGATCCTC GCGCGGATGC GGGCGGGCGG GCCGCGCCCC GACATCGTGA CCGACCAGAC CTCGGCCCAC GATCCGCTCC ACGGCTACCT CCCCGCGGGC TGGAGCCTCG GCGACTGGCG GGCCCGGGCC GAAAGCGATC CCGCCGCCGT CACGAAGGCC GCCCGCGCCG CGATGCGCAC CCATGTCGAG GCGATGGTGG GCTTTCACGA TGCGGGCGTG CCGACGCTCG ATTACGGCAA CAACATCCGC CAGATGGCGC TGGAGGAGGG CTTCGCGCGC GCCTTCGCCT TCCCGGGCTT CGTGCCTGCC TGCATCCGCC CGCTCTTCTG CCGCGGGATC GGCCCCTTCC GCTGGGTCGC CCTCTCGGGC GATCCCGAAG ACATCCGCAG GACCGATGCA AAGATGAAGG AGCTCTTCCC CGACAATGCC CATCTCCACC GCTGGCTCGA CATGGCGGCC GAACGGATCG CCTTCCAGGG CCTGCCCGCG CGGATCTGCT GGATCGGCCT CGGCGAGCGC CATCTCGCGG GCCTCGCCTT CAACGAGATG GTGCGCCGGG GCGAGCTTGC GGCGCCCGTG GTGATCGGGC GCGACCATCT CGACTCCGGC TCCGTCGCCT CGCCCAACCG CGAAACCGAA GCGATGAAGG ACGGCTCCGA TGCGGTGTCG GACTGGCCGC TCCTGAACGC ACTCCTCAAC ACCGCCTCGG GTGCGACCTG GGTCTCGCTC CATCACGGCG GCGGCGTGGG GATGGGCTTC TCGCAGCATG CGGGCATGGT GATCTGCTGC GACGGCAGCG CGGCCGCCGA CCGACGCCTC GCCCGCGTGC TCTGGAACGA TCCCGCCACC GGCGTCATGC GCCACGCCGA CGCGGGCTAC GAGAAGGCGC TCGCCTGCGC CCGCACCCAC GGCCTCACGC TCCCCTCCAT CCCCGCCCCC TGA
|
Protein sequence | MTSRHNLRDI FPPTGSTLSA RNWQSEAALR MLMNNLHPDV AETPHELVVY GGIGRAARSW EDFDRIVAAL RALEADETLL VQSGRPVGVF RTHEDAPRVL IANSNLVPHW ATWEHFHELD RKGLMMYGQM TAGSWIYIGT QGIVQGTYET FAEVGRQQYG GDLTGRWVLT AGLGGMGGAQ PLAAVMAGAS CLAVECDESR IDARLRTRYL DEKAQTLDEA LALIARWTAA GEARSVGLLG NAAEVVPEIL ARMRAGGPRP DIVTDQTSAH DPLHGYLPAG WSLGDWRARA ESDPAAVTKA ARAAMRTHVE AMVGFHDAGV PTLDYGNNIR QMALEEGFAR AFAFPGFVPA CIRPLFCRGI GPFRWVALSG DPEDIRRTDA KMKELFPDNA HLHRWLDMAA ERIAFQGLPA RICWIGLGER HLAGLAFNEM VRRGELAAPV VIGRDHLDSG SVASPNRETE AMKDGSDAVS DWPLLNALLN TASGATWVSL HHGGGVGMGF SQHAGMVICC DGSAAADRRL ARVLWNDPAT GVMRHADAGY EKALACARTH GLTLPSIPAP
|
| |