Gene Information |
Locus tag | Acry_1749 |
Symbol | |
ID | 5159966 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidiphilium cryptum JF-5 |
Kingdom | Bacteria |
Replicon accession | NC_009484 |
Strand | + |
Start bp | 1930184 |
End bp | 1931854 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640553666 |
Product | urocanate hydratase |
Protein accession | YP_001234872 |
Protein GI | 148260745 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2987] Urocanate hydratase |
TIGRFAM ID | [TIGR01228] urocanate hydratase |
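The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the unspliced CDS ends with a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
# Values copied from the record above.
START_BP = 1930184
END_BP = 1931854


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming an unspliced CDS with one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the results agree with the Gene Length (1671 bp) and Protein Length (556 aa) fields.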
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.132868 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCGTC GTCTCGACAA TGCGCGGACG ATCCGCGCGC CGCATGGCAG CGATCTCAGC GCCCGGTCCT GGCTGACCGA GGCGCCGCTC AGGATGCTGA TGAACAACCT CGACCCCGAT GTCGCGGAGC GGCCGGAGGA ACTCGTCGTC TATGGTGGGA TCGGCCGGGC GGCGCGGGAC TGGGAGAGTT TCGACCGCAT TCTCGGCGCG CTGCGCGACC TCGGGCCGGA GGAGACGCTG CTGGTGCAGT CGGGCAAGCC GGTCGGCGTG TTCCGCACCC ATGCGGACGC GCCGCGCGTG CTGATCGCCA ATTCCAACCT GGTGCCGAAC TGGGCGAACT GGCAGCATTT CCACGAACTC GATCGCAAGG GCCTGATGAT GTACGGCCAG ATGACGGCGG GGTCGTGGAT CTATATCGGC TCGCAGGGCA TCGTGCAGGG CACGTATGAG ACCTTCGTCG AGGCCGGGCG GCAGCATTAC GGCGGCGATC TCGCCGGGCG GTGGATCCTG ACCGGCGGGC TCGGCGGCAT GGGCGGGGCG CAGCCGCTGG CGGCGACGAT GGCCGGGGCC TCGATGATCG CGGTGGAGTG CACGCCCTCG CGCATCGAGA TGCGGCTGCG CACGCGCTAT CTCGACCGCC GGGCCGACAC GCTCGACGAG GCGCTGGCGA TGCTGGAGGC GGCGAAGCGC GACGGAAAAC CGGTCTCGAT CGGGCTGCTC GGCAATGCCG CGGAGGTGTT TCCCGAACTC GTCCGCCGCG GCATCCGCCC GGATATCGTG ACCGACCAGA CCTCGGCGCA TGATCCGGTG AACGGCTACC TGCCGGCGGG ATGGACGCTG GAGCACTGGG CGGCGATGCG CGAGCGCGAC CCCGAGGCGG TGGCGCTGGC GGCGAAGCGG TCGATGGCCG GGCAGGTGCG GGCGATGCTC GATTTCTGGC GGATGGGGAT TCCGGTGCTC GACTACGGCA ACAACATCCG CGCCATGGCG CAGGAGATGG GGGTGGCCGA CGCGTTCGAC TTTCCGGGCT TCGTGCCGGC CTATATCCGG CCGCTGTTCT GCCGGGGGAT CGGGCCGTTC CGCTGGGCGG CGCTGTCGGG CGACCCCGAG GACATCTACC GGACGGACGC GAAGGTGAAG GAGCTGATCC CCGACGATCC GCATCTGCAT CACTGGCTCG ACATGGCGCG CGAGTGGATC GCCTTCCAGG GCCTGCCGGC GCGGATCTGC TGGGTGGGGC TCGGGCAGCG GCACCGGCTC GGCCTCGCGT TCAACGAGAT GGTGGCGCGG GGCGAGCTGT CGGCGCCGGT GGTGATCGGG CGGGACCATC TCGATTCCGG CTCGGTCGCC AGCCCCAACC GCGAGACCGA GGCGATGCGC GACGGGTCGG ACGCGGTGTC GGACTGGCCG CTGCTCAACG CGCTGCTCAA CTGCGCCTCG GGGGCGACCT GGGTCTCGCT GCATCACGGC GGCGGGGTCG GGATGGGGTA TTCGCAGCAT GCGGGGATGG TGATCGTCGC CGACGGCACC GAGGCGGCGG CGAAGCGGCT GGAGCGGGTG CTGTGGAACG ACCCGGCGAC CGGGGTGATG CGCCATGCCG ATGCCGGCTA CGACATCGCC ATCGACTGCG CGCGGGAGAA CGGGCTGAAC CTGCCGGGGA TCACCGGCTG A
|
Protein sequence | MNRRLDNART IRAPHGSDLS ARSWLTEAPL RMLMNNLDPD VAERPEELVV YGGIGRAARD WESFDRILGA LRDLGPEETL LVQSGKPVGV FRTHADAPRV LIANSNLVPN WANWQHFHEL DRKGLMMYGQ MTAGSWIYIG SQGIVQGTYE TFVEAGRQHY GGDLAGRWIL TGGLGGMGGA QPLAATMAGA SMIAVECTPS RIEMRLRTRY LDRRADTLDE ALAMLEAAKR DGKPVSIGLL GNAAEVFPEL VRRGIRPDIV TDQTSAHDPV NGYLPAGWTL EHWAAMRERD PEAVALAAKR SMAGQVRAML DFWRMGIPVL DYGNNIRAMA QEMGVADAFD FPGFVPAYIR PLFCRGIGPF RWAALSGDPE DIYRTDAKVK ELIPDDPHLH HWLDMAREWI AFQGLPARIC WVGLGQRHRL GLAFNEMVAR GELSAPVVIG RDHLDSGSVA SPNRETEAMR DGSDAVSDWP LLNALLNCAS GATWVSLHHG GGVGMGYSQH AGMVIVADGT EAAAKRLERV LWNDPATGVM RHADAGYDIA IDCARENGLN LPGITG
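The sequences above are printed in space-separated blocks of 10 characters. The sketch below shows one possible way to strip that formatting, compute a GC fraction, and translate the DNA. It assumes the codon assignments of translation table 11, which are the same as those of the standard genetic code (the two tables differ only in permitted start codons). To keep it short, it uses only the first three blocks of the gene sequence as sample input, so its GC fraction will differ from the whole-gene value in the record. The names `clean_sequence`, `gc_fraction`, and `translate` are illustrative.

```python
# Codon table in the usual TCAG ordering of the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record (sample only).
sample_dna = clean_sequence("ATGAATCGTC GTCTCGACAA TGCGCGGACG")
sample_protein = translate(sample_dna)
print(sample_protein)
print(round(gc_fraction(sample_dna), 3))
```

The translated sample should correspond to the first block of the protein sequence (`MNRRLDNART`). Passing the full gene sequence through the same functions is a simple way to compare against the reported protein sequence and GC content.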
|
| |