Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1295 |
Symbol | |
ID | 5898750 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 1366640 |
End bp | 1368304 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641561780 |
Product | urocanate hydratase |
Protein accession | YP_001682923 |
Protein GI | 167645260 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2987] Urocanate hydratase |
TIGRFAM ID | [TIGR01228] urocanate hydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.0784383 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0497778 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCGCC TCGACAACAC CCGCGTCATC CGCCCCGCCA CCGGAACGGA ACTCACCGCC AAGAGCTGGC TGACCGAAGC CCCGCTGCGG ATGCTGATGA ACAACCTGCA CCCCGACGTC GCCGAGCGGC CCGAGGAGCT GGTGGTCTAT GGCGGCATCG GCCGGGCGGC GCGGGACTGG GAAAGCTACG ACAAGATCGT CGAGACCCTG CGCCGGCTGG AGGACGACGA GACCCTGCTG GTCCAGTCGG GCAAGCCGGT CGGGGTGTTC AAGACCCACC CCGACGCGCC GCGCGTGCTG ATCGCCAACT CCAACCTCGT GCCGCGCTGG GCGACCTGGG AGCATTTCAA CGAGCTCGAT CGCAAGGGCC TGGCCATGTA CGGCCAGATG ACCGCCGGCT CGTGGATCTA TATCGGCGCC CAGGGCATCG TGCAGGGCAC CTACGAGACC TTCGTCGAGA TGGGCCGCCA GCATCACGGC GGCGACCTGG CGGGCAAATG GCTGCTGACC GCGGGCCTGG GCGGCATGGG CGGGGCCCAG CCGCTGGCGG CGGTGATGGC CGGCGCCTCG TGCCTGGCCA TCGAGTGCCA GCCGTCGCGG ATCGAGATGC GCCTGCGCAC CGGCTATCTG GACAAGGCCA CCGAGCGCCT CGACGAGGCC CTGGCCTGGA TCGCCGAGGC CAACGCGGCC AAGGCCCCGG TCTCGGTCGG CCTGCTGGGC AACGCCGCCG AATTGCTGCC GGCCATGTTC GCGGCCGGCG TCCGCCCCGA CCTGCTGACC GACCAGACCA GCGCCCACGA CCCGATCAAC GGCTACCTGC CGGCCGGCTG GACCCTGGAT CAGTGGGCGA CCGCCAAGGA GCGCGAGCCG GAAACCGTCA ACCGCGCCGC CCGCGCCTCG ATGGCCGTGC ACGTCCAGGC GATGCTCGAC TTCCAGGCCG CCGGCGTACC CACGGTCGAC TACGGCAACA ACATCCGCCA GATGGCGCTG GAGGAAGGCG TCAAGAACGC CTTCGACTTC CCCGGCTTCG TGCCGGCCTA TATCCGGCCG CTGTTCTGCC GGGGGATCGG GCCGTTCCGC TGGGCGGCGC TGTCGGGCGA TCCGGAAGAC ATCGCCAAGA CCGACGCCAA GGTCAAGGAA CTGATCCCCG ACAATCCCCA CCTGCACCAC TGGCTGGACA TGGCGGCCGA GAAGATCAAG TTCCAAGGTC TTCCCGCTCG CATCTGCTGG GTCGGCCTGG GCGATCGCCA CAGGCTGGGC CTGGCCTTCA ACGCGATGGT CGCCAGCGGC GAGTTAAAGG CCCCGGTGGT GATCGGCCGC GACCACCTGG ACAGCGGCTC GGTCGCCTCG CCCAACCGCG AGACGGAAGC GATGATGGAC GGCTCGGACG CGGTGTCGGA CTGGCCGCTG CTGAACGCCC TGCTCAATAC GGCGTCCGGC GCCACCTGGG TGTCGCTACA CCATGGCGGC GGGGTCGGTA TGGGCTTCTC ACAGCACGCG GGCATGGTCA TCGTCGCCGA CGGCACCGAA GCCGCCGCCA AGCGGCTGGC GCGGGTGCTG TGGAACGACC CGGCCTCCGG CGTCATGCGC CACGCCGACG CCGGCTACGA GATCGCCAAG GCCTGCGCCC GGGAACACGG GCTGGATTTG CCTGGCATAC TGTAG
|
Protein sequence | MTRLDNTRVI RPATGTELTA KSWLTEAPLR MLMNNLHPDV AERPEELVVY GGIGRAARDW ESYDKIVETL RRLEDDETLL VQSGKPVGVF KTHPDAPRVL IANSNLVPRW ATWEHFNELD RKGLAMYGQM TAGSWIYIGA QGIVQGTYET FVEMGRQHHG GDLAGKWLLT AGLGGMGGAQ PLAAVMAGAS CLAIECQPSR IEMRLRTGYL DKATERLDEA LAWIAEANAA KAPVSVGLLG NAAELLPAMF AAGVRPDLLT DQTSAHDPIN GYLPAGWTLD QWATAKEREP ETVNRAARAS MAVHVQAMLD FQAAGVPTVD YGNNIRQMAL EEGVKNAFDF PGFVPAYIRP LFCRGIGPFR WAALSGDPED IAKTDAKVKE LIPDNPHLHH WLDMAAEKIK FQGLPARICW VGLGDRHRLG LAFNAMVASG ELKAPVVIGR DHLDSGSVAS PNRETEAMMD GSDAVSDWPL LNALLNTASG ATWVSLHHGG GVGMGFSQHA GMVIVADGTE AAAKRLARVL WNDPASGVMR HADAGYEIAK ACAREHGLDL PGIL
|
| |