Gene Information |
Locus tag | TM1040_3007 |
Symbol | |
ID | 4076580 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 3175510 |
End bp | 3177198 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 638008336 |
Product | urocanate hydratase |
Protein accession | YP_615001 |
Protein GI | 99082847 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2987] Urocanate hydratase |
TIGRFAM ID | [TIGR01228] urocanate hydratase |
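The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function name `cds_lengths` is hypothetical, and it assumes the start and end coordinates are 1-based and inclusive and that the final codon of the unspliced CDS is a stop codon. Both assumptions agree with the values in this record.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range, hence the +1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the last codon is the stop codon
    return gene_len, protein_len


# Start bp and End bp taken from this record.
print(cds_lengths(3175510, 3177198))  # (1689, 562)
```

The result matches the listed Gene Length (1689 bp) and Protein Length (562 aa).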
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.237465 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGATC CCCGCAAGAA TACCCGTGAC ATTTTCCCTG CCACTGGCAC CGAGATCACT GCGAAGTCCT GGTTGACGGA GGCTCCGATG CGGATGCTGA TGAACAACCT GCACCCCGAT GTGGCTGAAA ACCCGCATGA GCTCGTGGTT TATGGCGGGA TTGGCCGGGC GGCACGCACG TGGCAGGATT TCGACCTCAT CGTCGAGACC CTCAAGACTC TTGAAGAAGA TCAGACCCTG ATGGTGCAGT CCGGCAAACC CGTCGGCGTC TTTCAGACCC ACAAGGACGC GCCGCGCGTG TTGATCGCCA ACTCCAACCT CGTGCCGCAT TGGGCCAATT GGGATCACTT CAACGAGCTC GATAAGAAGG GTCTGATGAT GTACGGCCAG ATGACCGCTG GTTCGTGGAT TTACATCGGC ACCCAGGGCA TCGTGCAGGG CACCTATGAG ACCTTTGCCG AGGCGGGCCG TCAGCACTAT GGCGGGGACC TGACGGGAAA ATGGATCCTC ACCGGCGGTC TTGGGGGGAT GGGGGGGGCG CAGCCTTTGG CTGCGGTTTT TGCTGGTGCT TGCTGCCTTG CGGTGGAGTG CAACCCCGAC TCGATCGATT TCCGCCTGCG CACCAAATAC CTTGATGAGA AGGCTGAAAC GCTGGACGAA GCGCTGGAGA TGATTGAGCG TTGGACCAAG GCGGGCGAGG CCAAATCCGT TGGCCTTTTG GGCAATGCGG CGGATGTGTT TGCAGAGCTT GTGGAGCGTG CGAAGGCGGG TGGCATGCGC CCCGACATCG TGACGGATCA GACCTCAGCG CATGACCCGG TCAACGGTTA TCTGCCGCAA GGCTGGACGA TGGCCGAATG GAGAGAGAAG CGCGAAACGG ACAAAAAGGC GGTTGAGAAA GCCTCTCGGG CGTCGATGAA GGCTCATGTG AAGGCCATGG TGGATTTCCA CGAGATGGGG ATTCCCACCG TCGATTATGG CAACAACATC CGTCAAGTCG CGCTGGAAGA GGGGCTGGAG ACGGCGTTCT CATTCCCCGG ATTTGTGCCA GCCTACATTC GCCCGCTGTT CTGCAAGGGG ATCGGTCCCT TCCGTTGGTG TGCGCTCTCG GGGGATCCCG AGGATATCCG CAAGACCGAT GCCAAGATGA AAGAGCTGTT CCCGGAGAAC GAGAGCCTGC ACCGCTGGCT CGACATGGCG CAGGACCGCA TCGCCTTTCA GGGGCTACCG GCGCGGATCT GCTGGATCGG CCTTGGAGAT CGCCACAAGG CGGGGCTTGC CTTCAACGAA ATGGTGCGCA ACGGCGAATT GTCGGCGCCG GTCGTGATTG GCCGAGATCA TCTTGACTCG GGTTCCGTGG CATCGCCCAA CCGTGAAACC GAAGCGATGA TGGATGGATC GGATGCGGTC TCTGACTGGC CATTGCTCAA TGCGCTTTTG AACACGGCCT CGGGCGCGAC ATGGGTTTCG CTGCATCATG GCGGCGGTGT TGGCATGGGG TTTTCGCAGC ACTCTGGCGT GGTGATCTGC TGTGACGGCA CAGAGGATGC AGATCGCCGG ATCGGGCGCG TTCTGTGGAA CGACCCGGCG ACCGGCGTAA TGCGTCACGC AGACGCAGGC TATGAGATCG CGAAGGACTG CGCCAAAGAG CACGGATTGA ACCTGCCCGG TATCCTGCGC TCGGAATGA
|
Protein sequence | MSDPRKNTRD IFPATGTEIT AKSWLTEAPM RMLMNNLHPD VAENPHELVV YGGIGRAART WQDFDLIVET LKTLEEDQTL MVQSGKPVGV FQTHKDAPRV LIANSNLVPH WANWDHFNEL DKKGLMMYGQ MTAGSWIYIG TQGIVQGTYE TFAEAGRQHY GGDLTGKWIL TGGLGGMGGA QPLAAVFAGA CCLAVECNPD SIDFRLRTKY LDEKAETLDE ALEMIERWTK AGEAKSVGLL GNAADVFAEL VERAKAGGMR PDIVTDQTSA HDPVNGYLPQ GWTMAEWREK RETDKKAVEK ASRASMKAHV KAMVDFHEMG IPTVDYGNNI RQVALEEGLE TAFSFPGFVP AYIRPLFCKG IGPFRWCALS GDPEDIRKTD AKMKELFPEN ESLHRWLDMA QDRIAFQGLP ARICWIGLGD RHKAGLAFNE MVRNGELSAP VVIGRDHLDS GSVASPNRET EAMMDGSDAV SDWPLLNALL NTASGATWVS LHHGGGVGMG FSQHSGVVIC CDGTEDADRR IGRVLWNDPA TGVMRHADAG YEIAKDCAKE HGLNLPGILR SE
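As a rough consistency check between the two sequence fields, the gene sequence can be translated and its GC fraction computed after removing the spaces that separate the 10-character blocks. This is a minimal sketch, not the method used to produce the record. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table uses the standard amino acid assignments, which translation table 11 shares; alternative start codons are not handled, which does not matter here because the gene begins with ATG.

```python
BASES = "TCAG"
# Standard amino acid assignments, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
first_blocks = "ATGAGCGATC CCCGCAAGAA TACCCGTGAC"
print(translate(first_blocks))  # MSDPRKNTRD
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value to compare against the listed GC content.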