Gene Information |
Locus tag | BTH_I1822 |
Symbol | hutU |
ID | 3849633 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | + |
Start bp | 2045813 |
End bp | 2047501 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637841491 |
Product | urocanate hydratase |
Protein accession | YP_442354 |
Protein GI | 83720912 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2987] Urocanate hydratase |
TIGRFAM ID | [TIGR01228] urocanate hydratase |
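The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS that ends in one stop codon (assumed)."""
    codons, remainder = divmod(cds_length_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the terminal stop codon encodes no residue


# Values copied from the Start bp and End bp fields of the record.
length_bp = gene_length(2045813, 2047501)
print(length_bp)                  # 1689
print(protein_length(length_bp))  # 562
```

Both results agree with the Gene Length (1689 bp) and Protein Length (562 aa) fields.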
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCACC CGAAACACAT CGATCCGCGC CTCGATCCGA CCCGCGTGAT CCGCGCGCCG CGCGGCGGCG AGAAGACCTG CAAGAACTGG CTCGCCGAGG CTGCGTACCG GATGATCCAG AACAATCTCG ATCCGGAGGT CGCCGAGCAT CCGCACGCGC TCGTCGTCTA CGGCGGCATC GGCCGCGCGG CGCGCAACTG GGATTGCTTC GACCAGATTC TCGCGTCGCT GAAGGACCTG AACGACGACG AGACGCTGCT CGTGCAATCG GGCAAGCCGG TGGGCGTGTT CCGCACGCAC GAGAACGCGC CGCGCGTGCT GATCGCGAAC TCGAACCTCG TGCCGCACTG GGCGACGTGG GACCACTTCA ACGAGCTCGA CCGCAAGGGC CTGATGATGT ACGGCCAGAT GACGGCGGGC AGCTGGATCT ACATCGGCAG CCAGGGGATC GTGCAGGGCA CGTACGAGAC GTTCTTCGCG GTCGCGAACC AGCACTTCAA CGGCGATCCT GCCGGGCGCT GGATTCTGAC GGGCGGGCTC GGCGGCATGG GCGGCGCGCA GCCGCTCGCC GCGACGATGG CGGGCTTCTC GATGATCGCG GTCGAATGCG ACGAATCGCG AATCGACTTC CGCCTGAAGA CGCGCTACGT CGACAAGAAG GCGAAGACGC TCGACGAAGC GCTCGCGATG ATCGACGAAG CGAAGCGCGC GGGCAAGCCG GTGTCGGTCG GCCTGCTCGG CAACGCGGCC GACGTGTTCT CGGAGATCGT CGCGCGCGGC ATCACGCCGG ACTGCGTGAC CGACCAGACG AGCGCGCACG ATCCGATCAA CGGCTACCTG CCGCAGGGCT GGAGCGTCGA GCAGTGGCGC GAGGCACAGA AGGTCGATCC GCAGAGCATC GTGAAGGCCG CGAAGCAATC GATGGCCGTG CAGGTGCGCG CGATGCTCGC GCTGCAGGAG CGCGGCGCGG CGACGCTCGA CTACGGCAAC AACATCCGCC AGATGGCGCT GGAGATGGGC GTCGACAATG CGTTCGACTT CCCGGGTTTC GTGCCCGCGT ACATCCGGCC GCTCTTCTGC GAAGGCAAGG GGCCGTTCCG CTGGGTCGCG CTGTCGGGCG ATCCGGAGGA CATCTACAAG ACCGATGCGA AGGTCAAGGA GCTGATCCCC GACGATGCGC ACCTGCACAA CTGGCTCGAC ATGGCGCGCG AGCGCATCGC GTTCCAGGGC CTGCCGGCGC GCATTTGCTG GGTCGGCGTG AAGGATCGCT ATCGCCTCGG CCAGGCGTTC AACGAGATGG TGAGGACGGG CGAGCTGAAG GCGCCGATCG TGATCGGCCG CGACCACCTC GACACCGGCT CGGTCGCAAG CCCGAACCGC GAGACGGAAG CGATGAAGGA CGGATCGGAC GCGGTGAGCG ACTGGCCGCT GCTGAACGCG CTGCTGAATA CCGCGGGCGG CGCGTCATGG GTGTCGCTGC ATCACGGCGG CGGCGTCGGC ATGGGCTTCT CGCAGCATGC GGGCGTCGTG ATCGTCGCCG ACGGCACCGA CGCCGCGCAC GAGCGCTTGG GCCGCGTCCT CTTCAACGAT CCGGCCACGG GCGTGATGCG TCACGCGGAT GCCGGCTACG AGCTCGCGCA GCGCACCGCG AAGGAAGCGG GCCTGAAGCT GCCGATGCTC GGACGCTGA
|
Protein sequence | MNHPKHIDPR LDPTRVIRAP RGGEKTCKNW LAEAAYRMIQ NNLDPEVAEH PHALVVYGGI GRAARNWDCF DQILASLKDL NDDETLLVQS GKPVGVFRTH ENAPRVLIAN SNLVPHWATW DHFNELDRKG LMMYGQMTAG SWIYIGSQGI VQGTYETFFA VANQHFNGDP AGRWILTGGL GGMGGAQPLA ATMAGFSMIA VECDESRIDF RLKTRYVDKK AKTLDEALAM IDEAKRAGKP VSVGLLGNAA DVFSEIVARG ITPDCVTDQT SAHDPINGYL PQGWSVEQWR EAQKVDPQSI VKAAKQSMAV QVRAMLALQE RGAATLDYGN NIRQMALEMG VDNAFDFPGF VPAYIRPLFC EGKGPFRWVA LSGDPEDIYK TDAKVKELIP DDAHLHNWLD MARERIAFQG LPARICWVGV KDRYRLGQAF NEMVRTGELK APIVIGRDHL DTGSVASPNR ETEAMKDGSD AVSDWPLLNA LLNTAGGASW VSLHHGGGVG MGFSQHAGVV IVADGTDAAH ERLGRVLFND PATGVMRHAD AGYELAQRTA KEAGLKLPML GR
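As a rough consistency check between the two sequence fields, the gene sequence can be translated and compared with the protein sequence. The following is a minimal sketch, not the tool used to produce this record. It assumes the 10-character blocks are separated by whitespace only, and it relies on the fact that translation table 11 assigns the same amino acids to the 64 codons as the standard code. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G ordering.
# Table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons from the first base, stopping at a stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from the record.
print(translate("ATGAACCACC CGAAACACAT CGATCCGCGC"))  # MNHPKHIDPR
# Last nine bases of the gene sequence, ending in the TGA stop codon.
print(translate("GGACGCTGA"))                          # GR
```

The two outputs match the first ten and last two residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.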