Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_2665 |
Symbol | hutU |
ID | 4883417 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 2640284 |
End bp | 2641972 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640128593 |
Product | urocanate hydratase |
Protein accession | YP_001059689 |
Protein GI | 126440789 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2987] Urocanate hydratase |
TIGRFAM ID | [TIGR01228] urocanate hydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.712116 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCACC CGAAACATAT CGATCCGCGC CTCGATCCGA CCCGCGTGAT CCGCGCGCCG CGCGGCGGCG AGAAGACCTG CAAGAACTGG CTCGCCGAGG CGGCGTACCG GATGATCCAG AACAATCTGG ACCCTGAAGT GGCCGAGCAT CCGCACGCGC TCGTCGTCTA CGGCGGCATC GGCCGCGCGG CGCGCAACTG GGATTGCTTC GATCAGATCC TCGCGTCGCT GAAGGATCTG AACGACGACG AGACGCTGCT CGTGCAGTCG GGCAAGCCGG TGGGCGTGTT CCGCACGCAC GAGAACGCGC CGCGCGTGCT GATCGCGAAC TCGAACCTCG TGCCGCACTG GGCGACGTGG GACCACTTCA ACGAGCTCGA CCGCAAGGGC CTGATGATGT ACGGCCAGAT GACGGCGGGC AGCTGGATCT ACATCGGCAG CCAGGGGATC GTGCAGGGCA CCTACGAGAC CTTCTTCGCG GTCGCGAACC AGCACTTCAA CGGCGATCCG TCGGGCCGCT GGATCCTGAC GGGCGGCCTG GGCGGGATGG GCGGCGCGCA GCCGCTTGCC GCGACGATGG CGGGCTTCTC GATGATCGCG GTCGAGTGCG ACGAATCGCG GATCGATTTC CGCCTGAAGA CGCGCTATGT CGACAGGAAG GCGACGACCC TCGACGAAGC GCTCGGCATG ATCGAAGAGG CGAAGCGCAC GGGCAAGCCC GTATCGGTGG GCCTGCTCGG CAACGCGGCC GACGTGTTCA CCGAGCTCGT CGAGCGCGGC ATCACGCCCG ACTGCGTGAC CGACCAGACG AGCGCGCACG ATCCGATCAA CGGCTACCTG CCGCAGGGCT GGAGCGTCGC GCGGTGGCGC GACGCGCAGA AGGTCGATCC GCGAAGCATC GTGCAGGTCG CCAAGCAATC GATGGCCGTG CAGGTGCGCG CGATGCTCAC GCTGCAGGCG CGCGGCGCGG CGACGCTCGA CTACGGCAAC AACATCCGCC AGATGGCGCT GGAGATGGGC GTCGAGAATG CGTTCGACTT TCCGGGCTTC GTGCCCGCCT ATATCCGGCC GCTCTTCTGC GAGGGCAAGG GCCCGTTCCG TTGGGTCGCG CTGTCGGGCG ATCCGGAGGA CATCTACAAG ACCGACCGGA AGGTGAAGGA GCTGATCCCC GACGATGCGC ACCTGCACAA CTGGCTCGAC ATGGCGCGCG AGCGCATCGC GTTCCAGGGG CTGCCCGCGC GGATCTGCTG GGTCGGCGTG AACGATCGCT ATCGTCTCGG CCAGGCGTTC AACGAGATGG TGAAGACGGG CGAGCTGAAG GCGCCGATCG TGATCGGGCG CGACCACCTC GACACCGGCT CGGTCGCGAG CCCGAATCGC GAGACCGAAG CGATGAAGGA CGGCTCGGAC GCGGTCAGCG ATTGGCCGCT GCTCAACGCG CTGCTGAACA CGGCGGGCGG CGCGTCGTGG GTGTCGCTGC ATCACGGCGG CGGCGTCGGC ATGGGCTTCT CGCAGCATGC GGGCGTCGTG ATCGTCGCCG ACGGCACCGA TGCCGCGCAC GCGCGCCTCG GCCGCGTGCT GTTCAACGAT CCGGCCACGG GCGTGATGCG TCACGCGGAC GCCGGCTATG AGCTCGCGCA GCGCACCGCG AACGAAGCGG GCCTGAAGCT GCCGATGCTC GGGCGCTGA
|
Protein sequence | MNHPKHIDPR LDPTRVIRAP RGGEKTCKNW LAEAAYRMIQ NNLDPEVAEH PHALVVYGGI GRAARNWDCF DQILASLKDL NDDETLLVQS GKPVGVFRTH ENAPRVLIAN SNLVPHWATW DHFNELDRKG LMMYGQMTAG SWIYIGSQGI VQGTYETFFA VANQHFNGDP SGRWILTGGL GGMGGAQPLA ATMAGFSMIA VECDESRIDF RLKTRYVDRK ATTLDEALGM IEEAKRTGKP VSVGLLGNAA DVFTELVERG ITPDCVTDQT SAHDPINGYL PQGWSVARWR DAQKVDPRSI VQVAKQSMAV QVRAMLTLQA RGAATLDYGN NIRQMALEMG VENAFDFPGF VPAYIRPLFC EGKGPFRWVA LSGDPEDIYK TDRKVKELIP DDAHLHNWLD MARERIAFQG LPARICWVGV NDRYRLGQAF NEMVKTGELK APIVIGRDHL DTGSVASPNR ETEAMKDGSD AVSDWPLLNA LLNTAGGASW VSLHHGGGVG MGFSQHAGVV IVADGTDAAH ARLGRVLFND PATGVMRHAD AGYELAQRTA NEAGLKLPML GR
|
| |