Gene BURPS668_2665 details

Gene Information

Locus tag: BURPS668_2665
Symbol: hutU
ID: 4883417
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 2640284
End bp: 2641972
Gene Length: 1689 bp
Protein Length: 562 aa
Translation table: 11
GC content: 68%
IMG OID: 640128593
Product: urocanate hydratase
Protein accession: YP_001059689
Protein GI: 126440789
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2987] Urocanate hydratase
TIGRFAM ID: [TIGR01228] urocanate hydratase
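
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon; neither convention is stated in the record, but both are consistent with the listed values. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 2640284
END_BP = 2641972
GENE_LENGTH_BP = 1689
PROTEIN_LENGTH_AA = 562


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                 # True
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

Under these assumptions, 2641972 - 2640284 + 1 = 1689 bp, and 1689 / 3 = 563 codons, of which one is the stop codon, leaving 562 residues.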


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.712116
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACCACC CGAAACATAT CGATCCGCGC CTCGATCCGA CCCGCGTGAT CCGCGCGCCG 
CGCGGCGGCG AGAAGACCTG CAAGAACTGG CTCGCCGAGG CGGCGTACCG GATGATCCAG
AACAATCTGG ACCCTGAAGT GGCCGAGCAT CCGCACGCGC TCGTCGTCTA CGGCGGCATC
GGCCGCGCGG CGCGCAACTG GGATTGCTTC GATCAGATCC TCGCGTCGCT GAAGGATCTG
AACGACGACG AGACGCTGCT CGTGCAGTCG GGCAAGCCGG TGGGCGTGTT CCGCACGCAC
GAGAACGCGC CGCGCGTGCT GATCGCGAAC TCGAACCTCG TGCCGCACTG GGCGACGTGG
GACCACTTCA ACGAGCTCGA CCGCAAGGGC CTGATGATGT ACGGCCAGAT GACGGCGGGC
AGCTGGATCT ACATCGGCAG CCAGGGGATC GTGCAGGGCA CCTACGAGAC CTTCTTCGCG
GTCGCGAACC AGCACTTCAA CGGCGATCCG TCGGGCCGCT GGATCCTGAC GGGCGGCCTG
GGCGGGATGG GCGGCGCGCA GCCGCTTGCC GCGACGATGG CGGGCTTCTC GATGATCGCG
GTCGAGTGCG ACGAATCGCG GATCGATTTC CGCCTGAAGA CGCGCTATGT CGACAGGAAG
GCGACGACCC TCGACGAAGC GCTCGGCATG ATCGAAGAGG CGAAGCGCAC GGGCAAGCCC
GTATCGGTGG GCCTGCTCGG CAACGCGGCC GACGTGTTCA CCGAGCTCGT CGAGCGCGGC
ATCACGCCCG ACTGCGTGAC CGACCAGACG AGCGCGCACG ATCCGATCAA CGGCTACCTG
CCGCAGGGCT GGAGCGTCGC GCGGTGGCGC GACGCGCAGA AGGTCGATCC GCGAAGCATC
GTGCAGGTCG CCAAGCAATC GATGGCCGTG CAGGTGCGCG CGATGCTCAC GCTGCAGGCG
CGCGGCGCGG CGACGCTCGA CTACGGCAAC AACATCCGCC AGATGGCGCT GGAGATGGGC
GTCGAGAATG CGTTCGACTT TCCGGGCTTC GTGCCCGCCT ATATCCGGCC GCTCTTCTGC
GAGGGCAAGG GCCCGTTCCG TTGGGTCGCG CTGTCGGGCG ATCCGGAGGA CATCTACAAG
ACCGACCGGA AGGTGAAGGA GCTGATCCCC GACGATGCGC ACCTGCACAA CTGGCTCGAC
ATGGCGCGCG AGCGCATCGC GTTCCAGGGG CTGCCCGCGC GGATCTGCTG GGTCGGCGTG
AACGATCGCT ATCGTCTCGG CCAGGCGTTC AACGAGATGG TGAAGACGGG CGAGCTGAAG
GCGCCGATCG TGATCGGGCG CGACCACCTC GACACCGGCT CGGTCGCGAG CCCGAATCGC
GAGACCGAAG CGATGAAGGA CGGCTCGGAC GCGGTCAGCG ATTGGCCGCT GCTCAACGCG
CTGCTGAACA CGGCGGGCGG CGCGTCGTGG GTGTCGCTGC ATCACGGCGG CGGCGTCGGC
ATGGGCTTCT CGCAGCATGC GGGCGTCGTG ATCGTCGCCG ACGGCACCGA TGCCGCGCAC
GCGCGCCTCG GCCGCGTGCT GTTCAACGAT CCGGCCACGG GCGTGATGCG TCACGCGGAC
GCCGGCTATG AGCTCGCGCA GCGCACCGCG AACGAAGCGG GCCTGAAGCT GCCGATGCTC
GGGCGCTGA
 
Protein sequence
MNHPKHIDPR LDPTRVIRAP RGGEKTCKNW LAEAAYRMIQ NNLDPEVAEH PHALVVYGGI 
GRAARNWDCF DQILASLKDL NDDETLLVQS GKPVGVFRTH ENAPRVLIAN SNLVPHWATW
DHFNELDRKG LMMYGQMTAG SWIYIGSQGI VQGTYETFFA VANQHFNGDP SGRWILTGGL
GGMGGAQPLA ATMAGFSMIA VECDESRIDF RLKTRYVDRK ATTLDEALGM IEEAKRTGKP
VSVGLLGNAA DVFTELVERG ITPDCVTDQT SAHDPINGYL PQGWSVARWR DAQKVDPRSI
VQVAKQSMAV QVRAMLTLQA RGAATLDYGN NIRQMALEMG VENAFDFPGF VPAYIRPLFC
EGKGPFRWVA LSGDPEDIYK TDRKVKELIP DDAHLHNWLD MARERIAFQG LPARICWVGV
NDRYRLGQAF NEMVKTGELK APIVIGRDHL DTGSVASPNR ETEAMKDGSD AVSDWPLLNA
LLNTAGGASW VSLHHGGGVG MGFSQHAGVV IVADGTDAAH ARLGRVLFND PATGVMRHAD
AGYELAQRTA NEAGLKLPML GR
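
The relationship between the two sequence blocks can be illustrated with a minimal translation sketch. It is not the database's own tooling: the helper names `clean_sequence` and `translate` are hypothetical, and the codon table is the standard one, whose amino acid assignments translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not model). The sequences are written in space-separated blocks of ten, so whitespace is removed first.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G at each position).
# Translation table 11 uses the same codon-to-amino-acid assignments.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence and its final 9 bases.
    head = clean_sequence("ATGAACCACC CGAAACATAT CGATCCGCGC")
    tail = clean_sequence("GGGCGCTGA")
    print(translate(head))  # MNHPKHIDPR
    print(translate(tail))  # GR
```

The first ten residues and the last two agree with the start and end of the protein sequence listed above; the final TGA codon is the stop codon and contributes no residue.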