Gene BURPS1106A_2721 details


Gene Information

Locus tag: BURPS1106A_2721
Symbol: hutU
ID: 4899999
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 2688164
End bp: 2689852
Gene Length: 1689 bp
Protein Length: 562 aa
Translation table: 11
GC content: 68%
IMG OID: 640135948
Product: urocanate hydratase
Protein accession: YP_001066972
Protein GI: 126453635
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2987] Urocanate hydratase
TIGRFAM ID: [TIGR01228] urocanate hydratase
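
The coordinate and length fields in this record are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the database record. It assumes 1-based inclusive coordinates and a gene length that includes the stop codon. The function names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2688164, 2689852)
print(length)                  # 1689
print(protein_length(length))  # 562
```

Both values match the Gene Length and Protein Length fields listed in the record.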


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACCACC CGAAACATAT CGATCCGCGC CTCGATCCGA CCCGCGTGAT CCGCGCGCCG 
CGCGGCGGCG AGAAGACCTG CAAGAACTGG CTCGCCGAGG CGGCGTACCG GATGATCCAG
AACAATCTGG ACCCCGAAGT GGCCGAGCAT CCGCACGCGC TCGTCGTCTA CGGCGGCATC
GGCCGCGCGG CGCGCAACTG GGATTGCTTC GATCAGATCC TCGCGTCGCT GAAGGATCTG
AACGACGACG AGACGCTGCT CGTGCAGTCG GGCAAGCCGG TGGGCGTGTT CCGCACGCAC
GAGAACGCGC CGCGCGTGCT GATCGCGAAC TCGAACCTCG TGCCGCACTG GGCGACGTGG
GACCACTTCA ACGAGCTCGA CCGCAAGGGC CTGATGATGT ACGGCCAGAT GACGGCGGGC
AGCTGGATCT ACATCGGCAG CCAGGGGATC GTGCAGGGCA CCTACGAGAC CTTCTTCGCG
GTCGCGAACC AGCACTTCAA CGGCGATCCG TCTGGCCGCT GGATCCTGAC GGGCGGCCTG
GGCGGGATGG GCGGCGCGCA GCCGCTTGCC GCGACGATGG CGGGCTTCTC GATGATCGCG
GTCGAGTGCG ACGAATCGCG GATCGATTTC CGCCTGAAGA CGCGCTATGT CGACAGGAAG
GCGACGACCC TCGACGAAGC GCTCGGCATG ATCGAAGAGG CGAAGCGCAC GGGCAAGCCC
GTATCGGTGG GCCTGCTCGG CAACGCGGCC GACGTGTTCA CCGAGCTCGT CGAGCGCGGC
ATCACGCCGG ACTGCGTGAC CGACCAGACG AGCGCGCACG ATCCGATCAA CGGCTACCTG
CCGCAGGGCT GGAGCGTCGC GCAGTGGCGC GACGCGCAGA AGGTCGATCC GCGAAGCATC
GTGCAGGTCG CCAAGCAATC GATGGCCGTG CAGGTGCGCG CGATGCTCAC GCTGCAGGCG
CGCGGCGCGG CGACGCTCGA CTACGGCAAC AACATCCGCC AGATGGCGCT GGAGATGGGC
GTCGAGAATG CGTTCGACTT TCCGGGCTTC GTGCCCGCCT ATATCCGGCC GCTCTTCTGC
GAGGGCAAGG GCCCGTTCCG TTGGGTCGCG CTGTCGGGCG ATCCGGAGGA CATCTACAAG
ACCGACCGGA AGGTGAAGGA GCTGATCCCC GACGATGCGC ACCTGCACAA CTGGCTCGAC
ATGGCGCGCG AGCGCATCGC GTTCCAGGGG CTGCCCGCGC GGATCTGCTG GGTCGGCGTG
AACGATCGCT ATCGTCTCGG CCAGGCGTTC AACGAGATGG TGAAGACGGG CGAGCTGAAG
GCGCCGATCG TGATCGGGCG CGACCACCTC GACACCGGCT CGGTCGCGAG CCCGAATCGC
GAGACCGAAG CGATGAAGGA CGGCTCGGAC GCGGTCAGCG ATTGGCCGCT GCTCAACGCG
CTGCTGAACA CGGCGGGCGG CGCGTCGTGG GTGTCGCTGC ATCACGGCGG CGGCGTCGGC
ATGGGCTTCT CGCAGCATGC GGGCGTCGTG ATCGTCGCCG ACGGCACCGA TGCCGCGCAC
GCGCGCCTCG GCCGCGTGCT GTTCAACGAT CCGGCCACGG GCGTGATGCG TCACGCGGAC
GCCGGCTATG AGCTCGCGCA GCGCACCGCG AACGAAGCGG GCCTGAAGCT GCCGATGCTC
GGGCGCTGA
 
Protein sequence
MNHPKHIDPR LDPTRVIRAP RGGEKTCKNW LAEAAYRMIQ NNLDPEVAEH PHALVVYGGI 
GRAARNWDCF DQILASLKDL NDDETLLVQS GKPVGVFRTH ENAPRVLIAN SNLVPHWATW
DHFNELDRKG LMMYGQMTAG SWIYIGSQGI VQGTYETFFA VANQHFNGDP SGRWILTGGL
GGMGGAQPLA ATMAGFSMIA VECDESRIDF RLKTRYVDRK ATTLDEALGM IEEAKRTGKP
VSVGLLGNAA DVFTELVERG ITPDCVTDQT SAHDPINGYL PQGWSVAQWR DAQKVDPRSI
VQVAKQSMAV QVRAMLTLQA RGAATLDYGN NIRQMALEMG VENAFDFPGF VPAYIRPLFC
EGKGPFRWVA LSGDPEDIYK TDRKVKELIP DDAHLHNWLD MARERIAFQG LPARICWVGV
NDRYRLGQAF NEMVKTGELK APIVIGRDHL DTGSVASPNR ETEAMKDGSD AVSDWPLLNA
LLNTAGGASW VSLHHGGGVG MGFSQHAGVV IVADGTDAAH ARLGRVLFND PATGVMRHAD
AGYELAQRTA NEAGLKLPML GR
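
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction and a basic coding-sequence sanity check. It is an illustration under stated assumptions, not the method used by the database: the helper names are hypothetical, the check accepts only ATG as a start codon (translation table 11 also permits alternative starts), and the stop codons are the three standard ones used by table 11. The example input is a short toy sequence, not the gene itself.

```python
# Stop codons used by translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Simplified check: length divisible by 3, ATG start, stop codon at the end."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# Toy input in the same block layout as the record (hypothetical sequence).
toy = clean_sequence("ATGAAC\nCACTGA ")
print(toy)
print(round(gc_fraction(toy), 3))
print(looks_like_complete_cds(toy))
```

To apply this to the record, paste the full gene sequence text into `clean_sequence` and compare the resulting length and GC fraction with the Gene Length and GC content fields.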