Gene Bphy_5037 details

Gene Information

Locus tag: Bphy_5037
Symbol:
ID: 6246533
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia phymatum STM815
Kingdom: Bacteria
Replicon accession: NC_010623
Strand:
Start bp: 2136011
End bp: 2137870
Gene Length: 1860 bp
Protein Length: 619 aa
Translation table: 11
GC content: 64%
IMG OID: 642596765
Product: dihydroxy-acid dehydratase
Protein accession: YP_001861172
Protein GI: 186473830
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
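
The coordinate and length fields in this record are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the source database. It assumes the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon. The function names `gene_length` and `protein_length` are hypothetical.

```python
# Values copied from the record above.
START_BP = 2136011
END_BP = 2137870


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp):
    """Residue count, assuming the CDS length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1860
print(protein_length(length_bp))  # 619
```

Both results match the Gene Length and Protein Length fields listed above.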


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.458835
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value: 0.533366
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGCAT ACCGTTCCAA AACTTCCACC GCCGGCCGCA ACATGGCAGG GGCGCGCTCG 
CTGTGGCGCG CCACCGGCAT GAAAGACGAA GATTTTTCCA AGCCGATTAT CGCGGTGGTG
AACTCGTTCA CCCAGTTCGT GCCCGGACAC GTGCACCTGA AAGACCTCGG CCAGCTCGTC
GCGCGCGAAA TCGAAGCGGC GGGTGGCGTC GCGAAGGAAT TCAATACCAT CGCCGTCGAC
GACGGCATCG CAATGGGCCA CGACGGCATG CTGTACTCGC TGCCGAGCCG CGACATCATC
GCAGATTCGG TCGAGTACAT GGTCAACGCC CACTGCGCCG ACGCGATGGT GTGCATCTCG
AACTGCGACA AGATTACCCC CGGCATGCTG ATGGCCGCGA TGCGCCTGAA CATTCCCGTG
ATCTTCGTGT CCGGCGGCCC GATGGAGGCC GGCAAGACGC GCCTCGCGAA TCCCGCTACG
GGCACGATCG AGTTCAAGAA GCTGGACCTC GTGGACGCCA TGGTGATCGC CGCCGACCAA
GCCTACTCCG ACGCCGACGT GGCCGAAGTC GAACGCTCCG CGTGCCCGAC CTGCGGCTCG
TGCTCGGGCA TGTTTACGGC CAACTCGATG AACTGCCTGA CGGAAGCACT CGGCCTTTCG
CTGCCGGGCA ACGGCACGGT GGTCGCGACG CACGCGGACC GCGAACAGCT CTTCAAGCGC
GCCGGCCGCC GTATCGTCGA ACTGGCGCGC CAGTACTACG AGAAGGAAGA CGAGCGCGTG
CTGCCGCGTT CGGTGGGCTT CAAGGCGTTC GAAAATGCGA TGACGCTCGA CATCGCGATG
GGCGGCTCGA CCAACACCAT CCTGCACCTG CTCGCGATCG CGCGTGAAGC CGGGATCGAC
TTCACAATGA CGGACATCGA CCGTCTGTCG CGCGTTGTGC CGCAGCTGTG CAAGGTCGCG
CCGAACACGA ATAAATACCA TATCGAAGAC GTGCATCGCG CGGGCGGCAT CATGGCTATC
CTCGGTGAGC TGGAACGTGC GGGCAAGCTG CACACGGATG TGCCGACAGT GCACGCACCT
ACGCTAAAGG ACGCGCTGAA CGCGTGGGAC ATCGCGCTCA CCGACGACGA AGCCGTGAAG
ACCTTCTATA TGGCCGGTCC CGCAGGGATT CCGACGCAGG TCGCGTTCAG CCAGAACACG
CGCTGGCCGA GCCTCGATCT CGATCGCGCC GAGGGCTGCA TTCGTTCGTA CCAGCACGCG
TTCTCGAAGG AAGGCGGCCT CGCGGTGCTG ACGGGCAACA TCGCGCTCGA CGGTTGTGTC
GTGAAAACGG CGGGCGTCGA CGAGAGCATT CTCGTATTCG AAGGCACCGC GCACGTGACG
GAATCGCAGG ATGAAGCTGT CGAGAACATC CTCAACGACA AGGTCAAGGC CGGCGACGTC
GTGATCGTCC GCTACGAAGG TCCGAAGGGC GGTCCGGGCA TGCAGGAAAT GCTGTACCCG
ACGAGCTACA TCAAGTCGAA GGGTCTGGGC AAGGCCTGCG CGCTGCTGAC GGACGGACGC
TTCTCCGGCG GCACCTCGGG CCTGTCGATC GGTCACTGCT CGCCTGAAGC GGCGGCAGGC
GGCGCGATCG GACTCGTGCG CGACGGCGAC AAGATCCGCA TCGACATTCC GAACCGCACC
ATCGACGTGC TGCTGTCCGA CGAGGAACTG GCACGCCGCC GCGAGGAACA AAACGCGAAG
GGCTGGAAGC CTGCGAAGCC GCGCCCCCGC AAGGTGTCGG CGGCGCTGAA GGCGTACGCG
AAGCTCGTGA TGTCGGCGGA CAAGGGCGCG GTGCGCGACC TGTCGTTGCT CGACGATTGA
 
Protein sequence
MPAYRSKTST AGRNMAGARS LWRATGMKDE DFSKPIIAVV NSFTQFVPGH VHLKDLGQLV 
AREIEAAGGV AKEFNTIAVD DGIAMGHDGM LYSLPSRDII ADSVEYMVNA HCADAMVCIS
NCDKITPGML MAAMRLNIPV IFVSGGPMEA GKTRLANPAT GTIEFKKLDL VDAMVIAADQ
AYSDADVAEV ERSACPTCGS CSGMFTANSM NCLTEALGLS LPGNGTVVAT HADREQLFKR
AGRRIVELAR QYYEKEDERV LPRSVGFKAF ENAMTLDIAM GGSTNTILHL LAIAREAGID
FTMTDIDRLS RVVPQLCKVA PNTNKYHIED VHRAGGIMAI LGELERAGKL HTDVPTVHAP
TLKDALNAWD IALTDDEAVK TFYMAGPAGI PTQVAFSQNT RWPSLDLDRA EGCIRSYQHA
FSKEGGLAVL TGNIALDGCV VKTAGVDESI LVFEGTAHVT ESQDEAVENI LNDKVKAGDV
VIVRYEGPKG GPGMQEMLYP TSYIKSKGLG KACALLTDGR FSGGTSGLSI GHCSPEAAAG
GAIGLVRDGD KIRIDIPNRT IDVLLSDEEL ARRREEQNAK GWKPAKPRPR KVSAALKAYA
KLVMSADKGA VRDLSLLDD
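
The gene and protein sequences above can be cross-checked with a short script. The sketch below is illustrative only and is not the tool that produced this record. It assumes the sequences are pasted as printed, in space-separated blocks of ten characters, and contain only the letters A, C, G and T. The helper names `clean`, `gc_content` and `translate` are hypothetical. NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons, so the standard table is used here; this gene begins with ATG, so no special start-codon handling is attempted.

```python
# Standard codon table laid out in TCAG order. Translation table 11 uses
# the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove spaces and line breaks and convert to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGCCCGCAT ACCGTTCCAA AACTTCCACC"
print(translate(first_blocks))  # MPAYRSKTST
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the GC content field.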