Gene Bcep18194_B1899 details

Gene Information

Locus tag: Bcep18194_B1899
Symbol:
ID: 3753664
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia sp. 383
Kingdom: Bacteria
Replicon accession: NC_007511
Strand:
Start bp: 2173922
End bp: 2175781
Gene Length: 1860 bp
Protein Length: 619 aa
Translation table: 11
GC content: 66%
IMG OID: 637766748
Product: dihydroxy-acid dehydratase
Protein accession: YP_372657
Protein GI: 78062749
COG category: [E] Amino acid transport and metabolism
              [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
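
The start, end, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it is consistent with the listed 1860 bp) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is assumed to be a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2173922, 2175781)
print(length, protein_length(length))  # prints: 1860 619
```

Both values agree with the Gene Length and Protein Length fields of the record.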


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.18498
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.373013
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCACTT ACCGTTCCAA AACCTCCACC GCAGGCCGCA ACATGGCAGG TGCGCGCTCG 
CTCTGGCGCG CCACCGGCAT GAAAGACGAC GATTTCTCGA AGCCGATCAT CGCGGTCGTC
AACTCGTTCA CCCAGTTCGT GCCGGGGCAC GTGCACCTGA AGGATCTCGG CCAGCTCGTC
GCGCGCGAGA TCGAGGCTGC CGGCGGCGTC GCGAAGGAAT TCAACACGAT CGCGGTCGAC
GACGGCATCG CGATGGGCCA CGACGGCATG CTGTATTCGC TGCCGAGCCG CGACATCATC
GCCGACTCGG TCGAATACAT GGTGAACGCG CACTGCGCGG ATGCGATGGT GTGCATCTCG
AACTGCGACA AGATCACGCC GGGGATGCTG ATGGCCGCGA TGCGCCTGAA CATCCCGGTG
ATCTTCGTGT CGGGCGGCCC GATGGAAGCG GGCAAGACGC GCCTCGCGAA CCCGGTCACC
AAGGCCATCG AAGTGAAGAA GCTCGACCTC GTCGACGCGA TGGTGATTGC GGTCGACCCG
TCGTATTCCG ACGCTGAAGT CGCCGAAGTC GAACGCTCGG CCTGCCCGAC TTGCGGCTCG
TGCTCGGGCA TGTTCACCGC GAACTCGATG AACTGCCTGA CCGAAGCGCT CGGCCTGTCG
CTGCCCGGCA ACGGCACGGT GGTGGCCACG CACGCCGACC GCGAGCAACT GTTCAAGCGC
GCCGGCCGTC GCATCGTCGA ACTCACCCGC CAGCACTACG AGCAGGACGA CGAACGCGTG
TTGCCGCGCT CGGTGGGCTT CAAGGCGTTC GAGAACGCGA TGACGCTCGA CATCGCGATG
GGCGGCTCGA CCAACACGAT CCTGCACCTG CTGGCGATCG CGCAGGAAGC CGGCATCGAC
TTCACGATGA AGGACATCGA CCGCCTGTCG CGCGTCGTGC CGCAGCTGTG CAAGGTCGCA
CCGAACACGA ACAAGTACCA CATCGAGGAC GTGCACCGCG CAGGCGGCAT CATGGCGATC
CTCGGCGAGC TCGACCGCGC CGGCAAGCTG CACACCGACG TGCCGACCGT ACACACGCCG
TCGCTGAAGG ATGCGCTCGA CCAGTGGGAC ATCGTCCGCA CGCAGGACGA TGCGGTCCGC
ACGTTCTACC AGGCCGGCCC GGCCGGCGTC CCGACGCAGG TCGCGTTCAG CCAGAACACG
CGCTGGCCGA GCCTCGACCT CGATCGCGCC GAAGGCTGCA TCCGCTCGTA CGAGCATGCG
TTCTCGAAGG AAGGCGGCCT CGCCGTGCTG ACGGGCAACA TCGCGCTCGA CGGCTGCGTG
GTGAAGACGG CCGGCGTCGA CGAGAGCATT CTCGTGTTCG AAGGCACGGC GCACGTGACC
GAATCGCAGG ACGAAGCAGT CGAAAACATC CTGAACGACA AGGTCAAGGC GGGCGACGTG
GTGATCGTGC GCTACGAAGG CCCGAAGGGT GGCCCCGGCA TGCAGGAAAT GCTCTACCCG
ACCAGCTACA TCAAGTCGAA GGGCCTCGGC AAGGCATGCG CGCTGCTGAC GGACGGCCGT
TTCTCGGGCG GCACGTCGGG CCTGTCGATC GGCCATTGCT CGCCGGAAGC GGCAGCGGGC
GGCGCGATCG GCCTCGTGCG CGACGGCGAC AAGATCCGCA TCGACATCCC GAACCGCACG
ATCAACGTGC TGGTGTCGGA CGAGGAACTG GCGCGCCGCC GCGAAGAGCA GAACGCGAAG
GGCTGGAAGC CGGCGCAACC GCGTCCGCGC AAGGTGTCCG CTGCGCTGAA GGCCTACGCG
AAGCTGGTCA TGTCCGCCGA CAAGGGGGCC GTGCGCGACC TGTCGCTGCT CGACGACTGA
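
The gene sequence is laid out in blocks of 10 bases, 60 per line, so the spacing has to be stripped before the sequence is used elsewhere. A minimal sketch of that cleanup, plus a GC-percentage helper that could be used to compare against the GC content field, might look like the following. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence as displayed in the record.
first_line = "ATGCCCACTT ACCGTTCCAA AACCTCCACC GCAGGCCGCA ACATGGCAGG TGCGCGCTCG"
seq = clean_sequence(first_line)
print(len(seq))  # prints: 60
```

Passing the full 31-line sequence through `clean_sequence` and then `gc_percent` would give the value to compare with the listed GC content.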
 
Protein sequence
MPTYRSKTST AGRNMAGARS LWRATGMKDD DFSKPIIAVV NSFTQFVPGH VHLKDLGQLV 
AREIEAAGGV AKEFNTIAVD DGIAMGHDGM LYSLPSRDII ADSVEYMVNA HCADAMVCIS
NCDKITPGML MAAMRLNIPV IFVSGGPMEA GKTRLANPVT KAIEVKKLDL VDAMVIAVDP
SYSDAEVAEV ERSACPTCGS CSGMFTANSM NCLTEALGLS LPGNGTVVAT HADREQLFKR
AGRRIVELTR QHYEQDDERV LPRSVGFKAF ENAMTLDIAM GGSTNTILHL LAIAQEAGID
FTMKDIDRLS RVVPQLCKVA PNTNKYHIED VHRAGGIMAI LGELDRAGKL HTDVPTVHTP
SLKDALDQWD IVRTQDDAVR TFYQAGPAGV PTQVAFSQNT RWPSLDLDRA EGCIRSYEHA
FSKEGGLAVL TGNIALDGCV VKTAGVDESI LVFEGTAHVT ESQDEAVENI LNDKVKAGDV
VIVRYEGPKG GPGMQEMLYP TSYIKSKGLG KACALLTDGR FSGGTSGLSI GHCSPEAAAG
GAIGLVRDGD KIRIDIPNRT INVLVSDEEL ARRREEQNAK GWKPAQPRPR KVSAALKAYA
KLVMSADKGA VRDLSLLDD
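
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below builds the codon table from the NCBI base ordering (TCAG); translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as starts. It is an illustration under those assumptions, not the pipeline used to produce this record, and `translate` is a hypothetical helper.

```python
from itertools import product

# Codon table in NCBI order: first, second, third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence; trailing incomplete codon is ignored.
print(translate("ATGCCCACTT ACCGTTCCAA"))  # prints: MPTYRS
```

The output matches the first six residues of the protein sequence, and the final nine bases of the gene (GACGACTGA) translate to the terminal DD followed by a stop codon.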