Gene BURPS668_A2784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A2784 
Symbol 
ID4887192 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp2657019 
End bp2658044 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content68% 
IMG OID640132720 
Productzinc-binding dehydrogenase family oxidoreductase 
Protein accessionYP_001063776 
Protein GI126442465 
COG category[R] General function prediction only 
COG ID[COG1064] Zn-dependent alcohol dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value0.95808 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCAAT ACATGAAGGC CGCGGTGGTG CATGCATTCG GCGAACCGCT TCGGATCGAG 
GAGGTGCCCG TGCCGACGCC CGGCGCGGGG CAGATTCTCG TGAACGTCAA GGCATCGGGC
GTGTGCCATA CCGATCTGCA CGCGGCCGAC GGCGACTGGC CCGTCAAGCC GACGCTGCCG
TTCATTCCGG GGCACGAGGG CGTCGGCTTC GTCGCGGCGG TGGGCGAAGG CGTGAGGCAC
GTGAAGGAGG GCGATCGCGT CGGCGTGCCT TGGCTCTATA CCGCGTGCGG CCATTGCGAG
TATTGCCAGA CCGGCTGGGA GACGCTGTGC CACGAGCAGC AGAACACCGG CTATTCGGTG
AACGGCAGCT ACGCGGAATA CGTGCTCGCC GATCCGAACT ACGTCGGCCA TCTGCCGAGC
AACGTCGCGT TCGACGAGAT CGCGCCGATC CTGTGCGCGG GCGTGACCGT CTACAAGGGC
ATTCGGGTGA CCGACACGCG CCCGGGGCAA TGGATCGCGA TCTCGGGGAT CGGCGGGCTC
GGGCACGTCG CGGTGCAGTA CGCGAAGGCG ATGGGGCTGC ACGTGGTCGC GGTGGACGTC
GCGCCGCAGA AGCTCGAGCT TGCGCGCAAG CTGGGCGCGG CGTTCGTCGT CGATGCGTCG
AAGGACGATC CGGCGGCGGT GATCCAGAAG GAGATCGGCG GCGTGCACGG CGTGCTCGTG
ACGGCCGTGT CGCGCGGCGC GTTCGCGCAG GCGCTCGGCA TGGTGAGGCG CGGCGGGACG
GTCTCGCTGA ACGGGCTGCC GCCGGGCGAT TTTCCGCTGC CGATCTTCTC GACGGTGCTC
AACGGGATCA CGGTGCGAGG CTCGATCGTC GGCACGCGGC GCGATCTCCA GGAATCGCTC
GATTTCGCGG CCGAAGGGCT CGTGCGCGCG CATATCCATC GCGACAAGCT CGAGCACATC
AACGGCGTGT TCTCGGCGCT GCGCGAAGGG AAGGTCGACG GGCGGATCGT GTTGACCGGG
CAATGA
 
Protein sequence
MAQYMKAAVV HAFGEPLRIE EVPVPTPGAG QILVNVKASG VCHTDLHAAD GDWPVKPTLP 
FIPGHEGVGF VAAVGEGVRH VKEGDRVGVP WLYTACGHCE YCQTGWETLC HEQQNTGYSV
NGSYAEYVLA DPNYVGHLPS NVAFDEIAPI LCAGVTVYKG IRVTDTRPGQ WIAISGIGGL
GHVAVQYAKA MGLHVVAVDV APQKLELARK LGAAFVVDAS KDDPAAVIQK EIGGVHGVLV
TAVSRGAFAQ ALGMVRRGGT VSLNGLPPGD FPLPIFSTVL NGITVRGSIV GTRRDLQESL
DFAAEGLVRA HIHRDKLEHI NGVFSALREG KVDGRIVLTG Q