Gene BURPS668_A1779 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A1779 
Symbol 
ID4886540 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp1725938 
End bp1727095 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content71% 
IMG OID640131717 
Productiron-containing alcohol dehydrogenase 
Protein accessionYP_001062774 
Protein GI126444714 
COG category[C] Energy production and conversion 
COG ID[COG1979] Uncharacterized oxidoreductases, Fe-dependent alcohol dehydrogenase family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGAATT TCGATTTCTA CAACCCGACC CGGATCGTCT TCGGCGAAAA GACGGCCGCG 
CGGCTGAACG ATCTGCTGCC GGCGGCGGCC CGCGTGCTCG TGCTGTACGG CGGCGAGAGC
GCGCGCAGCA ACGGCACGCT CGACGAGGTC CGCGCCGCGC TCGGCGCGCG CGACGTGCGC
GAGTTCGGCG GGATCGAGCC GAACCCGGCC TACGAGACGC TGATGCGGGC GGTCGAGCTC
GCGCGGCGCG AGCGTGTGGA TTTCCTGCTC GCGGTCGGCG GCGGCTCGGT GATCGACGGC
ACGAAGTTCG TCGCGGCCGC GGTGCCGTTC GAGGGCGATC CGTGGACCAT CCTCGAGACG
CACGGCGCGA ACGTCGCGGC GGCGCTGCCG TTCGGCTGCG TGCTGACGCT GCCCGCGACG
GGCTCGGAGA TGAACAACGG CGCGGTCCTC ACGCGCCGCG CGACGCGCGC GAAGCTCGCG
TTCCGCCATC CGCTCGTGTT TCCGACGTTC TCGATTCTGG ACCCGACGAA GACCTACACG
CTGCCGCCGC GGCAGGTGGC GAACGGCGTC GTCGACGCGT TCACGCACAT CGTCGAGCAG
TACCTGACGT ATCCGGCCGA CGGCCTCGCG CAGGACCGCT TCGCCGAGGG CCTGCTGCAG
ACGCTGATCG AGATCGGCCC GAAGGCCTTG GCCGAGCCGC GCGACTATGC GACGCGCGCG
AACCTGATGT GGGTCGCGAC GCTCGCGCTG AACGGCCTGA TCGGCGCGGG CGTGCCGCAG
GACTGGGCGA CGCACATGGT CGGGCACGAG CTCACCGCGC GCTACGACAT CGACCATGCG
CGCACGCTCG CCGTCGTGCT GCCGTCGATG CTCGACGCGC GCCGCGACGC GAAGCGCGCA
AAGCTGCTGC AATACGCGGC GCGCGTCTGG AACATCGTCG ACGGCCCCGA GGACGCGCGC
ATCGACGCGG CGATCGCGCG CACGCGCGCG TTCTTCGAAA GCCTCGGCGT GAAGACCCGC
CTCGCCGATT ACGGCGTGGG CGCCGATGCG ATCGACGGCC TGATCGCGCA ACTCGAGGCG
CACGGGATGA CGCGACTCGG CGAGCGCAAG GACGTCACGC TCGACGTGAG CCGCCGCGTG
CTCGAGGCCA GCCTGTGA
 
Protein sequence
MLNFDFYNPT RIVFGEKTAA RLNDLLPAAA RVLVLYGGES ARSNGTLDEV RAALGARDVR 
EFGGIEPNPA YETLMRAVEL ARRERVDFLL AVGGGSVIDG TKFVAAAVPF EGDPWTILET
HGANVAAALP FGCVLTLPAT GSEMNNGAVL TRRATRAKLA FRHPLVFPTF SILDPTKTYT
LPPRQVANGV VDAFTHIVEQ YLTYPADGLA QDRFAEGLLQ TLIEIGPKAL AEPRDYATRA
NLMWVATLAL NGLIGAGVPQ DWATHMVGHE LTARYDIDHA RTLAVVLPSM LDARRDAKRA
KLLQYAARVW NIVDGPEDAR IDAAIARTRA FFESLGVKTR LADYGVGADA IDGLIAQLEA
HGMTRLGERK DVTLDVSRRV LEASL