Gene BURPS668_A0856 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A0856 
Symbol 
ID4887987 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp833983 
End bp834984 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content69% 
IMG OID640130796 
Productdehydrogenase 
Protein accessionYP_001061855 
Protein GI126442982 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value0.944789 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGACGA AGATGCATGC ATGGAGCGCG CAACACGTGC CGCCGCAGGG CGGAAAAGTC 
GCGGTCGTCA CGGGGGCCAA CAGCGGCCTC GGCTGGCAGA TCGCGCAAAC GCTCGCCGCC
AAGGGCGCGC AAGTCGTGAT GGGCTGCCGG GATACGGCCA AGGGCGAACT GGCCGCGCAT
GCGATCCGCA CCCGCTATCC GCGCGCCCGA ATCGAAGTCG AGGCGCTCGA TCTCGCCGAC
CTCGCCAGCG TCTGCCGTTT CGCCGACGCC GTCGCCGATC GCCACGGCCG CGTCGACATT
CTCTGCAACA ACGCGGGCGT GATGTTCCTG CCGCTGCGCC ACACGCGCGA TGGCTTCGAA
ATGCAGATGG GCACGAACCA CCTCGGCCAC TTCGCGTTGA CGGGGCTGTT GCTGCCCGCG
TTGCGCGCAT CGCACCGCGC GCGCGTCGTG ACGATGTCGA GCGGCTTCAA CCGGCTCGGC
AAGATCCGCC TCGACAACAT GCTCGCCGAG CGCGGCTACA ACAAGTACCG CGCGTATTGC
GACAGCAAGC TCGCGAACCT GATGTTCACG CTCGAGCTGC AGCGCCGCTT CGATCAAGCG
TGCCTGCCGA TCCTGAGCGT GGCCGCGCAC CCCGGCTATG CGGCCACCCA CCTGCAGTTC
GCGGGCCCCG AAATGGCGAA CTCGTCGCTC GGCACGTTCG CGATGCGCCT GTCGAACCGG
CTCGTCGCCC AATCGGCCGA TGTCGGCGCG CTGCCCGCGA TCCATGCGGC GACGGCGGTC
GACGTCGACG GCGGCGCATA CATCGGCCCG GCCCATCTCT GCGAGACGCG CGGCTATCCC
GCCGAGGCAC GCATCCCGCG TCAGGCGCGC GACGTGCGCA TGGGCAAGCG CCTGTGGGAA
AAATCCGAGC AACTGACCGG CGTGCGCTAT CTCGACACGC CGCCGCCGCC CGGTTCGCGC
CGCCGCGCAT CGCGCGACGA CGCGACGTTC GGCGCGCTCT GA
 
Protein sequence
METKMHAWSA QHVPPQGGKV AVVTGANSGL GWQIAQTLAA KGAQVVMGCR DTAKGELAAH 
AIRTRYPRAR IEVEALDLAD LASVCRFADA VADRHGRVDI LCNNAGVMFL PLRHTRDGFE
MQMGTNHLGH FALTGLLLPA LRASHRARVV TMSSGFNRLG KIRLDNMLAE RGYNKYRAYC
DSKLANLMFT LELQRRFDQA CLPILSVAAH PGYAATHLQF AGPEMANSSL GTFAMRLSNR
LVAQSADVGA LPAIHAATAV DVDGGAYIGP AHLCETRGYP AEARIPRQAR DVRMGKRLWE
KSEQLTGVRY LDTPPPPGSR RRASRDDATF GAL