Gene BURPS668_A1742 details


Gene Information

Locus tag: BURPS668_A1742
Symbol:
ID: 4886349
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 1688183
End bp: 1689217
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 70%
IMG OID: 640131680
Product: D-xylulose reductase
Protein accession: YP_001062737
Protein GI: 126443001
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID: [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase
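
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names gene_length and protein_length are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record: Start bp 1688183, End bp 1689217.
length_bp = gene_length(1688183, 1689217)
print(length_bp, protein_length(length_bp))  # prints: 1035 344
```

Both results match the Gene Length (1035 bp) and Protein Length (344 aa) fields of the record.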


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.247806
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAAAGCCC TCGTACTCGA AAAGGCGCGC GAGCTCGCGC TGCGCGACAT CGATCTGCCG 
CTCGAGGTCG GCCCCGCCGA CGTCAAGATC CGCGTCCACA CGGTGGGCGT GTGCGGCAGC
GATGTCCATT ACTACATGCA CGGCGGAATC GGCCCGTTCC GGGTGGACGC GCCGATGGTG
CTCGGCCACG AGGCGTCGGG CACGGTCGTC GAGGTCGGGC GCGACGTCAC GTACCTGCGC
GTGGGCGAGC GCGTGTGCAT GGAGCCCGGC GTGCCGCGCT TCGATTCGAA AGCGACGCTG
CACGGCCTGT ACAACCTCGA TCCGTCGGTG CGCTTCTGGG CGACGCCGCC CGTGCACGGG
TGCCTCGCGC CCTACGTCGT GCACCCGGCG GCGTTCACGT ACCGGCTGCC GCCGAACGTG
TCGTTCGCCG AGGGCGCGAT CGTCGAGCCG CTCGCGATCG GGCTGCAGGC CGCGAAGAAG
GCGGCGATGA AGCCGGGCGA CATCGCGGTG GTCGTCGGCG CGGGCACGAT CGGCGCGATG
ACGGCGCTCG CCGCGCTCGC GGGCGGCGCC GCGCGCGTGA TCCTCGCCGA CGTCGTCAAG
GAAAAGCTCG CGCTGTTCGC CGCGAACCGC GCGGTGACGA CGGTCGACGC GAGCACGCAG
TCGCTCGCCG ACGCGGTCGC GCACGCGACC GACGGCTGGG GCGCGGACGT GGTGTTCGAG
GCGAGCGGCA ACGCGAACGC GTATGCGGGC ATCGTCGATC TGCTGTGCCC GAACGGCTGC
CTCGTGCTGG TCGGCATGCC GCTCGCGCCG GTGCCGCTCG ATGTCGTTTC GCTGCAGGCG
AAGGAGGCGC GCATCGAATC GGTGTTCCGC TACGCGAACG TCTTCCCGCG CGCGCTCGCG
CTGATCGCCT CCGGCGCGAT CGACGTGAAG CCGTTCATCT CGCGCACGTT CCCGTTCTCG
GACGGCCTGC GCGCATTCGA GGCGGCGGCG AGCAGCCAGC CGCACGACGT GAAGATTCAG
ATCGAAATGG ATTGA
 
Protein sequence
MKALVLEKAR ELALRDIDLP LEVGPADVKI RVHTVGVCGS DVHYYMHGGI GPFRVDAPMV 
LGHEASGTVV EVGRDVTYLR VGERVCMEPG VPRFDSKATL HGLYNLDPSV RFWATPPVHG
CLAPYVVHPA AFTYRLPPNV SFAEGAIVEP LAIGLQAAKK AAMKPGDIAV VVGAGTIGAM
TALAALAGGA ARVILADVVK EKLALFAANR AVTTVDASTQ SLADAVAHAT DGWGADVVFE
ASGNANAYAG IVDLLCPNGC LVLVGMPLAP VPLDVVSLQA KEARIESVFR YANVFPRALA
LIASGAIDVK PFISRTFPFS DGLRAFEAAA SSQPHDVKIQ IEMD
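
The sequences above are printed in blocks of 10 characters, 60 per row, so the spacing has to be removed before any downstream use. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; clean_sequence and gc_fraction are hypothetical helper names, and only the first row of the gene sequence is used as sample input. Note that the gene starts with TTG, which translation table 11 permits as an alternative initiation codon read as methionine, consistent with the leading M of the protein sequence.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for 10-character block layout."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "TTGAAAGCCC TCGTACTCGA AAAGGCGCGC GAGCTCGCGC TGCGCGACAT CGATCTGCCG"
seq = clean_sequence(first_row)
print(len(seq))                    # prints: 60
print(round(gc_fraction(seq), 2))  # GC fraction of this first row only
```

Applying the same two functions to all rows of the gene sequence joined together should give a value comparable to the GC content field of the record.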