Gene BURPS1106A_A1653 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A1653 
Symbol 
ID4903824 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp1621377 
End bp1622411 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content70% 
IMG OID640144759 
Productputative D-xylulose reductase 
Protein accessionYP_001075687 
Protein GI126457208 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID[TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAAGCCC TCGTACTCGA AAAGGCGCGC GAGCTCGCGC TGCGCGACAT CGATCTGCCG 
CTCGAGGTCG GCCCCGCCGA CGTCAAGATC CGCGTCCACA CGGTGGGCGT GTGCGGCAGC
GATGTCCATT ACTACATGCA CGGCGGAATC GGCCCGTTCC GGGTGGACGC GCCGATGGTG
CTCGGCCACG AGGCGTCGGG CACGGTCGTC GAGGTCGGGC GCGACGTCAC GTACCTGCGC
GTGGGCGAGC GCGTGTGCAT GGAGCCCGGC GTGCCGCGCT TCGATTCGAA AGCGACGCTG
CACGGCCTGT ACAACCTCGA TCCGTCGGTG CGCTTCTGGG CGACGCCGCC CGTGCACGGG
TGCCTCGCGC CCTACGTCGT GCACCCGGCG GCGTTCACGT ACCGGCTGCC GCCGAACGTG
TCGTTCGCCG AGGGCGCGAT CGTCGAGCCG CTCGCGATCG GGCTGCAGGC CGCGAAGAAG
GCGGCGATGA AGCCGGGCGA CATCGCGGTG GTCGTCGGCG CGGGCACGAT CGGCGCGATG
ACGGCGCTCG CCGCGCTCGC GGGCGGCGCC GCGCGCGTGA TCCTCGCCGA CGTCGTCAAG
GAAAAGCTCG CGCTGTTCGC CGCGAACCGC GCGGTGACGA CGGTCGACGC GAGCACGCAG
TCGCTCGCCG ATGCGGTCGC GCACGCGACC GACGGCTGGG GCGCGGACGT GGTGTTCGAG
GCAAGCGGCA ACGCGAGCGC GTATGCGGGC ATCGTCGATC TGCTGTGCCC GAACGGCTGC
CTCGTGCTGG TCGGCATGCC GCTCGCGCCG GTGCCGCTCG ATGTCGTTTC GCTGCAGGCG
AAGGAGGCGC GCATCGAATC GGTGTTCCGC TACGCGAACG TCTTCCCGCG CGCGCTCGCG
CTGATCGCCT CCGGCGCGAT CGACGTGAAG CCGTTCATCT CGCGCACGTT CCCGTTCTCG
GACGGCCTGC GCGCATTCGA GGCGGCGGCG AGCGGCCAGC CGCACGACGT GAAGATTCAG
ATCGAAATGG ATTGA
 
Protein sequence
MKALVLEKAR ELALRDIDLP LEVGPADVKI RVHTVGVCGS DVHYYMHGGI GPFRVDAPMV 
LGHEASGTVV EVGRDVTYLR VGERVCMEPG VPRFDSKATL HGLYNLDPSV RFWATPPVHG
CLAPYVVHPA AFTYRLPPNV SFAEGAIVEP LAIGLQAAKK AAMKPGDIAV VVGAGTIGAM
TALAALAGGA ARVILADVVK EKLALFAANR AVTTVDASTQ SLADAVAHAT DGWGADVVFE
ASGNASAYAG IVDLLCPNGC LVLVGMPLAP VPLDVVSLQA KEARIESVFR YANVFPRALA
LIASGAIDVK PFISRTFPFS DGLRAFEAAA SGQPHDVKIQ IEMD