Gene BURPS668_A0459 details


Gene Information

Locus tag: BURPS668_A0459
Symbol:
ID: 4888048
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 419751
End bp: 420887
Gene Length: 1137 bp
Protein Length: 378 aa
Translation table: 11
GC content: 69%
IMG OID: 640130400
Product: zinc-binding dehydrogenase family oxidoreductase
Protein accession: YP_001061465
Protein GI: 126442691
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID: [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase
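
The coordinates and lengths listed in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The variable names are made up for this example and are not part of the record.

```python
# Values copied from the record above.
start_bp = 419751
end_bp = 420887
reported_gene_length_bp = 1137
reported_protein_length_aa = 378

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp == reported_gene_length_bp)        # True
print(protein_length_aa == reported_protein_length_aa)  # True
```

Under those two assumptions, the reported gene length (1137 bp) and protein length (378 aa) agree with the coordinates.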


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.479531
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCACAGA CAGCCGAAGC GCCGCCGCAA CGCGCGGCCG AGCGCGGCGA GACGCCCGCG 
CTGCCCGCCA CGATGCGCGC CGTGGTTTGC CACGGCCCGC GAGACTACCG CCTCGAGCAG
GTGCCGGTGC CGAAGCCGGG GCCGGACGAG ATCCTGACCC AGGTGGAGCG CGTGGGCATC
TGCATGGGCG ACATCAAGAC GTTTCGCGGC GCGCCGTCGT TCTGGGGCGA CGCGGTGCAG
CCGCGCTACG TGAAGCCGCC GATGATTCCC GGCCACGAAT TCGTGTGTCG CGTCGTCGCG
CTCGGCCCCG GCGCCGAGCG GCGCGGCGTG AAGGCGGGCG ATCGCGTGAT CTCCGAGCAG
ATCGTGCCGT GCTGGAGCTG CCGCTTCTGC GGCCACGGCC AGTACTGGAT GTGCCAGAAG
CACGATCTGT ACGGATTCCA GAACAACGTG CACGGCGCGA TGGCCGAATA CATGATCTTC
ACGAAGGAGG CGATCGTGCA CCGCGTGCCC GATTCGATCC CGACCGACGA GGCGATCCTG
ATCGAGCCGC TGTCGTGCTC GCTGCACGCG GCCGATCGCG CGAACGTCGG CTTCGACGAC
GTGGTCGTCG TCGCCGGCGC GGGCACGCTC GGGCTCGGCA TCATCGGCGC GGTGCGGCTG
CGCCATCCGA AGCAGCTGAT CGTGCTCGAC ATGAAGCCCG AGCGCGCGGC GCTCGCGCGC
CGGATGGGCG CGGACGACGT GTGGAACCCG GCCGAGGAGA ACGTGATCGA GAAGATCCGC
GCGATCACGG GCGGCTACGG CTGCGATATC TACATCGAGG CGACCGGCCA CCATCGCGCG
GTAGGCCAGG GGCTCGCGAT GCTGCGCAAG CTCGGGCGCT TCGTCGAGTT CAGCGTGTTC
AACGACGAAG CGAGCGTCGA CTGGTCGATC ATCGGCGATC GCAAGGAGCT CGACGTGCTC
GGCTCGCATC TCGGCCCGTA CATGTACCCG CGCGCGATCG AGTTCATCGC ATCGCGCAGG
ATCGACGTGC GCGGCATCGT CACGCACACG TTCCCGCTGT CGCGCTTCGC CGACGCGTTC
GCCGTGATGG AGCGCGGCGA GCAATCGTTG AAGGTCGTTC TGGATCCGCG AGGTTAA
 
Protein sequence
MPQTAEAPPQ RAAERGETPA LPATMRAVVC HGPRDYRLEQ VPVPKPGPDE ILTQVERVGI 
CMGDIKTFRG APSFWGDAVQ PRYVKPPMIP GHEFVCRVVA LGPGAERRGV KAGDRVISEQ
IVPCWSCRFC GHGQYWMCQK HDLYGFQNNV HGAMAEYMIF TKEAIVHRVP DSIPTDEAIL
IEPLSCSLHA ADRANVGFDD VVVVAGAGTL GLGIIGAVRL RHPKQLIVLD MKPERAALAR
RMGADDVWNP AEENVIEKIR AITGGYGCDI YIEATGHHRA VGQGLAMLRK LGRFVEFSVF
NDEASVDWSI IGDRKELDVL GSHLGPYMYP RAIEFIASRR IDVRGIVTHT FPLSRFADAF
AVMERGEQSL KVVLDPRG
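
The gene and protein sequences above are printed in blocks of ten characters. The following is a minimal sketch of how such a block could be cleaned, its GC fraction computed, and the gene translated. The function names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical helpers written for this example. The codon table is built from the standard amino acid assignments, which translation table 11 shares with table 1 for internal codons; alternative start codons are not handled here.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position), paired with the
# amino acid assignments used by translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases (0.0 for an empty sequence)."""
    seq = clean_sequence(seq)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean_sequence(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence listed above.
first_line = "ATGCCACAGA CAGCCGAAGC GCCGCCGCAA CGCGCGGCCG AGCGCGGCGA GACGCCCGCG"
print(translate(first_line))  # MPQTAEAPPQRAAERGETPA
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way would allow the listed protein sequence and the reported GC content to be checked against the nucleotide data.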