Gene BURPS1710b_A2026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A2026 
Symbol 
ID3692789 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp2462973 
End bp2464463 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content69% 
IMG OID637732280 
Productaldehyde dehydrogenase family protein 
Protein accessionYP_337177 
Protein GI76818933 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACAAGA CCACTTTGGC TGACTGGCAG GACAAGGCCG CGACGCTCGC GATCGAGGGG 
CGCGCATTCA TCGACGGCGC GTATCGCGAC GCGCACGGCG GCAAGACCTT CGATTGCGTG
AGCCCGATCG ACGGGCGCGT GCTCGCGAAG GTCGCCGATT GCGGCGCGGC CGATGTCGAC
GCGGCGGTGG CCGCCGCGCG GCGCGCGTTC GACGCGCAGG CGTGGGCGGG CCTGAACCCG
CGCGAGCGCA AGGCGATCCT GCTGCGCTGG GCCGCGCTGA TGCGCGCGCA TCTCGACGAG
CTGGCGCTGC TCGAGACGCT CGACGCGGGC AAGCCGATCG GCGACACGAC GAGCGTCGAC
GTGCCGGGCG CCGCGTACTG CGTCGAATGG TTCGCCGAGG CGATCGACAA GGTGGGCGGC
GAAGTGGTGC CCGCCGATCA TCATCTCGTC GGCCTCGTCA CGCGCGAGCC GCTCGGCGTC
GTCGCCGCCG TCGTGCCGTG GAATTTTCCG ATCCTGATGG CGTCGTGGAA GTTCGGCCCG
GCGCTCGCCG CGGGCAACAG CGTCGTGCTC AAGCCGTCGG AGAAATCGCC GCTCACGGCG
ATCAGGGTCG CGCGGCTCGC GCACGAGGCG GGGATTCCGG CCGGCGTGTT CAACGTCGTG
CCGGGCGGCG GCGAGCCGGG CAAGCTGCTC GCGCTGCATC GCGACGTCGA CTGTCTCGCG
TTCACCGGCT CCACGGGTGT CGGCAAGCTG ATCATGCAGT ACGCGGGGCA ATCGAACCTG
AAGCGCGTGT GGCTCGAGCT GGGCGGCAAG TCGCCGAACA TCGTGCTGCC CGACTGCCCG
GATCTCGACC GCGCGGCGAA GGCGGCGGCG GGCGCGATCT TCTACAACAT GGGCGAGATG
TGCACGGCGG GATCGCGCCT GCTCGTGCAC CGCGAGATCA AGGACGCGTT CGTCGAAAAG
CTCGTCGCCG CGGCGCGCGC GTACAAGCCG GGCAATCCGC TCGATCCGAA CGTGTCGATG
GGCGCGATCG TCGACGCGAT CCAGCTCGAG CGCGTGCTCG GCTACATCGA GGCGGGCCGC
GCCGAAGCGC GGCTGCTGCT CGGCGGCGCG CGCGTGAACG AGGCGAGCGG CGGCTTCTAC
ATCGAGCCGA CCGTGTTCGA CACCGCGCCC GACACACGGA TCGCGCGCGA GGAAATCTTC
GGCCCGGTGC TGTCGATGAT CACGTTCGAT TCGGTCGACG AAGCGGTGAG GATCGCGAAC
GACAGCGAAT ACGGGCTCGG CGCGGCCGTG TGGACCGCGA ACCTGACGAC CGCGCACGAA
CTCGCGCGGC GGTTGCGCGC GGGCACCGTG TGGGTCAACT GCTACGACGA AGGGGGCGAC
ATGAACTTCC CGTTCGGCGG CTACAAGCAA TCGGGCAACG GCCGCGACAA GTCGTTGCAC
GCACTGGAGA AGTACACCGA GCTGAAGTCC ACGCTCGTGC GGCTGCGCTA A
 
Protein sequence
MDKTTLADWQ DKAATLAIEG RAFIDGAYRD AHGGKTFDCV SPIDGRVLAK VADCGAADVD 
AAVAAARRAF DAQAWAGLNP RERKAILLRW AALMRAHLDE LALLETLDAG KPIGDTTSVD
VPGAAYCVEW FAEAIDKVGG EVVPADHHLV GLVTREPLGV VAAVVPWNFP ILMASWKFGP
ALAAGNSVVL KPSEKSPLTA IRVARLAHEA GIPAGVFNVV PGGGEPGKLL ALHRDVDCLA
FTGSTGVGKL IMQYAGQSNL KRVWLELGGK SPNIVLPDCP DLDRAAKAAA GAIFYNMGEM
CTAGSRLLVH REIKDAFVEK LVAAARAYKP GNPLDPNVSM GAIVDAIQLE RVLGYIEAGR
AEARLLLGGA RVNEASGGFY IEPTVFDTAP DTRIAREEIF GPVLSMITFD SVDEAVRIAN
DSEYGLGAAV WTANLTTAHE LARRLRAGTV WVNCYDEGGD MNFPFGGYKQ SGNGRDKSLH
ALEKYTELKS TLVRLR