Gene BURPS1106A_A0642 details

Gene Information

Locus tag: BURPS1106A_A0642
Symbol:
ID: 4904739
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 624079
End bp: 625569
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 69%
IMG OID: 640143748
Product: aldehyde dehydrogenase (NAD) family protein
Protein accession: YP_001074678
Protein GI: 126457646
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
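
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the helper names `gene_length` and `protein_length` are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon. Both assumptions agree with the numbers in this record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(624079, 625569)
print(length)                  # 1491
print(protein_length(length))  # 496
```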


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACAAGA CCACTTTGGC TGACTGGCAG GACAAGGCCG CGACGCTCGC GATCGAGGGG 
CGCGCATTCA TCGACGGCGC GTATCGCGAC GCGCACGGCG GCAAGACCTT CGATTGCGTG
AGCCCGATCG ACGGGCGCGT GCTCGCGAAG GTCGCCGATT GCGGCGCGGC CGATGTCGAC
GCGGCGGTGG CCGCCGCGCG GCGCGCGTTC GACGCGCAGG CGTGGGCGGG CCTGAACCCG
CGCGAGCGCA AGGCGATCCT GCTGCGCTGG GCCGCGCTGA TGCGCGCGCA TCTCGACGAG
CTGGCGCTGC TCGAGACGCT CGACGCGGGC AAGCCGATCG GCGACACGAC GAGCGTCGAC
GTGCCGGGCG CCGCGTACTG CGTCGAATGG TTCGCCGAGG CGATCGACAA GGTGGGCGGC
GAAGTGGTGC CCGCCGATCA TCATCTCGTC GGCCTCGTCA CGCGCGAGCC GCTCGGCGTC
GTCGCCGCCG TCGTGCCGTG GAATTTTCCG ATCCTGATGG CGTCGTGGAA GTTCGGCCCG
GCGCTCGCCG CGGGCAACAG CGTCGTGCTC AAGCCGTCGG AGAAATCGCC GCTCACGGCG
ATCAGGGTCG CGCGGCTCGC GCACGAGGCG GGGATTCCGG CCGGCGTGTT CAACGTCGTG
CCGGGCGGCG GCGAGCCGGG CAAGCTGCTC GCGCTGCATC GCGACGTCGA CTGTCTCGCG
TTCACCGGCT CCACGGGTGT CGGCAAGCTG ATCATGCAGT ACGCGGGGCA ATCGAACCTG
AAGCGCGTGT GGCTCGAGCT GGGCGGCAAG TCGCCGAACA TCGTGCTGCC CGACTGCCCG
GATCTCGACC GCGCGGCGAA GGCGGCGGCG GGCGCGATCT TCTACAACAT GGGCGAGATG
TGCACGGCGG GATCGCGCCT GCTCGTGCAC CGCGAGATCA AGGACGCGTT CGTCGAAAAG
CTCGTCGCCG CGGCGCGCGC GTACAAGCCG GGCAATCCGC TCGATCCGAA CGTGTCGATG
GGCGCGATCG TCGACGCGAT CCAGCTCGAG CGCGTGCTCG GCTACATCGA GGCGGGCCGC
GCCGAAGCGC GGCTGCTGCT CGGCGGCGCG CGCGTGAACG AGGCGAGCGG CGGCTTCTAC
ATCGAGCCGA CCGTGTTCGA CACCGCGCCC GACACGCGGA TCGCGCGCGA GGAAATCTTC
GGTCCGGTGC TGTCGATGAT CACGTTCGAT TCGGTCGACG AAGCGGTGAG GATCGCGAAC
GACAGCGAAT ACGGGCTCGG CGCGGCCGTG TGGACCGCGA ACCTGACGAC CGCGCACGAA
CTCGCGCGGC GGTTGCGCGC GGGCACCGTG TGGGTCAACT GCTACGACGA AGGGGGCGAC
ATGAACTTCC CGTTCGGCGG CTACAAGCAA TCGGGCAACG GCCGCGACAA GTCGTTGCAC
GCACTGGAGA AGTACACCGA GCTGAAGTCC ACGCTCGTGC GGCTGCGCTA A
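
The gene sequence is printed in space-separated blocks of ten bases, so any calculation on it has to strip that whitespace first. As a minimal sketch, the snippet below shows one way to compute a GC percentage such as the GC content field of this record. The helper names `clean_sequence` and `gc_content` are hypothetical, and the example inputs are short fragments, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First ten-base block of the gene sequence above: 3 G + 1 C out of 10 bases.
print(gc_content("ATGGACAAGA"))  # 40.0
```

Passing the whole gene sequence, line breaks included, to `gc_content` would give a figure to compare against the reported GC content.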
 
Protein sequence
MDKTTLADWQ DKAATLAIEG RAFIDGAYRD AHGGKTFDCV SPIDGRVLAK VADCGAADVD 
AAVAAARRAF DAQAWAGLNP RERKAILLRW AALMRAHLDE LALLETLDAG KPIGDTTSVD
VPGAAYCVEW FAEAIDKVGG EVVPADHHLV GLVTREPLGV VAAVVPWNFP ILMASWKFGP
ALAAGNSVVL KPSEKSPLTA IRVARLAHEA GIPAGVFNVV PGGGEPGKLL ALHRDVDCLA
FTGSTGVGKL IMQYAGQSNL KRVWLELGGK SPNIVLPDCP DLDRAAKAAA GAIFYNMGEM
CTAGSRLLVH REIKDAFVEK LVAAARAYKP GNPLDPNVSM GAIVDAIQLE RVLGYIEAGR
AEARLLLGGA RVNEASGGFY IEPTVFDTAP DTRIAREEIF GPVLSMITFD SVDEAVRIAN
DSEYGLGAAV WTANLTTAHE LARRLRAGTV WVNCYDEGGD MNFPFGGYKQ SGNGRDKSLH
ALEKYTELKS TLVRLR
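
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the method used by the database: the `translate` helper is hypothetical, it uses the amino acid assignments of translation table 11 (which match the standard code), it ignores alternative start codons, and it assumes the sequence as printed is already the coding strand.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
# Translation table 11 uses the same assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(raw: str) -> str:
    """Translate a coding sequence; whitespace is ignored, '*' marks a stop codon."""
    seq = "".join(raw.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGGACAAGA CCACTTTGGC TGACTGGCAG"))  # MDKTTLADWQ
# The last four codons end in the TAA stop codon.
print(translate("CGGCTGCGCT AA"))                     # RLR*
```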