Gene BURPS1106A_1083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_1083 
Symbol 
ID4902649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp1065183 
End bp1066622 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content73% 
IMG OID640134313 
Productaldehyde dehydrogenase (NAD) family protein 
Protein accessionYP_001065363 
Protein GI126452887 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGAAG CGAAGCACTT CATCGCGGGC GCATGGGCGC CGCCCGCGGG CGGCGAGACG 
ATCGCCGTGA TCGACCCGTC CGACGGCGAG CCGTTCGCGC GGCTCGCGCG CGGCACCGCG
CCCGATGTCG GCGCGGCCGT GCAAGCGGCG CGCGCCGCGT TCGACGGCTC GTGGGGCGCG
CTCGGCGCGG CCGACCGCGG GCGCATGCTG TACCGGCTGT CGATGCTCGT CGCCGCGTGC
CGCGAGGAGC TCGCGCTCAT CGAATCGCGC GACACCGGCA AGCCGCTCAC GCAGGCGCGC
GCGGACGCCG ACGCGCTCGC CCGCTACCTC GAGTTCTACG CGGGCGCGGC CGACAAGCTG
CACGGCGAGA CGCTGCCCTA CCGCGACGGC TACACGGTAC TCACGCTGCG CGAGCCGCAC
GGCGTGACGG GCCACATCGT GCCGTGGAAT TATCCGATGC AGATCCTCGG GCGCAGCGTC
GGCGCGGCGC TCGCCGCGGG CAACGCGTGC GTCGTCAAGC CCTCGGAGGA CGCGTGCCTG
TCGATCCTGC GCGTCGCCAC GCTCGCCGCC GAAGCCGGGC TGCCCGAGGG CGCGTTCAAC
GTCGTGACGG GCTACGGCCA CGAAGCGGGC GCGGCGCTCG CGCGCCATCC CGGCGTCGAT
CACCTGTCGT TCACCGGTTC GCCGGATACA GGCCGCCTCG TCGCGCAGAT GGCGGCCGAG
CACCACGCGA GCGTCACGCT CGAGCTCGGC GGCAAGTCGC CGCAGATCGT GTTCGCCGAC
GCGGATCTCG ACGCGGCATT GCCCGTCCTC GTGTCCGCGA TCGTCCAGAA CGCCGGCCAG
ACCTGTTCGG CCGGCAGCCG CGTGCTGATC GACAAGGCGG TCTACGAGCC GCTCGTCGAG
CGGCTCGCGA CCGCGTTCAA CGGGCTGAAG GTCGGCCCCG GCCGCGCCGA TCTCGATTGC
GGGCCGCTCA TCAACGCGAA GCAGCAGCAG CGCGTGTGGG ACTTCCTCTC CGATGCGCAG
CACGACGGCA TCACGATGGC CGCGCACGGC CAGGTCGTGC CGGACGCGCC CGAAACGGGC
TTCTACCAGG CGCCCGCGCT GCTTCGCGAC GTGCCGCACA CGCACCGGCT CGCACAGGAG
GAAGTGTTCG GGCCGGTGCT CGCCGCGATG CCGTTCGCCG ACGAGGACGA AGCGCTCGCG
CTCGCGAACG GCACGCCGTT CGGGCTCGTC GCCGGCATCT GGACGCGCGA CGGCGCGCGG
CAGATGCGGC TCGCACGCAA GGTGCGCGCG GGGCAGGTGT TCGTCAACAA CTACGGCGCG
GGCGGCGGCG TCGAGTTGCC GTTCGGCGGT ACCGGGCGCT CGGGCTACGG CCGCGAGAAG
GGCTTCGAGG CGCTGTACGG CTTCACCGTG CTGAAGACGA TCGCGCTGCG GCACGGCTGA
 
Protein sequence
MEEAKHFIAG AWAPPAGGET IAVIDPSDGE PFARLARGTA PDVGAAVQAA RAAFDGSWGA 
LGAADRGRML YRLSMLVAAC REELALIESR DTGKPLTQAR ADADALARYL EFYAGAADKL
HGETLPYRDG YTVLTLREPH GVTGHIVPWN YPMQILGRSV GAALAAGNAC VVKPSEDACL
SILRVATLAA EAGLPEGAFN VVTGYGHEAG AALARHPGVD HLSFTGSPDT GRLVAQMAAE
HHASVTLELG GKSPQIVFAD ADLDAALPVL VSAIVQNAGQ TCSAGSRVLI DKAVYEPLVE
RLATAFNGLK VGPGRADLDC GPLINAKQQQ RVWDFLSDAQ HDGITMAAHG QVVPDAPETG
FYQAPALLRD VPHTHRLAQE EVFGPVLAAM PFADEDEALA LANGTPFGLV AGIWTRDGAR
QMRLARKVRA GQVFVNNYGA GGGVELPFGG TGRSGYGREK GFEALYGFTV LKTIALRHG