Gene BURPS1106A_A0492 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A0492 
Symbol 
ID4903472 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp482530 
End bp483984 
Gene Length1455 bp 
Protein Length484 aa 
Translation table11 
GC content69% 
IMG OID640143598 
Productaldehyde dehydrogenase (NAD) family protein 
Protein accessionYP_001074534 
Protein GI126457220 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR03250] putative phosphonoacetaldehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0820684 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGACGA TGCCGCTTCG CGATCACGCC GCCTACCGCC GCGAGACGCT GCGCTGGTGC 
GGGCAACGGG CGGCGCGTGC GCGCACGCTC GACGTATTCG ATCCGTATTC GGGGCTGCGC
GTCGGCACCG TGCCGCTCGC GACGGTCGAC GACGTGCGCC GCGCGTTCGA CTATGCGGTC
GCGTACCGGC CTGCGCTGTC GCGCTACGAG CGCTCGCAGA TCCTCGAGCG CGCGGCCGCA
CGCCTCACCG CGCGCGCGGA AGAGGCGTCG ACGCTGATCT CGCTCGAATC GGGGCTGTCG
AAACAGGATT CGCGCTACGA GATCGGCCGC GTCGCCGACG TGTTCAAGTT CGCGTCGATC
GAGGCGCTGC GCGACGATGC GCAGAGCTAC TCGTGCGATC TGACGCCGCA CGGCAAGTCG
CGCCGCGTGT TCTCGCAGCG CCAGCCGCTC GACGGCGTGA TCGTCGCGAT CACGCCGTTC
AATCATCCGA TGAACCAGGT CGCGCACAAG ATCGCGCCGG CGATCGCGAC GAACAACCGC
GTGATCGTGA AACCGTCGGA GAAGGTGCCG CTGTCGGCGC TGTATCTCGC CGACGTGCTG
TACGAAGCGG GCCTGCCCGA GCCGATGCTG CAGGTGCTGA CGGGCGATCC GCGCGAGATC
GCCGACGAAC TGCTCACGCA CCCGAGCGCG ACGCTGATCA CGTTCACGGG CGGCGTCGCG
ATCGGCAAGT CCATCGCGGC GAAGGCGGGC TACCGGCGCA TCGTGCTCGA GCTGGGCGGC
AACGATCCGC TGATCGTGCT CGACGATGCC GATCTCGAAC GCGCGGCCGC GCTCGCCGCG
CAGGGCTCGT ACAAGAACTC CGGCCAGCGC TGCACGGCGG TCAAGCGCAT CCTCGTGCAC
AAGCGCGTCG CGCCGCGCTT CACGGAGCTG CTCGTCGAGC ACACGCGCGC ATGGACGTAC
GGCGATCCGT TCGACCCGGC GAACCGGATG GGCACCGTGA TCGACGAGGC GGCGGCCGCG
CTGTTCGAGG CGCGCGTCGA CGAAGCGGTG GCGCAGGGCG CGCGCCTGCT GACGGGCAAC
GCGCGGCGCG GCGCGCTGTA TGCGCCGACA GTGCTCGATC GCGTCGACGC ATCGATGACG
CTCGTGCGCG AGGAGACGTT CGGCCCCGTG TCGCCGATCA TCGCGTTCGA TACGATCGAC
GACGCGATCC GCATCAGCAA CGGCACCGCG TTCGGCCTGT CGTCGGGCGT GTGCACGGAT
CGCGCCGACG CGATCGTGCG CTTCGTCAAC GAGCTGAACG TCGGCACGGT GAACGTATGG
GAAGTGCCGG GCTACCGGCT CGAGCTCACG CCGTTCGGCG GCATCAAGGA TTCCGGGCTC
GGATACAAGG AAGGGGTGCA GGAGGCGATG AAGAGCTTCA CGAACCTGAA GACGTTTTCT
TTGCCGTGGG CGTGA
 
Protein sequence
MTTMPLRDHA AYRRETLRWC GQRAARARTL DVFDPYSGLR VGTVPLATVD DVRRAFDYAV 
AYRPALSRYE RSQILERAAA RLTARAEEAS TLISLESGLS KQDSRYEIGR VADVFKFASI
EALRDDAQSY SCDLTPHGKS RRVFSQRQPL DGVIVAITPF NHPMNQVAHK IAPAIATNNR
VIVKPSEKVP LSALYLADVL YEAGLPEPML QVLTGDPREI ADELLTHPSA TLITFTGGVA
IGKSIAAKAG YRRIVLELGG NDPLIVLDDA DLERAAALAA QGSYKNSGQR CTAVKRILVH
KRVAPRFTEL LVEHTRAWTY GDPFDPANRM GTVIDEAAAA LFEARVDEAV AQGARLLTGN
ARRGALYAPT VLDRVDASMT LVREETFGPV SPIIAFDTID DAIRISNGTA FGLSSGVCTD
RADAIVRFVN ELNVGTVNVW EVPGYRLELT PFGGIKDSGL GYKEGVQEAM KSFTNLKTFS
LPWA