Gene BMASAVP1_A3320 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMASAVP1_A3320 
Symbol 
ID4680767 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei SAVP1 
KingdomBacteria 
Replicon accessionNC_008785 
Strand
Start bp3279121 
End bp3280740 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content67% 
IMG OID639847574 
Productaldehyde dehydrogenase family protein 
Protein accessionYP_994599 
Protein GI121599516 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACCGCG GCGCGGCGTG CTGGCACGGT TTTGGCGTAA CGAACGGCGT CGCGCACGTC 
GCGCGGCGCA TCACCGATAC CGACATGCGA GGAGCAACGA TGAATCACGC GGACATGCAA
CATCTGAACA TCGAATTCCC GTACCGCAAG CAGTACGGGA ATTTCATCGG CGGCGAATGG
GTCGCCCCGG TCGGCGGCGA GTATTTCGAC AACGTCTCGC CCGTCACCGG CCGGCCGTTC
ACCGCGATCC CTCGCTCGCG CGAAGCCGAC ATCGAGCTCG CGCTCGACGC CGCTCACGCG
GCCAAGGCGG GCTGGGCCGC GAAGGGCGCG GCCGAGCGCG CGAACGTGCT GCTGAGGATC
GCCGACCGGA TGGAGGCGAA CCTCACGCGC CTCGCCGTCG CCGAGACGAT CGACAACGGC
AAGCCGCTGC GCGAAACCAC CGCAGCCGAC GTGCCGCTCG CGATCGACCA CTTCCGCTAC
TTCGCGGGCT GCATCCGCGC GCAGGAAGGC TCGATCGCCG ATATCGGCGG CGACATGGTG
GCCTACCACT TCCACGAGCC GCTCGGCGTC GTCGGCCAGA TCATCCCGTG GAACTTCCCG
CTGCTGATGG CCGCGTGGAA GCTCGCGCCG GCGCTCGCGG CCGGCAACTG CGTCGTGCTC
AAGCCGGCCG AGCAGACGCC CGCGTCGATC CTCGTGTTCG CCGAGCTGAT CCAGGATCTG
CTGCCGCCCG GCGTGCTCAA CATCGTCAAC GGCTTCGGCC TCGAGGCCGG CAAGCCGCTC
GCGTCGAGCA AGCGGATCGC GAAGATCGCG TTCACGGGCG AGACGTCGAC GGGCCGCCTC
ATCATGCAGT ACGCGAGCGA GAACCTGATT CCCGTCACGC TCGAGCTGGG CGGCAAGAGC
CCGAATATTT TCTTCGCCGA CGTGATGGAT CGCGACGACA GCTACTTCGA CAAGGCGCTC
GAAGGCTTCG CGATGTTCGC GCTGAACCAG GGCGAAGTCT GCACGTGCCC ATCGCGCGCG
CTCGTCGAGG AGAGCATCTA CGATCGCTTC ATCGAACGCG CGCTCAAGCG CGTCGAGGCG
ATCAAGCAGG GCCATCCGCT CGATTCGCAG ACGATGATCG GCGCGCAGGC GTCGGCCGAG
CAGCTCGAGA AGATCCTGTC GTACATCGAC ATCGGCCGCG GCGAAGGCGC GCAATGCCTG
ACGGGCGGCG AGCGCAACGT GCTCGGCGGC GAGCTCGCCG AAGGCTATTA CGTGAAGCCG
ACCGTGTTCC GCGGCCACAA CAAGATGCGC ATCTTCCAGG AAGAAATCTT CGGGCCGGTG
CTCGCGGTGA CGACGTTCAA GACCGAGGAG GAAGCGCTCG AGATCGCGAA CGACACGCTG
TACGGCCTGG GCGCCGGCGT CTGGACGCGC GACGGCAACC GCGCGTACCG CTTCGGCCGC
GGCATCCAGG CGGGCCGCGT GTGGACGAAC TGCTATCACG CGTATCCGGC GCACGCGGCG
TTCGGCGGCT ACAAGCAATC CGGCATCGGC CGCGAGACGC ACAAGATGAT GCTCGACCAC
TACCAGCAGA CGAAGAACCT GCTCGTCAGC TACAGCGAAA AGCCGCTCGG GTTCTTCTGA
 
Protein sequence
MHRGAACWHG FGVTNGVAHV ARRITDTDMR GATMNHADMQ HLNIEFPYRK QYGNFIGGEW 
VAPVGGEYFD NVSPVTGRPF TAIPRSREAD IELALDAAHA AKAGWAAKGA AERANVLLRI
ADRMEANLTR LAVAETIDNG KPLRETTAAD VPLAIDHFRY FAGCIRAQEG SIADIGGDMV
AYHFHEPLGV VGQIIPWNFP LLMAAWKLAP ALAAGNCVVL KPAEQTPASI LVFAELIQDL
LPPGVLNIVN GFGLEAGKPL ASSKRIAKIA FTGETSTGRL IMQYASENLI PVTLELGGKS
PNIFFADVMD RDDSYFDKAL EGFAMFALNQ GEVCTCPSRA LVEESIYDRF IERALKRVEA
IKQGHPLDSQ TMIGAQASAE QLEKILSYID IGRGEGAQCL TGGERNVLGG ELAEGYYVKP
TVFRGHNKMR IFQEEIFGPV LAVTTFKTEE EALEIANDTL YGLGAGVWTR DGNRAYRFGR
GIQAGRVWTN CYHAYPAHAA FGGYKQSGIG RETHKMMLDH YQQTKNLLVS YSEKPLGFF