Gene Information |
Locus tag | BMASAVP1_A3320 |
Symbol | |
ID | 4680767 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei SAVP1 |
Kingdom | Bacteria |
Replicon accession | NC_008785 |
Strand | + |
Start bp | 3279121 |
End bp | 3280740 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639847574 |
Product | aldehyde dehydrogenase family protein |
Protein accession | YP_994599 |
Protein GI | 121599516 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACCGCG GCGCGGCGTG CTGGCACGGT TTTGGCGTAA CGAACGGCGT CGCGCACGTC GCGCGGCGCA TCACCGATAC CGACATGCGA GGAGCAACGA TGAATCACGC GGACATGCAA CATCTGAACA TCGAATTCCC GTACCGCAAG CAGTACGGGA ATTTCATCGG CGGCGAATGG GTCGCCCCGG TCGGCGGCGA GTATTTCGAC AACGTCTCGC CCGTCACCGG CCGGCCGTTC ACCGCGATCC CTCGCTCGCG CGAAGCCGAC ATCGAGCTCG CGCTCGACGC CGCTCACGCG GCCAAGGCGG GCTGGGCCGC GAAGGGCGCG GCCGAGCGCG CGAACGTGCT GCTGAGGATC GCCGACCGGA TGGAGGCGAA CCTCACGCGC CTCGCCGTCG CCGAGACGAT CGACAACGGC AAGCCGCTGC GCGAAACCAC CGCAGCCGAC GTGCCGCTCG CGATCGACCA CTTCCGCTAC TTCGCGGGCT GCATCCGCGC GCAGGAAGGC TCGATCGCCG ATATCGGCGG CGACATGGTG GCCTACCACT TCCACGAGCC GCTCGGCGTC GTCGGCCAGA TCATCCCGTG GAACTTCCCG CTGCTGATGG CCGCGTGGAA GCTCGCGCCG GCGCTCGCGG CCGGCAACTG CGTCGTGCTC AAGCCGGCCG AGCAGACGCC CGCGTCGATC CTCGTGTTCG CCGAGCTGAT CCAGGATCTG CTGCCGCCCG GCGTGCTCAA CATCGTCAAC GGCTTCGGCC TCGAGGCCGG CAAGCCGCTC GCGTCGAGCA AGCGGATCGC GAAGATCGCG TTCACGGGCG AGACGTCGAC GGGCCGCCTC ATCATGCAGT ACGCGAGCGA GAACCTGATT CCCGTCACGC TCGAGCTGGG CGGCAAGAGC CCGAATATTT TCTTCGCCGA CGTGATGGAT CGCGACGACA GCTACTTCGA CAAGGCGCTC GAAGGCTTCG CGATGTTCGC GCTGAACCAG GGCGAAGTCT GCACGTGCCC ATCGCGCGCG CTCGTCGAGG AGAGCATCTA CGATCGCTTC ATCGAACGCG CGCTCAAGCG CGTCGAGGCG ATCAAGCAGG GCCATCCGCT CGATTCGCAG ACGATGATCG GCGCGCAGGC GTCGGCCGAG CAGCTCGAGA AGATCCTGTC GTACATCGAC ATCGGCCGCG GCGAAGGCGC GCAATGCCTG ACGGGCGGCG AGCGCAACGT GCTCGGCGGC GAGCTCGCCG AAGGCTATTA CGTGAAGCCG ACCGTGTTCC GCGGCCACAA CAAGATGCGC ATCTTCCAGG AAGAAATCTT CGGGCCGGTG CTCGCGGTGA CGACGTTCAA GACCGAGGAG GAAGCGCTCG AGATCGCGAA CGACACGCTG TACGGCCTGG GCGCCGGCGT CTGGACGCGC GACGGCAACC GCGCGTACCG CTTCGGCCGC GGCATCCAGG CGGGCCGCGT GTGGACGAAC TGCTATCACG CGTATCCGGC GCACGCGGCG TTCGGCGGCT ACAAGCAATC CGGCATCGGC CGCGAGACGC ACAAGATGAT GCTCGACCAC TACCAGCAGA CGAAGAACCT GCTCGTCAGC TACAGCGAAA AGCCGCTCGG GTTCTTCTGA
|
Protein sequence | MHRGAACWHG FGVTNGVAHV ARRITDTDMR GATMNHADMQ HLNIEFPYRK QYGNFIGGEW VAPVGGEYFD NVSPVTGRPF TAIPRSREAD IELALDAAHA AKAGWAAKGA AERANVLLRI ADRMEANLTR LAVAETIDNG KPLRETTAAD VPLAIDHFRY FAGCIRAQEG SIADIGGDMV AYHFHEPLGV VGQIIPWNFP LLMAAWKLAP ALAAGNCVVL KPAEQTPASI LVFAELIQDL LPPGVLNIVN GFGLEAGKPL ASSKRIAKIA FTGETSTGRL IMQYASENLI PVTLELGGKS PNIFFADVMD RDDSYFDKAL EGFAMFALNQ GEVCTCPSRA LVEESIYDRF IERALKRVEA IKQGHPLDSQ TMIGAQASAE QLEKILSYID IGRGEGAQCL TGGERNVLGG ELAEGYYVKP TVFRGHNKMR IFQEEIFGPV LAVTTFKTEE EALEIANDTL YGLGAGVWTR DGNRAYRFGR GIQAGRVWTN CYHAYPAHAA FGGYKQSGIG RETHKMMLDH YQQTKNLLVS YSEKPLGFF
|
| |
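The length fields in this record can be cross-checked against the coordinates with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon (both assumptions are consistent with the values listed above, but the record does not state them).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string; spaces from 10-base grouping are ignored."""
    bases = seq.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


length = gene_length(3279121, 3280740)
print(length)                  # 1620, matches "Gene Length"
print(protein_length(length))  # 539, matches "Protein Length"

# Demo on the first three 10-base groups of the gene sequence only,
# so this value is not expected to equal the whole-gene GC content.
print(round(gc_percent("ATGCACCGCG GCGCGGCGTG CTGGCACGGT")))  # 77
```

To check the reported GC content of the whole gene, pass the full "Gene sequence" string to gc_percent; the spaces between 10-base groups are stripped before counting.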