Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMASAVP1_1189 |
Symbol | |
ID | 4676781 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei SAVP1 |
Kingdom | Bacteria |
Replicon accession | NC_008784 |
Strand | + |
Start bp | 1197929 |
End bp | 1199899 |
Gene Length | 1971 bp |
Protein Length | 656 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639843708 |
Product | aldehyde dehydrogenase (NADP) family protein |
Protein accession | YP_990788 |
Protein GI | 121596748 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTGGCTGT TCGGCTTCAA GTCGATCGTT GCGCGAGCCC GCGCCGAAGC CGATATCCCG TCATGGGCCC TCAACCGGCG GCTCGCGTTC GCGACGATTG CAATCCCGGC CGGGAATGCC GCGTGTTCGC ATTCGTTTGC CGGACGGCGC GCATGCGCGC AGGACATGGC GACCGGTGCG GGCACGCGTG CGCCGGTCGG GATGCGCGGC GCACGCGTTG CCGTCTGCCA CGATAATGGA ATGCGTGCGC GCCGGGGCGG TGCGATCCGT CGAACCGGCC GGTCGCCTCG ACTCGTTCAA TTCGACCCAT TCGACCCATT CGTCCGCAGG CGGCGCACGC CGCATCGCGA AGCGCACCGC CCGCGGCCAG GTTTCCAATT CGGAGGAAGC ATGCAGATCA CCGGCGAGAT GTTGATTGGC GCGGCCGCGG TGCGCGGTAG CGAAGGCACG ATGCGCGCTT ACGCGCCGGC GCAGGGCGTC GAGCTCGAGC CGACGTTCGG CGCGGGCGGT GCGGCCGACG TCGATCGCGC GTGCCGCCTC GCGAACGCCG CTTTCGATCC CTTTCGTCAG GCGCCGCTCG AGACGCGCGC ACGCTTTCTC GAGGCGATCG CCGAGCGCAT CGTCGGGCTC GGCGATCCAT TGATCGAACG CGCGCACGCG GAATCGGCGC TGCCCGTCGC GCGGCTCGAA GGCGAGCGCG CGCGCACGGT CGGTCAGCTC AGGCTCTTCG CGGCGATCGT GCGCGACGGC CGCTGGCTGA GCGCGACGCT CGATTCCGCG CAGCCCGAGC GCAAGCCGCT GCCGCGCGCC GATCTGCGCT TGCAGAAGAT TCCCGTCGGC CCGGTCGCGG TGTTCGGCGC GAGCAATTTC CCGCTCGCGT TCTCGGTCGC GGGCGGCGAC ACCGCTTCGG CGTTCGCGGC CGGCTGCCCC GTCGTCGCGA AGGCGCACCC CGCGCATCTC GGCACGTCGG AGCTCGTCGG GCGCGCGATC CGGCAGGCTG TCGCCGATTG CGGTTTGCAC GAGGGCGTGT TCTCGCTCGT CGTCGGCGTG GGCAACGCGA TCGGCGAGGC GCTCGTCGCG CATCCCGCGA TCAGGGCGGT CGGCTTCACC GGCTCGCGCG CGGGCGGCCT TGCGCTGATG GGCGTTGCCG CGCGGCGGCA CGAGCCGATT CCGGTCTTCG CGGAAATGAG CAGCATCAAT CCGTTCTTCG TGTTGCCCGG CGCGTTGCGC GCACGCGGTG CGCAAATCGC GCAAGGCTTC GTCGAATCGC TGACGCTCGG CGTCGGGCAG TTCTGCACGA ACCCGGGGCT CGTCGTCGCA CTCGAAGGGC CCGACCTGAA GGCGTTCGTC GACGCGGCCG CGCAGGCGCT CTCGCAAAAG GGCGCGCAGA CGATGCTGAC CTCGGGCATC GCGTCGTCTT ACGAGAGCGC GGTCGCGGCG CGCCGCGCGG CCGCGGGCGT CAGCGAGGTC GCGCGCGGCG CGCGCAGCGA CGCGCGGAAC GCCGCGTTGC CCGCGCTCTT CACGACGACG CACACGCAGT TCGTCCAGAA CCCGCAGCTC GAAGCCGAGA TCTTCGGGCC GACGTCGCTC GTCGTCGCGT GCCGCGACAT CGACGAGATG ATCGCGCTCG CCGAGCATGT CGAGGGGCAA CTGAGCGCGA CGCTGCATCT CGAAGACGAC GATGTCGATC TGGCGCGCAA ACTGTTGCCG ACGCTCGAGC GCCGCGCCGG CCGCATCGTC GCGAACGGCT ATCCGACGGG CGTCGAGGTC GCGTACGCGA TGGTGCACGG CGGGCCGTTT CCGGCGACGT CGGACCCGCG CAGCACATCG GTGGGTGCGC TTGCGATCGA GCGCTTCCTG CGGCCCGTCT GCTATCAGGA TTTGCCGGCG GCGTTGTTGC CCGAGGCACT CGCCGACGCG AATCCGCTCG GCCTCTGGCG CCTGCGCGAC GGCCAACTCG GCAAGGCATG A
|
Protein sequence | MWLFGFKSIV ARARAEADIP SWALNRRLAF ATIAIPAGNA ACSHSFAGRR ACAQDMATGA GTRAPVGMRG ARVAVCHDNG MRARRGGAIR RTGRSPRLVQ FDPFDPFVRR RRTPHREAHR PRPGFQFGGS MQITGEMLIG AAAVRGSEGT MRAYAPAQGV ELEPTFGAGG AADVDRACRL ANAAFDPFRQ APLETRARFL EAIAERIVGL GDPLIERAHA ESALPVARLE GERARTVGQL RLFAAIVRDG RWLSATLDSA QPERKPLPRA DLRLQKIPVG PVAVFGASNF PLAFSVAGGD TASAFAAGCP VVAKAHPAHL GTSELVGRAI RQAVADCGLH EGVFSLVVGV GNAIGEALVA HPAIRAVGFT GSRAGGLALM GVAARRHEPI PVFAEMSSIN PFFVLPGALR ARGAQIAQGF VESLTLGVGQ FCTNPGLVVA LEGPDLKAFV DAAAQALSQK GAQTMLTSGI ASSYESAVAA RRAAAGVSEV ARGARSDARN AALPALFTTT HTQFVQNPQL EAEIFGPTSL VVACRDIDEM IALAEHVEGQ LSATLHLEDD DVDLARKLLP TLERRAGRIV ANGYPTGVEV AYAMVHGGPF PATSDPRSTS VGALAIERFL RPVCYQDLPA ALLPEALADA NPLGLWRLRD GQLGKA
|
| |