Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A0734 |
Symbol | |
ID | 4887256 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 698186 |
End bp | 699772 |
Gene Length | 1587 bp |
Protein Length | 528 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640130674 |
Product | aldehyde dehydrogenase (NAD) family protein |
Protein accession | YP_001061733 |
Protein GI | 126444965 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.0834633 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCGCGCGC CCGCCCGCCC GCGCGCGGCG CCGCGCGTCA CATGCGCTCG CGCATGCGGC GCCGCGCGGC GAATCCCACT CTACGAGGCT ATTCCGATGG ACAAGACCAC TTTGGCTGAC TGGCAGGACA AGGCCGCGAC GCTCGCGATC GAGGGGCGCG CATTCATCGA CGGCGCGTAT CGCGACGCGC ACGGCGGCAA GACCTTCGAT TGCGTGAGCC CGATCGACGG GCGCGTGCTC GCGAAGGTCG CCGATTGCGG CGCGGCCGAT GTCGACGCGG CGGTGGCCGC CGCGCGGCGC GCGTTCGACG CGCAGGCGTG GGCGGGCCTG AACCCGCGCG AGCGCAAGGC GATCCTGCTG CGCTGGGCCG CGCTGATGCG CGCGCATCTC GACGAGCTGG CGCTGCTCGA GACGCTCGAC GCGGGCAAGC CGATCGGCGA CACGACGAGC GTCGACGTGC CGGGCGCCGC GTACTGCGTC GAATGGTTCG CCGAGGCGAT CGACAAGGTG GGCGGCGAAG TGGTGCCCGC CGATCATCAT CTCGTCGGCC TCGTCACGCG CGAGCCGCTC GGCGTCGTCG CCGCCGTCGT GCCGTGGAAT TTTCCGATCC TGATGGCGTC GTGGAAGTTC GGCCCGGCGC TCGCCGCGGG CAACAGCGTC GTGCTCAAGC CGTCGGAGAA ATCGCCGCTC ACGGCGATCA GGGTCGCGCG GCTCGCGCAC GAGGCGGGGA TTCCGGCCGG CGTGTTCAAC GTCGTGCCGG GCGGCGGCGA GCCGGGCAAG CTGCTCGCGC TGCATCGCGA CGTCGACTGT CTCGCGTTCA CCGGCTCCAC GGGTGTCGGC AAGCTGATCA TGCAGTACGC GGGGCAATCG AACCTGAAGC GCGTGTGGCT CGAGCTGGGC GGCAAGTCGC CGAACATCGT GCTGCCCGAC TGCCCGGATC TCGACCGCGC GGCGAAGGCG GCGGCGGGCG CGATCTTCTA CAACATGGGC GAGATGTGCA CGGCGGGATC GCGCCTGCTC GTGCACCGCG AGATCAAGGA CGCGTTCGTC GAAAAGCTCG TCGCCGCGGC GCGCGCGTAC AAGCCGGGCA ATCCGCTCGA TCCGAACGTG TCGATGGGCG CGATCGTCGA CGCGATCCAG CTCGAGCGCG TGCTCGGCTA CATCGAGGCG GGCCGCGCCG AAGCGCGGCT GCTGCTCGGC GGCGCGCGCG TGAACGAGGC GAGCGGCGGC TTCTACATCG AGCCGACCGT GTTCGACACC GCGCCCGACA CACGGATCGC GCGCGAGGAA ATCTTCGGCC CGGTGCTGTC GATGATCACG TTCGATTCGG TCGACGAAGC GGTGAGGATC GCGAACGACA GCGAATACGG GCTCGGCGCG GCCGTGTGGA CCGCGAACCT GACGACCGCG CACGAACTCG CGCGGCGGTT GCGCGCGGGC ACCGTGTGGG TCAACTGCTA CGACGAAGGG GGCGACATGA ACTTCCCGTT CGGCGGCTAC AAGCAGTCGG GCAACGGCCG CGACAAGTCG TTGCATGCAC TGGAGAAGTA CACCGAGCTG AAGTCCACGC TCGTGCGGCT GCGCTAA
|
Protein sequence | MRAPARPRAA PRVTCARACG AARRIPLYEA IPMDKTTLAD WQDKAATLAI EGRAFIDGAY RDAHGGKTFD CVSPIDGRVL AKVADCGAAD VDAAVAAARR AFDAQAWAGL NPRERKAILL RWAALMRAHL DELALLETLD AGKPIGDTTS VDVPGAAYCV EWFAEAIDKV GGEVVPADHH LVGLVTREPL GVVAAVVPWN FPILMASWKF GPALAAGNSV VLKPSEKSPL TAIRVARLAH EAGIPAGVFN VVPGGGEPGK LLALHRDVDC LAFTGSTGVG KLIMQYAGQS NLKRVWLELG GKSPNIVLPD CPDLDRAAKA AAGAIFYNMG EMCTAGSRLL VHREIKDAFV EKLVAAARAY KPGNPLDPNV SMGAIVDAIQ LERVLGYIEA GRAEARLLLG GARVNEASGG FYIEPTVFDT APDTRIAREE IFGPVLSMIT FDSVDEAVRI ANDSEYGLGA AVWTANLTTA HELARRLRAG TVWVNCYDEG GDMNFPFGGY KQSGNGRDKS LHALEKYTEL KSTLVRLR
|
| |