Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_2902 |
Symbol | fdhA |
ID | 4884569 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | + |
Start bp | 2858243 |
End bp | 2861197 |
Gene Length | 2955 bp |
Protein Length | 984 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640128830 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_001059920 |
Protein GI | 126438525 |
COG category | [R] General function prediction only |
COG ID | [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.679741 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCTCTT CCGCATTCGA TTCGTCCCGG CAAGGCTGCG GCTCCGGCCA GTGCGCGTGC CGGCGCGCCG CGCAGACGCG GCGCGCCGAT CCGTTCGACG ACACCGACTA CGGCACCCCG ACGCGCCACG CGGACACCGA CGTCACGCTC GATATCGACG GCCGCGCGGT CACGGTGCCG GCGGGCACGT CGGTGATGCG CGCGGCGATC GAAGCCGGCA TCAACGTGCC GAAGCTCTGC GCGACCGATT CGCTCGAGCC GTTCGGCTCG TGTCGCCTGT GCCTCGTCGA GATCGAAGGC CGGCGCGGCT ATCCGGCCTC GTGCACGACG CCCGTCGAGG CCGGCATGAG GGTGCGCACG CAAAGCGACC GGCTGCAGGC GCTGCGCCGC AACGTGATGG AGCTCTACAT CTCCGATCAT CCGCTCGACT GTCTCACGTG CGCGGCGAAC GGCGATTGCG AGCTGCAGGA CATGGCGGGC GCGGTCGGCC TGCGCGAGGT GCGCTACGGC TTCGACGGCA AGAACCATCT GAGCGACGCG AAGGACGAAT CGAACCCGTA CTTCAGCTAC GATCCGTCGA AATGCATCGT TTGCAATCGC TGCGTGCGCG CGTGCGAGGA GACGCAGGGC ACGTTCGCGC TGACGATCGC CTCGCGCGGC TTCGAATCGC GGGTCGCGGC GAGCGCGGGC GAGGCGTTCA TGGATTCGGA ATGCGTATCG TGCGGCGCAT GCGTCGCCGC GTGCCCGACC GCGACGCTCG TCGAGAAAAG CGTCGCGCGG CTCGGTCAGC CCGAGCACGA GATCGTGACG ACCTGCGCGT ACTGCGGCGT CGGCTGCGCG CTGAAGGCGG AAATGAAGGG CGAGCAGGTC GTGCGGATGA CGCCGCACAA GAACGGCCTC GCGAACGAAG GCCACGCGTG CGTGAAAGGC CGCTTCGCGT GGGGCTATGC GACGCACAAG GACCGCATCA CGAAGCCGAT GATCCGCGAG AAGATCACCG ACGCGTGGCG CGAAGTGAGC TGGGACGAAG CAATCGGCTA CGCGGCCGCG CGCTTTCGCC GCATCCAGGA CGAGCACGGC CGCGATTCGA TCGGCGGCAT CACGTCGTCG CGCTGCACGA ACGAAGAGAC GTATCTCGTG CAGAAGCTCG TGCGCGCGGC GTTCGGCAAC AACAATGTCG ACACTTGCGC GCGCGTGTGC CATTCGCCGA CGGGCTACGG CCTGAAAACC ACGCTCGGCG AATCGGCCGG CACGCAGACG TTCGCGTCGG TCGGCTCGGC CGACGTGATC GTCGTGATCG GCGCGAATCC GACCGACGGC CATCCGGTGT TCGGCTCGCG CCTGAAGCGC CGCGTGCGCG AGGGCGCGCA GCTGATCGTC GTCGATCCGC GCCGGATCGA TCTCGTCGAC GGCCCGCACG TGAAGGCCGT CCACCATCTG CAACTGCGCC CCGGCACGAA CGTCGCGCTC GTCAACGCGC TCGCGCACGT GATCGTCACC GAGGGCCTCG TCGACGAAGC GTTCGTCGCC GAGCGCTGCG AGCCGCAGGC GTTCGACGTA TGGCGCGCGT TCGCCGCGCG GCCCGAGAAT TCGCCCGAGG CGACGGCGGA CATCACGGGC GTGCCGGCCG ACGCGGTGCG CGCCGCCGCG CGCCTGTACG CGACGGGCGG GCGCGCGGCG ATTTTCTACG GGCTCGGCGT GACCGAGCAC GCGCAGGGCT CGACGATGGT GATGGGCATC GCGAACCTCG CGATGGCGAC GGGCAATCTC GGCATCGAAG GCGCGGGCGT GAACCCGCTG CGCGGGCAGA ACAACGTGCA GGGCTCGTGC GACATGGGTT CGTTCCCGCA CGAGCTGCCC GGCTACCGGC ACATCGGCGA CGCCGCGGTG CGCGCGCGCT TCGACGAAGC GTGGTCGACG ACGCTGCAGC CCGAGCCCGG CCTGCGCATT CCGAACATGT TCGACGCGGC GCTCGACGGC AGTTTCAAGG GGCTCTATTG CCAGGGCGAG GACATCGTCC AGTCCGATCC GAACACGCAG CACGTCGCGG CCGCGCTGGC GTCGCTCGAC TGCCTCGTCG TGCAGGACAT CTTCCTGAAC GAAACCGCGA AGTACGCGCA CGTATTCCTG CCGGGCGCGA CCTTCCTCGA GAAGGACGGC ACGTTCACGA ACGCGGAGCG CCGGATCTCG CGCGTGCGCC GCGCGATGCG GCCGCTTTCT GGCTACGCGG ACTGGGAAGT GACGCTGATG CTGTCGCGCG CGCTCGGCTA CGACATGCAT TACGCGCATC CGTCCGAGAT CATGGACGAG ATCGCGCGCC TCACGCCGAC GTTCGCGGGC GTGTCGTACG CGCTGCTCGA CGAGCTGGGC AGCGTCCAGT GGCCGTGCAA CGACGCGGCG CCGCAAGGCA CGCCGACGAT GCACGTCGAT CAGTTCGTGC GCGGCAAGGG CAAGTTCGTC ATCACGCAGT ACATCGCGTC GCCGGAGAAG GTGACGCCGC GCTATCCGCT CATCCTGACG ACGGGCCGCA TTCTGTCGCA GTACAACGTC GGCGCGCAGA CGCGCCGCAC GGAGAACGTC CGCTGGCACG GCGAGGATCG GCTCGAGATC CATCCGCACG ACGCGAACGA TCGCGGCATC CGGACAGGCG ACTGGGTCGG CGTCGCATCG CGCGCCGGGC AGACGGTGCT GCGCGCGCTC GTCACCGAGC GGATGCAGCC GGGCGTCGTC TACACGACGT TCCACTTCCC GGAATCGGGC GCGAACGTGA TCACGACCGA CAGCTCGGAC TGGGCGACCA ACTGCCCCGA GTACAAGGTG ACGGCCGTGC AGGTCGCGCC CGTCGCGCAG CCGTCCGAAT GGCAGCGCGC GTACACGCGC TTTCGCGCGG AGCAGCTCGC GCTGCTCGAG CAGCGCACGG CCGAGCGCGC GCCGGCCATC GCCACGGGCA AGTGA
|
Protein sequence | MTSSAFDSSR QGCGSGQCAC RRAAQTRRAD PFDDTDYGTP TRHADTDVTL DIDGRAVTVP AGTSVMRAAI EAGINVPKLC ATDSLEPFGS CRLCLVEIEG RRGYPASCTT PVEAGMRVRT QSDRLQALRR NVMELYISDH PLDCLTCAAN GDCELQDMAG AVGLREVRYG FDGKNHLSDA KDESNPYFSY DPSKCIVCNR CVRACEETQG TFALTIASRG FESRVAASAG EAFMDSECVS CGACVAACPT ATLVEKSVAR LGQPEHEIVT TCAYCGVGCA LKAEMKGEQV VRMTPHKNGL ANEGHACVKG RFAWGYATHK DRITKPMIRE KITDAWREVS WDEAIGYAAA RFRRIQDEHG RDSIGGITSS RCTNEETYLV QKLVRAAFGN NNVDTCARVC HSPTGYGLKT TLGESAGTQT FASVGSADVI VVIGANPTDG HPVFGSRLKR RVREGAQLIV VDPRRIDLVD GPHVKAVHHL QLRPGTNVAL VNALAHVIVT EGLVDEAFVA ERCEPQAFDV WRAFAARPEN SPEATADITG VPADAVRAAA RLYATGGRAA IFYGLGVTEH AQGSTMVMGI ANLAMATGNL GIEGAGVNPL RGQNNVQGSC DMGSFPHELP GYRHIGDAAV RARFDEAWST TLQPEPGLRI PNMFDAALDG SFKGLYCQGE DIVQSDPNTQ HVAAALASLD CLVVQDIFLN ETAKYAHVFL PGATFLEKDG TFTNAERRIS RVRRAMRPLS GYADWEVTLM LSRALGYDMH YAHPSEIMDE IARLTPTFAG VSYALLDELG SVQWPCNDAA PQGTPTMHVD QFVRGKGKFV ITQYIASPEK VTPRYPLILT TGRILSQYNV GAQTRRTENV RWHGEDRLEI HPHDANDRGI RTGDWVGVAS RAGQTVLRAL VTERMQPGVV YTTFHFPESG ANVITTDSSD WATNCPEYKV TAVQVAPVAQ PSEWQRAYTR FRAEQLALLE QRTAERAPAI ATGK
|
| |