Gene Information |
Locus tag | BURPS1106A_A2259 |
Symbol | |
ID | 4904701 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | - |
Start bp | 2241278 |
End bp | 2244349 |
Gene Length | 3072 bp |
Protein Length | 1023 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640145364 |
Product | formate dehydrogenase-O, major subunit, selenocysteine-containing |
Protein accession | YP_001076292 |
Protein GI | 226830791 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type |
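The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length_bp = gene_length(2241278, 2244349)
print(length_bp)                  # 3072
print(protein_length(length_bp))  # 1023
```

Both results match the Gene Length and Protein Length fields in the record.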
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.181302 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCCAAC TGTCCCGGCG CCAGTTCCTG AAGCTGTCCG CGACGACGCT CGCCGGATCG AGCCTAGCCC TGATGGGCTT CTCGCCGGCC GAAGCGCTCG CCGAGGTCCG CCAATACAAG CTGGCGCGCA CTGTCGAAAC CCGCAACACC TGCCCTTACT GCTCGGTCGG CTGCGGGATC CTGATGTACG GCCTCGGCGA CGGCGCGAAG AACGCCACGT CGAGCATCGT CCACATCGAG GGCGACCCCG ATCACCCGGT CAACCGCGGC ACGCTGTGCC CGAAGGGCGC GAGCCTCATC GACTTCATCC ATAGCCCGAG CCGCCTCACG CAGCCCGAAT ACCGCGCGGC CGGCTCCGAC AAGTGGCAGC CGATCTCGTG GAGCGACGCG CTCGACCGGA TCGCGAAGCT GATGAAGGCG GACCGCGACG CGAACTTCGT CGAGACGACG GACGACGGCA TGAAGGTCAA CCGCTGGCTC ACGACGGGCA TGCTGGCCGC CTCGGCGGGC AGCAACGAGG TCGGCTATCT GACGCACAAG ACCGTGCGCA GCATGGGGAT GCTCGCGTTC GACAACCAGG CTCGTGTCTG ACATGGCCCG ACGGTGGCAG GTCTTGCCCC GACGTTTGGC CGTGGCGCGA TGACGAACCA TTGGGTCGAC ATCAAGAACG CGGACGTTAT TCTCGTGATG GGCGGCAATG CCGCCGAGGC CCATCCGTGC GGTTTCAAGT GGGTCACCGA AGCGAAGGCG CATCGCAACG CGCGCCTCGT CGTCGTCGAT CCGCGCTTCA CGCGCACCGC ATCGGTCGCC GATTATTACG CGCCGATTCG CACCGGCACG GACATCGCGT TCCTCGGCGG GGTGATCCAT TACCTGCTGA CGAACGACAA GATCCAGCAC GAGTACGTCA AGCATTACAC GGATTTCTCG TTCATCGTTC GCGAGGATTT CGCGTTCGAC GACGGCATCT ATTCCGGCTA CGACGCGGAC AAGCACGCGT ACCCGGACAA GTCGACGTGG GATTACGAGC GCGGCGACGA CGGCTTCGTG AAGGTCGACG AAACGCTCGC GCACCCGCGC TGCGTGTACA ACCTGCTCAA GCAGCACTAC GCGCGCTACA CGCCGGAGAT GGTCGAGAAG ATCTGCGGCA CGCCGAAGGA CAAGTTCCTG AAGGTATGCG AGATGCTCGC GACGACGGCC GTGCCCGGCC GCGCCGGCAC GGTGCTGTAC GCGCTCGGCT GGACGCACCA CTCGGTCGGC GCGCAGATGA TCCGCACGGG CGCGATGGTG CAGTTGCTGC TCGGCAACAT CGGCATCGCG GGCGGCGGGA TGAACGCGCT GCGCGGGCAC TCGAACATCC AGGGGTTGAC CGACCTCGGG CTGATGTCGA ACCTGCTGCC GGGTTACATG ACGCTGCCGA TGCAGGCCGA GCAGGATTTC GACGCCTACA TCCAGAAGCG CGCGCAGCAG CCGCTGCGGC CCAACCAGTT GAGCTACTGG AAGAACTACC GCGCGTTCCA CGTGAGCTTC ATGAAGGCGT GGTGGGGCGA CGCGGCGAGC GCCGAGAACA ACTGGGGCTA CGACTACCTG CCGAAGCTCG ACAAGCAGTA CGACCTGCTG CAGACGATCG AGCTGATGCA CGCGGGCAAG ATGAACGGCT ACATCTGCCA GGGCTTCAAC CCGCTCGCGG CGGCGCCGTC CAAGCGCAAG ACGTCCGAGG CGCTCGCGAA GCTGAAGTGG CTCGTGATCA TGGATCCGCT CGCCACCGAG ACGTCGGAGT TCTGGAAGAA CCACGGCGAG 
TTCAACGATG TCGATTCGTC GAAGATCCAG ACCGAGGTGT TCCGGCTGCC GACGTCGTGC TTCGCCGAGG AGCGCGGCTC GCTCGTGAAC TCCGGCCGCG TGCTGCAGTG GCACTGGCAG GGCGCGGAGC CGCCGGGCCA GGCGAAGAGC GACCTCGAGA TCATGTCGGG GATCTTCCTG CGCATGCGCG ACATGTACCG CAAGGACGGC GGCAAGTATC CCGACCCGAT CGTCAACCTG AGCTGGCCGT ACGCGAACCC GGAAAGCCCG ACGCCCGAAG AGCTCGCGAT GGAGTTCAAC GGCCGTGCGC TCGCGGATCT GCCTGATCCG AAAGATCCGA CGAAGACGCT CGTGAAGAAG GGTGAGCAGC TCGCCGGCTT CGCGCAGTTG AAGGACGACG GCACGACCGC GAGCGGCTGC TGGATCTTCT GCGGCGCGTG GACGCAAGCG GGCAACCAGA TGGCGCGGCG CGACAACGCG GACCCGACGG GCATCGGCCA GACGCTGAAC TGGGCGTGGG CGTGGCCGGC GAACCGGCGG ATCCTGTACA ACCGCGCGTC GTGCGACGTG AACGGCAAGC CGTTCGATCC GAGCCGCAAG CTGATCGGCT GGAACGGCAA GACGTGGACG GGCGCGGACG TTCCCGACTA CAAGCTCGAC GAGCCGCCCG AGACCGGCAT GGGCCCGTTC ATCATGAACC CGGAGGGCGT CGCACGCTTT TTCGCGCGCG CCGGGATGAA CGAAGGCCCG TTCCCCGAGC ACTACGAGCC GTTCGAGACG CCGCTCGCCG CGAACCCGCT GCATCCGGGC AACCCGCGCG CGCTGAACAA CCCGGCCGCC CGCGTGTTCC CGGACGATCG CGCGTCGTTC GGCAAGGTCG ACCAGTTCCC GCATGTCGCG ACGACCTATC GCCTGACCGA GCACTTCCAT TACTGGACGA AGCATGCGCG GCTGAACGCG ATCGTCCAGC CGCAGCAGTT CGTCGAGATC GGCGAGGATC TCGCGAAGGA GATCGGCGTC GCGCACGGCG AGCAAGTGAA GGTGTCGTCC AACCGCGGGC ACATCGTCGC GGTCGCGCTC GTCACCAAGC GCATCAAGCC GCTCATGGTC GACGGCAGGA AGGTGCAGAC GGTCGGCGTG CCGTTGCACT GGGGCTTCAA GGGATTGACG AAGCCCGGCT ATCTCGCGAA CACCCTGACT CCGTCCGTCG GCGACGGCAA CTCGCAGACA CCGGAATTCA AATCGTTCCT GGTGAAAGTG GAAAAGGCGT AA
|
Protein sequence | MLQLSRRQFL KLSATTLAGS SLALMGFSPA EALAEVRQYK LARTVETRNT CPYCSVGCGI LMYGLGDGAK NATSSIVHIE GDPDHPVNRG TLCPKGASLI DFIHSPSRLT QPEYRAAGSD KWQPISWSDA LDRIAKLMKA DRDANFVETT DDGMKVNRWL TTGMLAASAG SNEVGYLTHK TVRSMGMLAF DNQARVUHGP TVAGLAPTFG RGAMTNHWVD IKNADVILVM GGNAAEAHPC GFKWVTEAKA HRNARLVVVD PRFTRTASVA DYYAPIRTGT DIAFLGGVIH YLLTNDKIQH EYVKHYTDFS FIVREDFAFD DGIYSGYDAD KHAYPDKSTW DYERGDDGFV KVDETLAHPR CVYNLLKQHY ARYTPEMVEK ICGTPKDKFL KVCEMLATTA VPGRAGTVLY ALGWTHHSVG AQMIRTGAMV QLLLGNIGIA GGGMNALRGH SNIQGLTDLG LMSNLLPGYM TLPMQAEQDF DAYIQKRAQQ PLRPNQLSYW KNYRAFHVSF MKAWWGDAAS AENNWGYDYL PKLDKQYDLL QTIELMHAGK MNGYICQGFN PLAAAPSKRK TSEALAKLKW LVIMDPLATE TSEFWKNHGE FNDVDSSKIQ TEVFRLPTSC FAEERGSLVN SGRVLQWHWQ GAEPPGQAKS DLEIMSGIFL RMRDMYRKDG GKYPDPIVNL SWPYANPESP TPEELAMEFN GRALADLPDP KDPTKTLVKK GEQLAGFAQL KDDGTTASGC WIFCGAWTQA GNQMARRDNA DPTGIGQTLN WAWAWPANRR ILYNRASCDV NGKPFDPSRK LIGWNGKTWT GADVPDYKLD EPPETGMGPF IMNPEGVARF FARAGMNEGP FPEHYEPFET PLAANPLHPG NPRALNNPAA RVFPDDRASF GKVDQFPHVA TTYRLTEHFH YWTKHARLNA IVQPQQFVEI GEDLAKEIGV AHGEQVKVSS NRGHIVAVAL VTKRIKPLMV DGRKVQTVGV PLHWGFKGLT KPGYLANTLT PSVGDGNSQT PEFKSFLVKV EKA
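The protein sequence contains one `U` (selenocysteine, in `ARVUHGP`), which corresponds to an in-frame `TGA` codon in the gene sequence. The sketch below illustrates that relationship on two short fragments copied from the record. It assumes the codon assignments of translation table 11 (identical to the standard code for amino acids) and makes the simplifying assumption that every internal `TGA` is read as selenocysteine; in a real cell that recoding depends on sequence context, so this shortcut is only reasonable for a known selenoprotein. The function name is hypothetical.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments of table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_selenoprotein(dna: str) -> str:
    """Translate a coding sequence, reading internal TGA as 'U' (assumption)."""
    dna = dna.replace(" ", "").upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    residues = []
    for index, codon in enumerate(codons):
        aa = CODON_TABLE[codon]
        if aa == "*":
            if index == len(codons) - 1:
                break  # terminal stop codon: not part of the protein
            if codon != "TGA":
                raise ValueError(f"unexpected internal stop codon {codon}")
            aa = "U"  # assumed selenocysteine recoding
        residues.append(aa)
    return "".join(residues)


# Fragment around the in-frame TGA, and the last five codons of the gene.
print(translate_selenoprotein("GCTCGTGTCTG ACATGGCCCG"))  # ARVUHGP
print(translate_selenoprotein("GTGGAAAAGG CGTAA"))        # VEKA
```

The first fragment reproduces the `ARVUHGP` stretch of the protein sequence, and the second reproduces its C-terminal residues.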