Gene Information |
Locus tag | BTH_II0707 |
Symbol | |
ID | 3845142 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | + |
Start bp | 827701 |
End bp | 830772 |
Gene Length | 3072 bp |
Protein Length | 1023 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637838012 |
Product | formate dehydrogenase, alpha subunit, selenocysteine-containing |
Protein accession | YP_438906 |
Protein GI | 83716383 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type |
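The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that contributes no residue to the protein. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 827701
end_bp = 830772
gene_length_bp = 3072
protein_length_aa = 1023

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is read in whole codons and the last codon is a stop,
# so the protein has one residue fewer than the number of codons.
codon_count = span_bp // 3
expected_protein_length = codon_count - 1

print(span_bp)                    # 3072
print(span_bp % 3)                # 0, the span is a whole number of codons
print(expected_protein_length)    # 1023
```

Under these assumptions the span matches the listed Gene Length and the derived residue count matches the listed Protein Length.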
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCCAAT TGTCCCGGCG CCAGTTCCTG AAGCTGTCCG CGACGACGCT CGCCGGATCG AGCCTAGCCC TGTTGGGCTT CTCGCCGGCC GAAGCGCTCG CCGAGGTCCG CCAATACAAG CTGGCGCGCA CTGTCGAAAC CCGCAACACG TGTCCTTACT GCTCGGTCGG TTGCGGGATA CTGATGTACG GCCTCGGCGA CGGCGCGAAG AACGCCACGT CGAGCATCAT CCACATCGAG GGCGACCCCG ACCACCCGGT CAACCGCGGC ACGCTGTGCC CGAAGGGCGC GAGCCTCATC GATTTCATCC ATAGCCCGAG CCGCCTCACG CAGCCCGAGT ACCGCGCGGC CGGCTCCGAC AAGTGGCAGC CGATCTCGTG GAGCGACGCG CTCGACCGGA TCGCGAAGCT GATGAAGGCG GACCGCGACG CGAACTTCGT CGAGACGACG GACGACGGCA AGAAGGTCAA CCGCTGGCTC ACGACGGGCA TGCTGGCCGC ATCGGCGGGC AGCAACGAAG TCGGCTATCT GACGCACAAG ACCGTGCGCA GCATGGGGAT GCTCGCGTTC GACAACCAGG CTCGTGTCTG ACATGGCCCG ACGGTGGCAG GTCTTGCCCC GACGTTTGGC CGTGGCGCGA TGACGAACCA TTGGGTCGAC ATCAAGAACG CGGACGTTAT TCTCGTGATG GGCGGCAACG CCGCCGAAGC GCACCCGTGC GGCTTCAAAT GGGTCACCGA AGCGAAGGCG CATCGCAATG CGCGCCTCGT CGTCGTCGAT CCGCGCTTCA CGCGCACCGC ATCGGTCGCC GACTATTACG CGCCGATTCG CACCGGCACG GACATCGCGT TCCTCGGCGG GGTGATCAAC TACCTGCTGA CGAACGACAA GATCCAGCAC GAGTACGTCA AGAACTACAC GGATTTCCCG TTCATCGTTC GCGAGGATTT CGCGTTCAAC GACGGCATCT ATTCCGGCTA CGACGCGGAC AAGCACGCGT ACCCGGACAA GTCGACGTGG GAGTACGAGC GCGGCGACGA CGGCTTCGTG AAGGTCGACG ACACGCTCGC GCACCCGCGC TGCGTGTACA ACCTGCTCAA GCAGCACTAC TCGCGCTATA CGCCGGAGAT GGTCGAGAAG ATCTGCGGCA CGCCTAAGGA CAAGTTCCTG AAGGTGTGCG AGATGCTCGC GACGACGGCC GTGCCAGGCC GCGCCGGCAC GGTGCTGTAC GCGCTCGGCT GGACGCACCA CTCGGTCGGC GCGCAGATGA TCCGCACGGG CGCGATGGTG CAACTGCTGC TCGGCAACAT CGGCATCGCG GGCGGCGGGA TGAACGCGCT GCGCGGGCAC TCGAACATCC AGGGGTTGAC CGACCTCGGG CTGATGTCGA ACCTGCTGCC GGGCTACATG ACGCTGCCGA TGCAGGCCGA GCAGGATTTC GACGCCTACA TCCAGAAGCG CGCGCAGCAG CCGCTGCGGC CCAACCAGCT GAGCTACTGG AAGAACTATC GCGCGTTCCA CGTGAGCTTC ATGAAGGCGT GGTGGGGCGA TGCCGTCAGC GCCGAGAACA ACTGGGGCTA CGACTACCTG CCGAAGCTCG ACAAGCCGTA CGACCTCCTG CAGACGATCG AGCTGATGCA CGCGGGCAAG ATGAACGGCT ATATCTGCCA GGGCTTCAAC CCGCTCGCGG CGGCACCGTC CAAGCGCAAG ACGTCCGAGG CGCTCGCGAA GCTGAAGTGG CTCGTGATCA TGGACCCGCT CGCGACCGAG ACGTCCGAGT TCTGGAAGAA TCACGGCGAG 
CACAACGACG TCGATTCGTC GAAGATCCAG ACGGAGGTGT TCCGGCTGCC GACGTCGTGC TTCGCGGAGG AGCGCGGCTC GCTCGTCAAC TCCGGCCGCG TGCTGCAGTG GCACTGGCAG GGCGCGGAGC CGCCCGGCCA GGCGAAGAGC GACCTCGAGA TCATGTCGGG GATCTTCCTG CGGATGCGCG ACATGTACAG GAAGGACGGC GGCAAGTATC CCGACCCGAT CGTCAACCTG AGCTGGCCGT ACGCGAACCC GGAAAGCCCG ACGCCCGAGG AGCTCGCGAT GGAGTTCAAC GGCCGCGCGC TCGCCGATCT GCCTGACCCG AAGGACCCGA CGAAGACGCT CGTGAAGAAG GGCGAGCAGC TCGCCGGCTT CGCGCAACTG AAGGACGACG GCACGACCGC GAGCGGCTGC TGGATCTTCT GCGGCGCGTG GACGCAAGCG GGCAACCAGA TGGCGCGGCG CGACAACTCG GACCCGACCG GCATCGGCCA GACGCTCAAT TGGGCGTGGG CGTGGCCCGC GAACCGGCGC ATCCTGTACA ACCGCGCGTC GTGCGACGTC GCCGGCAAGC CGTTCGACCC GACGCGCAAG CTGATCGGCT GGAACGGCAA GACGTGGACG GGCGCGGACG TTCCCGACTA CAAGATCGAC GAGCCGCCCG AGACCGGCAT GGGCCCGTTC ATCATGAACC CGGAAGGCGT CGCGCGCTTC TTCGCGCGCG CCGCGATGAA CGAAGGCCCG TTCCCCGAGC ACTACGAGCC GTTCGAGACA CCGCTCGCCG CGAATCCGCT GCATCCGAAC AACCCGCAGG CGCTGAACAA CCCGGCTGCC CGCGTGTTCC CGGACGATCG CGCGGCGTTC GGCAAGGTCG ACCAGTTCCC GCACGTCGCG ACGACCTATC GTCTGACCGA GCACTTCCAC TACTGGACGA AGCATGCGCG GCTGAACGCG ATCATCCAGC CCGAGCAGTT CGTCGAGATC GGCGAGGAGC TCGCGAAGGA GGTCGGCGTC GCGCACGGCG ATCGCGTGAA GGTGTCGTCG AACCGCGGGC ACATCGTCGC GGTCGCGCTC GTCACGAAGC GGATCAAGCC GCTCACGGTC GACGGCCGCA AGGTGCAGAC GGTCGGCATT CCGTTGCATT GGGGCTTCAA GGGGTTGACG AAGCCCGGCT ATCTCGCGAA CACCCTGACT CCGTCCGTCG GCGACGGCAA CTCGCAGACA CCGGAATTCA AGTCGTTCCT GGTGAAAGTG GAAAAGGCGT AA
|
Protein sequence | MLQLSRRQFL KLSATTLAGS SLALLGFSPA EALAEVRQYK LARTVETRNT CPYCSVGCGI LMYGLGDGAK NATSSIIHIE GDPDHPVNRG TLCPKGASLI DFIHSPSRLT QPEYRAAGSD KWQPISWSDA LDRIAKLMKA DRDANFVETT DDGKKVNRWL TTGMLAASAG SNEVGYLTHK TVRSMGMLAF DNQARVUHGP TVAGLAPTFG RGAMTNHWVD IKNADVILVM GGNAAEAHPC GFKWVTEAKA HRNARLVVVD PRFTRTASVA DYYAPIRTGT DIAFLGGVIN YLLTNDKIQH EYVKNYTDFP FIVREDFAFN DGIYSGYDAD KHAYPDKSTW EYERGDDGFV KVDDTLAHPR CVYNLLKQHY SRYTPEMVEK ICGTPKDKFL KVCEMLATTA VPGRAGTVLY ALGWTHHSVG AQMIRTGAMV QLLLGNIGIA GGGMNALRGH SNIQGLTDLG LMSNLLPGYM TLPMQAEQDF DAYIQKRAQQ PLRPNQLSYW KNYRAFHVSF MKAWWGDAVS AENNWGYDYL PKLDKPYDLL QTIELMHAGK MNGYICQGFN PLAAAPSKRK TSEALAKLKW LVIMDPLATE TSEFWKNHGE HNDVDSSKIQ TEVFRLPTSC FAEERGSLVN SGRVLQWHWQ GAEPPGQAKS DLEIMSGIFL RMRDMYRKDG GKYPDPIVNL SWPYANPESP TPEELAMEFN GRALADLPDP KDPTKTLVKK GEQLAGFAQL KDDGTTASGC WIFCGAWTQA GNQMARRDNS DPTGIGQTLN WAWAWPANRR ILYNRASCDV AGKPFDPTRK LIGWNGKTWT GADVPDYKID EPPETGMGPF IMNPEGVARF FARAAMNEGP FPEHYEPFET PLAANPLHPN NPQALNNPAA RVFPDDRAAF GKVDQFPHVA TTYRLTEHFH YWTKHARLNA IIQPEQFVEI GEELAKEVGV AHGDRVKVSS NRGHIVAVAL VTKRIKPLTV DGRKVQTVGI PLHWGFKGLT KPGYLANTLT PSVGDGNSQT PEFKSFLVKV EKA
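The protein sequence contains a U (selenocysteine), consistent with the "selenocysteine-containing" product name. It lines up with an in-frame TGA codon in the gene sequence. The sketch below illustrates that correspondence on a 30-base fragment copied from the gene sequence. It is a simplification under stated assumptions: the `translate` function and its `recode_internal_tga` flag are hypothetical, the codon assignments are those of the standard code (which translation table 11 shares for non-start codons), and every internal in-frame TGA is treated as selenocysteine. Real selenocysteine incorporation depends on additional sequence context that this sketch does not model.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna, recode_internal_tga=True):
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Assumption for illustration: when recode_internal_tga is True, a TGA
    codon that is not the last codon is read as selenocysteine (U).
    """
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    protein = []
    for index, codon in enumerate(codons):
        aa = CODON_TABLE[codon]
        if aa == "*":
            is_last = index == len(codons) - 1
            if codon == "TGA" and recode_internal_tga and not is_last:
                aa = "U"
            else:
                break  # an ordinary stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# Fragment copied from the gene sequence above (spaces are ignored).
fragment = "GACAACCAGG CTCGTGTCTG ACATGGCCCG"

print(translate(fragment))                             # DNQARVUHGP
print(translate(fragment, recode_internal_tga=False))  # DNQARV
```

With recoding enabled, the fragment yields the same ten residues as the corresponding block of the protein sequence; without it, translation stops at the TGA codon.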