Gene BTH_II0707 details


Gene Information

Locus tag: BTH_II0707
Symbol:
ID: 3845142
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia thailandensis E264
Kingdom: Bacteria
Replicon accession: NC_007650
Strand:
Start bp: 827701
End bp: 830772
Gene Length: 3072 bp
Protein Length: 1023 aa
Translation table: 11
GC content: 66%
IMG OID: 637838012
Product: formate dehydrogenase, alpha subunit, selenocysteine-containing
Protein accession: YP_438906
Protein GI: 83716383
COG category: [C] Energy production and conversion
COG ID: [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
TIGRFAM ID: [TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type
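
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon. Both assumptions are consistent with the values in this record, but the record itself does not state them. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the final codon is a stop codon
    that is counted in the gene length but not in the protein."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(827701, 830772)
print(length)                  # 3072
print(protein_length(length))  # 1023
```

Both results agree with the Gene Length and Protein Length fields listed above.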


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTCCAAT TGTCCCGGCG CCAGTTCCTG AAGCTGTCCG CGACGACGCT CGCCGGATCG 
AGCCTAGCCC TGTTGGGCTT CTCGCCGGCC GAAGCGCTCG CCGAGGTCCG CCAATACAAG
CTGGCGCGCA CTGTCGAAAC CCGCAACACG TGTCCTTACT GCTCGGTCGG TTGCGGGATA
CTGATGTACG GCCTCGGCGA CGGCGCGAAG AACGCCACGT CGAGCATCAT CCACATCGAG
GGCGACCCCG ACCACCCGGT CAACCGCGGC ACGCTGTGCC CGAAGGGCGC GAGCCTCATC
GATTTCATCC ATAGCCCGAG CCGCCTCACG CAGCCCGAGT ACCGCGCGGC CGGCTCCGAC
AAGTGGCAGC CGATCTCGTG GAGCGACGCG CTCGACCGGA TCGCGAAGCT GATGAAGGCG
GACCGCGACG CGAACTTCGT CGAGACGACG GACGACGGCA AGAAGGTCAA CCGCTGGCTC
ACGACGGGCA TGCTGGCCGC ATCGGCGGGC AGCAACGAAG TCGGCTATCT GACGCACAAG
ACCGTGCGCA GCATGGGGAT GCTCGCGTTC GACAACCAGG CTCGTGTCTG ACATGGCCCG
ACGGTGGCAG GTCTTGCCCC GACGTTTGGC CGTGGCGCGA TGACGAACCA TTGGGTCGAC
ATCAAGAACG CGGACGTTAT TCTCGTGATG GGCGGCAACG CCGCCGAAGC GCACCCGTGC
GGCTTCAAAT GGGTCACCGA AGCGAAGGCG CATCGCAATG CGCGCCTCGT CGTCGTCGAT
CCGCGCTTCA CGCGCACCGC ATCGGTCGCC GACTATTACG CGCCGATTCG CACCGGCACG
GACATCGCGT TCCTCGGCGG GGTGATCAAC TACCTGCTGA CGAACGACAA GATCCAGCAC
GAGTACGTCA AGAACTACAC GGATTTCCCG TTCATCGTTC GCGAGGATTT CGCGTTCAAC
GACGGCATCT ATTCCGGCTA CGACGCGGAC AAGCACGCGT ACCCGGACAA GTCGACGTGG
GAGTACGAGC GCGGCGACGA CGGCTTCGTG AAGGTCGACG ACACGCTCGC GCACCCGCGC
TGCGTGTACA ACCTGCTCAA GCAGCACTAC TCGCGCTATA CGCCGGAGAT GGTCGAGAAG
ATCTGCGGCA CGCCTAAGGA CAAGTTCCTG AAGGTGTGCG AGATGCTCGC GACGACGGCC
GTGCCAGGCC GCGCCGGCAC GGTGCTGTAC GCGCTCGGCT GGACGCACCA CTCGGTCGGC
GCGCAGATGA TCCGCACGGG CGCGATGGTG CAACTGCTGC TCGGCAACAT CGGCATCGCG
GGCGGCGGGA TGAACGCGCT GCGCGGGCAC TCGAACATCC AGGGGTTGAC CGACCTCGGG
CTGATGTCGA ACCTGCTGCC GGGCTACATG ACGCTGCCGA TGCAGGCCGA GCAGGATTTC
GACGCCTACA TCCAGAAGCG CGCGCAGCAG CCGCTGCGGC CCAACCAGCT GAGCTACTGG
AAGAACTATC GCGCGTTCCA CGTGAGCTTC ATGAAGGCGT GGTGGGGCGA TGCCGTCAGC
GCCGAGAACA ACTGGGGCTA CGACTACCTG CCGAAGCTCG ACAAGCCGTA CGACCTCCTG
CAGACGATCG AGCTGATGCA CGCGGGCAAG ATGAACGGCT ATATCTGCCA GGGCTTCAAC
CCGCTCGCGG CGGCACCGTC CAAGCGCAAG ACGTCCGAGG CGCTCGCGAA GCTGAAGTGG
CTCGTGATCA TGGACCCGCT CGCGACCGAG ACGTCCGAGT TCTGGAAGAA TCACGGCGAG
CACAACGACG TCGATTCGTC GAAGATCCAG ACGGAGGTGT TCCGGCTGCC GACGTCGTGC
TTCGCGGAGG AGCGCGGCTC GCTCGTCAAC TCCGGCCGCG TGCTGCAGTG GCACTGGCAG
GGCGCGGAGC CGCCCGGCCA GGCGAAGAGC GACCTCGAGA TCATGTCGGG GATCTTCCTG
CGGATGCGCG ACATGTACAG GAAGGACGGC GGCAAGTATC CCGACCCGAT CGTCAACCTG
AGCTGGCCGT ACGCGAACCC GGAAAGCCCG ACGCCCGAGG AGCTCGCGAT GGAGTTCAAC
GGCCGCGCGC TCGCCGATCT GCCTGACCCG AAGGACCCGA CGAAGACGCT CGTGAAGAAG
GGCGAGCAGC TCGCCGGCTT CGCGCAACTG AAGGACGACG GCACGACCGC GAGCGGCTGC
TGGATCTTCT GCGGCGCGTG GACGCAAGCG GGCAACCAGA TGGCGCGGCG CGACAACTCG
GACCCGACCG GCATCGGCCA GACGCTCAAT TGGGCGTGGG CGTGGCCCGC GAACCGGCGC
ATCCTGTACA ACCGCGCGTC GTGCGACGTC GCCGGCAAGC CGTTCGACCC GACGCGCAAG
CTGATCGGCT GGAACGGCAA GACGTGGACG GGCGCGGACG TTCCCGACTA CAAGATCGAC
GAGCCGCCCG AGACCGGCAT GGGCCCGTTC ATCATGAACC CGGAAGGCGT CGCGCGCTTC
TTCGCGCGCG CCGCGATGAA CGAAGGCCCG TTCCCCGAGC ACTACGAGCC GTTCGAGACA
CCGCTCGCCG CGAATCCGCT GCATCCGAAC AACCCGCAGG CGCTGAACAA CCCGGCTGCC
CGCGTGTTCC CGGACGATCG CGCGGCGTTC GGCAAGGTCG ACCAGTTCCC GCACGTCGCG
ACGACCTATC GTCTGACCGA GCACTTCCAC TACTGGACGA AGCATGCGCG GCTGAACGCG
ATCATCCAGC CCGAGCAGTT CGTCGAGATC GGCGAGGAGC TCGCGAAGGA GGTCGGCGTC
GCGCACGGCG ATCGCGTGAA GGTGTCGTCG AACCGCGGGC ACATCGTCGC GGTCGCGCTC
GTCACGAAGC GGATCAAGCC GCTCACGGTC GACGGCCGCA AGGTGCAGAC GGTCGGCATT
CCGTTGCATT GGGGCTTCAA GGGGTTGACG AAGCCCGGCT ATCTCGCGAA CACCCTGACT
CCGTCCGTCG GCGACGGCAA CTCGCAGACA CCGGAATTCA AGTCGTTCCT GGTGAAAGTG
GAAAAGGCGT AA
 
Protein sequence
MLQLSRRQFL KLSATTLAGS SLALLGFSPA EALAEVRQYK LARTVETRNT CPYCSVGCGI 
LMYGLGDGAK NATSSIIHIE GDPDHPVNRG TLCPKGASLI DFIHSPSRLT QPEYRAAGSD
KWQPISWSDA LDRIAKLMKA DRDANFVETT DDGKKVNRWL TTGMLAASAG SNEVGYLTHK
TVRSMGMLAF DNQARVUHGP TVAGLAPTFG RGAMTNHWVD IKNADVILVM GGNAAEAHPC
GFKWVTEAKA HRNARLVVVD PRFTRTASVA DYYAPIRTGT DIAFLGGVIN YLLTNDKIQH
EYVKNYTDFP FIVREDFAFN DGIYSGYDAD KHAYPDKSTW EYERGDDGFV KVDDTLAHPR
CVYNLLKQHY SRYTPEMVEK ICGTPKDKFL KVCEMLATTA VPGRAGTVLY ALGWTHHSVG
AQMIRTGAMV QLLLGNIGIA GGGMNALRGH SNIQGLTDLG LMSNLLPGYM TLPMQAEQDF
DAYIQKRAQQ PLRPNQLSYW KNYRAFHVSF MKAWWGDAVS AENNWGYDYL PKLDKPYDLL
QTIELMHAGK MNGYICQGFN PLAAAPSKRK TSEALAKLKW LVIMDPLATE TSEFWKNHGE
HNDVDSSKIQ TEVFRLPTSC FAEERGSLVN SGRVLQWHWQ GAEPPGQAKS DLEIMSGIFL
RMRDMYRKDG GKYPDPIVNL SWPYANPESP TPEELAMEFN GRALADLPDP KDPTKTLVKK
GEQLAGFAQL KDDGTTASGC WIFCGAWTQA GNQMARRDNS DPTGIGQTLN WAWAWPANRR
ILYNRASCDV AGKPFDPTRK LIGWNGKTWT GADVPDYKID EPPETGMGPF IMNPEGVARF
FARAAMNEGP FPEHYEPFET PLAANPLHPN NPQALNNPAA RVFPDDRAAF GKVDQFPHVA
TTYRLTEHFH YWTKHARLNA IIQPEQFVEI GEELAKEVGV AHGDRVKVSS NRGHIVAVAL
VTKRIKPLTV DGRKVQTVGI PLHWGFKGLT KPGYLANTLT PSVGDGNSQT PEFKSFLVKV
EKA