Gene BURPS1106A_A2259 details


Gene Information

Locus tag: BURPS1106A_A2259
Symbol:
ID: 4904701
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 2241278
End bp: 2244349
Gene Length: 3072 bp
Protein Length: 1023 aa
Translation table: 11
GC content: 66%
IMG OID: 640145364
Product: formate dehydrogenase-O, major subunit, selenocysteine-containing
Protein accession: YP_001076292
Protein GI: 226830791
COG category: [C] Energy production and conversion
COG ID: [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
TIGRFAM ID: [TIGR01553] formate dehydrogenase, alpha subunit, proteobacterial-type
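
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, which the record does not state explicitly, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2241278
end_bp = 2244349
gene_length_bp = 3072
protein_length_aa = 1023

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1

# Each codon is three bases; the last codon is assumed to be the stop codon.
codons = gene_length_bp // 3

assert span == gene_length_bp
assert gene_length_bp % 3 == 0
assert codons - 1 == protein_length_aa
```

Under these assumptions the three fields agree: 3072 bases give 1024 codons, of which 1023 encode residues.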


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.181302
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTCCAAC TGTCCCGGCG CCAGTTCCTG AAGCTGTCCG CGACGACGCT CGCCGGATCG 
AGCCTAGCCC TGATGGGCTT CTCGCCGGCC GAAGCGCTCG CCGAGGTCCG CCAATACAAG
CTGGCGCGCA CTGTCGAAAC CCGCAACACC TGCCCTTACT GCTCGGTCGG CTGCGGGATC
CTGATGTACG GCCTCGGCGA CGGCGCGAAG AACGCCACGT CGAGCATCGT CCACATCGAG
GGCGACCCCG ATCACCCGGT CAACCGCGGC ACGCTGTGCC CGAAGGGCGC GAGCCTCATC
GACTTCATCC ATAGCCCGAG CCGCCTCACG CAGCCCGAAT ACCGCGCGGC CGGCTCCGAC
AAGTGGCAGC CGATCTCGTG GAGCGACGCG CTCGACCGGA TCGCGAAGCT GATGAAGGCG
GACCGCGACG CGAACTTCGT CGAGACGACG GACGACGGCA TGAAGGTCAA CCGCTGGCTC
ACGACGGGCA TGCTGGCCGC CTCGGCGGGC AGCAACGAGG TCGGCTATCT GACGCACAAG
ACCGTGCGCA GCATGGGGAT GCTCGCGTTC GACAACCAGG CTCGTGTCTG ACATGGCCCG
ACGGTGGCAG GTCTTGCCCC GACGTTTGGC CGTGGCGCGA TGACGAACCA TTGGGTCGAC
ATCAAGAACG CGGACGTTAT TCTCGTGATG GGCGGCAATG CCGCCGAGGC CCATCCGTGC
GGTTTCAAGT GGGTCACCGA AGCGAAGGCG CATCGCAACG CGCGCCTCGT CGTCGTCGAT
CCGCGCTTCA CGCGCACCGC ATCGGTCGCC GATTATTACG CGCCGATTCG CACCGGCACG
GACATCGCGT TCCTCGGCGG GGTGATCCAT TACCTGCTGA CGAACGACAA GATCCAGCAC
GAGTACGTCA AGCATTACAC GGATTTCTCG TTCATCGTTC GCGAGGATTT CGCGTTCGAC
GACGGCATCT ATTCCGGCTA CGACGCGGAC AAGCACGCGT ACCCGGACAA GTCGACGTGG
GATTACGAGC GCGGCGACGA CGGCTTCGTG AAGGTCGACG AAACGCTCGC GCACCCGCGC
TGCGTGTACA ACCTGCTCAA GCAGCACTAC GCGCGCTACA CGCCGGAGAT GGTCGAGAAG
ATCTGCGGCA CGCCGAAGGA CAAGTTCCTG AAGGTATGCG AGATGCTCGC GACGACGGCC
GTGCCCGGCC GCGCCGGCAC GGTGCTGTAC GCGCTCGGCT GGACGCACCA CTCGGTCGGC
GCGCAGATGA TCCGCACGGG CGCGATGGTG CAGTTGCTGC TCGGCAACAT CGGCATCGCG
GGCGGCGGGA TGAACGCGCT GCGCGGGCAC TCGAACATCC AGGGGTTGAC CGACCTCGGG
CTGATGTCGA ACCTGCTGCC GGGTTACATG ACGCTGCCGA TGCAGGCCGA GCAGGATTTC
GACGCCTACA TCCAGAAGCG CGCGCAGCAG CCGCTGCGGC CCAACCAGTT GAGCTACTGG
AAGAACTACC GCGCGTTCCA CGTGAGCTTC ATGAAGGCGT GGTGGGGCGA CGCGGCGAGC
GCCGAGAACA ACTGGGGCTA CGACTACCTG CCGAAGCTCG ACAAGCAGTA CGACCTGCTG
CAGACGATCG AGCTGATGCA CGCGGGCAAG ATGAACGGCT ACATCTGCCA GGGCTTCAAC
CCGCTCGCGG CGGCGCCGTC CAAGCGCAAG ACGTCCGAGG CGCTCGCGAA GCTGAAGTGG
CTCGTGATCA TGGATCCGCT CGCCACCGAG ACGTCGGAGT TCTGGAAGAA CCACGGCGAG
TTCAACGATG TCGATTCGTC GAAGATCCAG ACCGAGGTGT TCCGGCTGCC GACGTCGTGC
TTCGCCGAGG AGCGCGGCTC GCTCGTGAAC TCCGGCCGCG TGCTGCAGTG GCACTGGCAG
GGCGCGGAGC CGCCGGGCCA GGCGAAGAGC GACCTCGAGA TCATGTCGGG GATCTTCCTG
CGCATGCGCG ACATGTACCG CAAGGACGGC GGCAAGTATC CCGACCCGAT CGTCAACCTG
AGCTGGCCGT ACGCGAACCC GGAAAGCCCG ACGCCCGAAG AGCTCGCGAT GGAGTTCAAC
GGCCGTGCGC TCGCGGATCT GCCTGATCCG AAAGATCCGA CGAAGACGCT CGTGAAGAAG
GGTGAGCAGC TCGCCGGCTT CGCGCAGTTG AAGGACGACG GCACGACCGC GAGCGGCTGC
TGGATCTTCT GCGGCGCGTG GACGCAAGCG GGCAACCAGA TGGCGCGGCG CGACAACGCG
GACCCGACGG GCATCGGCCA GACGCTGAAC TGGGCGTGGG CGTGGCCGGC GAACCGGCGG
ATCCTGTACA ACCGCGCGTC GTGCGACGTG AACGGCAAGC CGTTCGATCC GAGCCGCAAG
CTGATCGGCT GGAACGGCAA GACGTGGACG GGCGCGGACG TTCCCGACTA CAAGCTCGAC
GAGCCGCCCG AGACCGGCAT GGGCCCGTTC ATCATGAACC CGGAGGGCGT CGCACGCTTT
TTCGCGCGCG CCGGGATGAA CGAAGGCCCG TTCCCCGAGC ACTACGAGCC GTTCGAGACG
CCGCTCGCCG CGAACCCGCT GCATCCGGGC AACCCGCGCG CGCTGAACAA CCCGGCCGCC
CGCGTGTTCC CGGACGATCG CGCGTCGTTC GGCAAGGTCG ACCAGTTCCC GCATGTCGCG
ACGACCTATC GCCTGACCGA GCACTTCCAT TACTGGACGA AGCATGCGCG GCTGAACGCG
ATCGTCCAGC CGCAGCAGTT CGTCGAGATC GGCGAGGATC TCGCGAAGGA GATCGGCGTC
GCGCACGGCG AGCAAGTGAA GGTGTCGTCC AACCGCGGGC ACATCGTCGC GGTCGCGCTC
GTCACCAAGC GCATCAAGCC GCTCATGGTC GACGGCAGGA AGGTGCAGAC GGTCGGCGTG
CCGTTGCACT GGGGCTTCAA GGGATTGACG AAGCCCGGCT ATCTCGCGAA CACCCTGACT
CCGTCCGTCG GCGACGGCAA CTCGCAGACA CCGGAATTCA AATCGTTCCT GGTGAAAGTG
GAAAAGGCGT AA
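
The GC content field can be recomputed from a sequence block like the one above. This is a minimal sketch: `gc_content` is a hypothetical helper, not part of any database tooling, and it assumes the sequence is pasted as plain text with whitespace separating the 10-base groups.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases in a whitespace-formatted sequence."""
    # Remove the spaces and line breaks used to lay out the sequence block.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Small illustrative input, not the full gene.
example_fraction = gc_content("GGCC AT")
```

Passing the full gene sequence to `gc_content` and multiplying by 100 gives a percentage that can be compared with the GC content field.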
 
Protein sequence
MLQLSRRQFL KLSATTLAGS SLALMGFSPA EALAEVRQYK LARTVETRNT CPYCSVGCGI 
LMYGLGDGAK NATSSIVHIE GDPDHPVNRG TLCPKGASLI DFIHSPSRLT QPEYRAAGSD
KWQPISWSDA LDRIAKLMKA DRDANFVETT DDGMKVNRWL TTGMLAASAG SNEVGYLTHK
TVRSMGMLAF DNQARVUHGP TVAGLAPTFG RGAMTNHWVD IKNADVILVM GGNAAEAHPC
GFKWVTEAKA HRNARLVVVD PRFTRTASVA DYYAPIRTGT DIAFLGGVIH YLLTNDKIQH
EYVKHYTDFS FIVREDFAFD DGIYSGYDAD KHAYPDKSTW DYERGDDGFV KVDETLAHPR
CVYNLLKQHY ARYTPEMVEK ICGTPKDKFL KVCEMLATTA VPGRAGTVLY ALGWTHHSVG
AQMIRTGAMV QLLLGNIGIA GGGMNALRGH SNIQGLTDLG LMSNLLPGYM TLPMQAEQDF
DAYIQKRAQQ PLRPNQLSYW KNYRAFHVSF MKAWWGDAAS AENNWGYDYL PKLDKQYDLL
QTIELMHAGK MNGYICQGFN PLAAAPSKRK TSEALAKLKW LVIMDPLATE TSEFWKNHGE
FNDVDSSKIQ TEVFRLPTSC FAEERGSLVN SGRVLQWHWQ GAEPPGQAKS DLEIMSGIFL
RMRDMYRKDG GKYPDPIVNL SWPYANPESP TPEELAMEFN GRALADLPDP KDPTKTLVKK
GEQLAGFAQL KDDGTTASGC WIFCGAWTQA GNQMARRDNA DPTGIGQTLN WAWAWPANRR
ILYNRASCDV NGKPFDPSRK LIGWNGKTWT GADVPDYKLD EPPETGMGPF IMNPEGVARF
FARAGMNEGP FPEHYEPFET PLAANPLHPG NPRALNNPAA RVFPDDRASF GKVDQFPHVA
TTYRLTEHFH YWTKHARLNA IVQPQQFVEI GEDLAKEIGV AHGEQVKVSS NRGHIVAVAL
VTKRIKPLMV DGRKVQTVGV PLHWGFKGLT KPGYLANTLT PSVGDGNSQT PEFKSFLVKV
EKA
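
The protein sequence contains a selenocysteine (U) where the gene sequence has an in-frame TGA codon, consistent with the selenocysteine-containing product annotation. The sketch below shows one way to reproduce that translation. It uses the standard amino-acid assignments, which translation table 11 shares with the standard code, and it makes a simplifying assumption: every internal in-frame TGA is read as U. In a real genome, selenocysteine recoding depends on sequence context, so this rule is only suitable for checking a record that is already annotated. The `translate` function is a hypothetical helper.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; "*" marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip([a + b + c for a in BASES for b in BASES for c in BASES], AMINO)
)


def translate(dna: str) -> str:
    """Translate a coding sequence, reading internal TGA codons as U."""
    seq = "".join(dna.split()).upper()
    # Ignore any trailing bases that do not form a complete codon.
    codons = [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]
    protein = []
    for index, codon in enumerate(codons):
        residue = CODON_TABLE[codon]
        if residue == "*":
            if index == len(codons) - 1:
                break  # terminal stop codon: end of the protein
            if codon != "TGA":
                raise ValueError(f"internal stop codon {codon} at codon {index}")
            residue = "U"  # assumption: internal TGA encodes selenocysteine
        protein.append(residue)
    return "".join(protein)


# Fragments copied from the gene sequence in this record.
first_codons = translate("ATGCTCCAAC TGTCCCGGCG CCAGTTCCTG")
sec_region = translate("GACAACCAGG CTCGTGTCTG ACATGGCCCG")
last_codons = translate("GAAAAGGCGT AA")
```

The three fragments correspond to the start of the protein (MLQLSRRQFL), the region around the selenocysteine (DNQARVUHGP), and the final residues (EKA) as listed in the protein sequence.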