Gene BURPS1106A_A3106 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A3106 
Symbol 
ID4905923 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp3019678 
End bp3022599 
Gene Length2922 bp 
Protein Length973 aa 
Translation table11 
GC content67% 
IMG OID640146209 
Productmolybdopterin oxidoreductase 
Protein accessionYP_001077135 
Protein GI126457309 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGACC GTGCGCGATC CGGCGGCGAA CCACGCGAAG TGAAGACGAC GACCTGCTAC 
ATGTGCGCAT GCCGCTGCGG CATCCGCGTG CATTTGCGCA ACGGCGAAGT CCGCTACATC
GACGGCAACC CCGACCATCC GCTGAACCAG GGCGTGATCT GCGCGAAAGG CGCATCGGGC
ATCATGAAAC AGTATTCGCC CGCGCGCCTC ACGCAGCCGC TGATGCGCAA GGCGGGCGCC
GAGCGCGGCA GCGCGCAGTT CGAGCCGGTA TCGTGGGACG TCGCGTTCTC CGTGCTCGAA
CAGCGGCTCG CGCATCTGCG CGCGACGGAT CCGAAGCGCT TCGCGCTCTT CACCGGCCGC
GACCAGATGC AGGCGCTCAC CGGCCTGTTC GCGAAGCAGT ACGGCACGCC GAATTACGCG
GCGCACGGCG GCTTTTGCTC GGCGAACATG GCGGCCGGCA TGATCTATAC GGTCGGCGGC
TCGTTCTGGG AATTCGGCGG CCCCGATCTC GATCGCGCGA AGCTGTTCTT CATGATCGGC
ACCGCCGAGG ATCATCATTC GAATCCGCTG AAGATCGCGA TCTCGAAATT CAAGCGCGCG
GGCGGACGGT TCGTCGCGAT CAACCCGGTC CGCACCGGCT ACGCGGCAAT CGCCGACGAA
TGGGTACCGA TCCGCCCCGG CACCGACGGC GCGCTGTTCA TGGCGATGAT TCGCGAGCTG
ATCGAGACCG GCGGCTACGA CCGCGACTTC GTCACGCGCT ACACGAACGC GGCCGAGCTG
CTGGACATGC GCGCCGAAGC CGACACGTTC GGCCTCTTCG TGCGCGATGC GTCGCGCCCC
GAGCGCAATC CGCTGTTTCC GCAAAATCAC CTGTGGTGGG ATCTCGGCAG CGGCCGGGCG
GTTGCGCATC ACACGCGCGG CGCGACGCCC GCGCTCGACG GCCGCTACGC GCTCGACGAC
GGCACGCCCG TCGCGCCCTC GTTCGCGCTG CTGCGCGAGC GCGTGGCCGA ATGCACGCCG
CAATGGGCGG AGCGAATCAC GGGCATTCCG GCCGCGACGA TCCGCCGGCT CGCGCATGAG
ATGGCGGACG TCGCGCGCGA TCACAAGATC ACGCTGCCGA TCCGCTGGAC CGACGCGTGG
GGCGAGACGC ACGATACCGT CACCGGCAAC CCGGTTGCAT TCCATGCGAT GCGCGGGCTC
GCCGCGCATT CGAACGGCTT CCAGTCGATA CGCGCGCTCG CGGTGCTGAT GTCGCTGCTC
GGCACGATCG ACCGGCCGGG CGGCTTCAGG CACAAGTCGC CTTATCCGCG AGCGGTGCCG
CCGTCGGCGA AACCGCCGAA CGGCCCCGAC GCGGTGCGCC CGAACACGCC GCTCGCGGCC
GGCCCGCTCG GCTGGCCGGC CGCGCCGGAG GACTTGTTCG TCGACGAGCA AGGCGGCCCG
GTGCGCATCG ACAAGGCGTT CTCGTGGGAA TATCCGCTCG CCGTGCACGG CCTGATGCAC
AGCGTGATCA CGAACGCATG GCGCGGCGAT CCGTATCCGA TCGATACGCT GATGATCTTC
ATGGCGAACA TGGCGTGGAA TTCGTCGATG AACACGGTCG AGGTGCGCCG GATGCTCGCG
GACAGGCACG ACAACGGCGA CTACAAGATC CCGTTCATCG TCGTGTGCGA CGCGTTCCAA
TCCGAGATGA CCGCGTTCGC CGATCTGATC CTGCCCGACA CGACGTATCT CGAACGGCAC
GACGCGATGT CGATGCTCGA CCGGCCGATC TCCGAGTTCG ACGGCCCCGT CGATTCGGTG
CGCATTCCGG TCGTGCCGCC GACGGGCGAA TGCAAGCCGT TCCAGGAAGT GCTGATCGAG
CTCGCGAGCC GGCTGAAGCT GCCCGCGTTC ACGAACGCCG ACGGCACGCG CAAGTTCCGC
GACTATCCGG ACTTCGTCAT CAACTATCAG ACCGCGCCCG ATTCGGGCGT CGGCTTCCTG
ATCGGCTGGC GCGGCGAGGA TGGCGGCGAC GCGCTCGTCG GCGCGCCGAA CCCGCGCCAG
TGGGACGAGT ACGAGAAGCA CGGCTGCGTG TTCCACTACA CGCTGCCGGA CACGCTGCAG
TACATGCGCG GCTGCAACGG CCCGTATCTG AAATGGGCGG TCGAAAAAGG TTTCCGGAAG
TACGACGCGC CGATCGTGAT CCACCTCTAC TCGGACGTGC TGCAGAAATT CCGACTCGCC
GCGCAGGGCA GGACGCGCGG CCGGCAGCCG CCCGAGCACC TGCGCGCGCG TATCGCACGA
CATTTCGATC CGCTGCCGTT CTGGTACGAA CCGCTCGAGC TCGGCGCGAC CGATTTGCAA
CGCTACCCGC TCGCGGCCGT CACGCAGCGG CCGATGGCGA TGTATCACTC GTGGGATTCG
CAGAACGCGT GGCTGCGGCA GATTCATGGG GAGAACGCTC TGTTCGTGAA TCCGAAGGTG
GCGCGCGACG CGGGCATCGA CGACGGCGGC TGGATCTACG TCGAATCGCA ATGGGGCAAG
GTGCGCTGCC GCGCGCGCTA CAGCGAAGTG GTCGAGCCGG GCACCGTCTG GACGTGGAAC
GCGATCGGCA AGGCAGCGGG CGCATGGAAT CTCGGCCCGG ACGCGAACGA ATCGCAGCGC
GCCTTCCTGT TGAACCACGT GATCACCGAC GAGTTGCCCG GCGAAGGCGC GCACGCGCCG
CGCATCTCGA ACTCCGATCC GATCACCGGC CAGGCCGCGT GGTACGACGT GCGCGTGCGC
ATCTACCCGG CCGAGGCCGA CGCGGACCAC ACGCTGCCGC AATTCGCGCC GATGCCTGCG
CTGCCCGGTG TGACCGGCGC GGTGCGGCGC ATCGTGCAAA CCTATTTCGC GGGGCGCGGC
GAATTCGCCG CGCGGCTGCG CGATGCGGCG AAACGCCGTT GA
 
Protein sequence
MDDRARSGGE PREVKTTTCY MCACRCGIRV HLRNGEVRYI DGNPDHPLNQ GVICAKGASG 
IMKQYSPARL TQPLMRKAGA ERGSAQFEPV SWDVAFSVLE QRLAHLRATD PKRFALFTGR
DQMQALTGLF AKQYGTPNYA AHGGFCSANM AAGMIYTVGG SFWEFGGPDL DRAKLFFMIG
TAEDHHSNPL KIAISKFKRA GGRFVAINPV RTGYAAIADE WVPIRPGTDG ALFMAMIREL
IETGGYDRDF VTRYTNAAEL LDMRAEADTF GLFVRDASRP ERNPLFPQNH LWWDLGSGRA
VAHHTRGATP ALDGRYALDD GTPVAPSFAL LRERVAECTP QWAERITGIP AATIRRLAHE
MADVARDHKI TLPIRWTDAW GETHDTVTGN PVAFHAMRGL AAHSNGFQSI RALAVLMSLL
GTIDRPGGFR HKSPYPRAVP PSAKPPNGPD AVRPNTPLAA GPLGWPAAPE DLFVDEQGGP
VRIDKAFSWE YPLAVHGLMH SVITNAWRGD PYPIDTLMIF MANMAWNSSM NTVEVRRMLA
DRHDNGDYKI PFIVVCDAFQ SEMTAFADLI LPDTTYLERH DAMSMLDRPI SEFDGPVDSV
RIPVVPPTGE CKPFQEVLIE LASRLKLPAF TNADGTRKFR DYPDFVINYQ TAPDSGVGFL
IGWRGEDGGD ALVGAPNPRQ WDEYEKHGCV FHYTLPDTLQ YMRGCNGPYL KWAVEKGFRK
YDAPIVIHLY SDVLQKFRLA AQGRTRGRQP PEHLRARIAR HFDPLPFWYE PLELGATDLQ
RYPLAAVTQR PMAMYHSWDS QNAWLRQIHG ENALFVNPKV ARDAGIDDGG WIYVESQWGK
VRCRARYSEV VEPGTVWTWN AIGKAAGAWN LGPDANESQR AFLLNHVITD ELPGEGAHAP
RISNSDPITG QAAWYDVRVR IYPAEADADH TLPQFAPMPA LPGVTGAVRR IVQTYFAGRG
EFAARLRDAA KRR