Gene BMAA2047 details

Gene Information

Locus tag: BMAA2047
Symbol:
ID: 3086785
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia mallei ATCC 23344
Kingdom: Bacteria
Replicon accession: NC_006349
Strand:
Start bp: 2239931
End bp: 2242867
Gene Length: 2937 bp
Protein Length: 978 aa
Translation table: 11
GC content: 67%
IMG OID: 637565910
Product: molybdopterin oxidoreductase family protein
Protein accession: YP_106559
Protein GI: 53717523
COG category: [C] Energy production and conversion
COG ID: [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
TIGRFAM ID:
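
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. Neither convention is stated in the record itself.

```python
# Values copied from the Gene Information fields above.
start_bp = 2239931
end_bp = 2242867

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 2937 978
```

Both results match the listed values of 2937 bp and 978 aa.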


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGAGAG CGAACATGGA CGACCGTGCG CGATCCGGCG GCGAACCACG CGAAGTGAAG 
ACGACGACCT GCTACATGTG CGCATGCCGC TGCGGCATCC GCGTGCACTT GCGCAACGGC
GAAGTCCGCT ACATCGACGG CAACCCCGAC CATCCGCTGA ACCAGGGCGT GATCTGCGCG
AAAGGCGCAT CGGGCATCAT GAAACAGTAT TCGCCCGCGC GCCTCACGCA GCCGCTGATG
CGCAAGGCGG GCGCCGAGCG CGGCAGCGCG CAGTTCGAGC CGGTATCGTG GGACGTCGCG
TTCTCCGTGC TCGAACAGCG GCTCGCGCAT CTGCGCGCGA CGGATCCGAA GCGCTTCGCG
CTCTTCACCG GCCGCGACCA GATGCAGGCG CTCACCGGCC TGTTCGCGAA GCAGTACGGC
ACGCCGAATT ACGCGGCGCA CGGCGGCTTT TGCTCGGCGA ACATGGCGGC CGGCATGATC
TATACGGTCG GCGGCTCGTT CTGGGAATTC GGCGGCCCCG ATCTCGATCG CGCGAAGCTG
TTCTTCATGA TCGGCACCGC CGAGGATCAT CATTCGAATC CGCTGAAGAT CGCGATCTCG
AAATTCAAGC GCGCGGGCGG ACGGTTCGTC GCGATCAACC CGGTCCGCAC CGGCTACGCG
GCAATCGCCG ACGAATGGGT ACCGATCCGC CCCGGCACCG ACGGCGCGCT GTTCATGGCG
ATGATTCGCG AGCTGATCGA GACCGGCGGC TACGACCGCG ACTTCGTCAC GCGCTACACG
AATGCGGCCG AGCTGCTGGA CATGCGCGCC GAAGTCGACA CGTTCGGCCT CTTCGTGCGC
GATGCGTCGC GCCCCGAGCG CAATCCGCTG TTTCCGCAAA ATCACCTGTG GTGGGATCTC
GGCAGCGGCC GGGCGGTTGC GCATCACACG CGCGGCGCGA CGCCCGCGCT CGACGGCCGC
TACGCGCTCG ACGACGGCAC GCCCGTCGCG CCCTCGTTCG CGCTGCTGCG CGAGCGCGTG
GCCGAATGCA CGCCGCAATG GGCGGAGCGA ATCACGGGCA TTCCGGCCGC GACGATCCGC
CGGCTCGCGC ATGAGATGGC GGACGTCGCG CGCGATCACA AGATCACGCT GCCGATCCGC
TGGACCGACG CGTGGGGCGA GACGCACGAT ACCGTCACCG GCAACCCGGT TGCATTCCAT
GCGATGCGCG GGCTCGCCGC GCATTCGAAC GGCTTCCAGT CGATACGCGC GCTCGCGGTG
CTGATGTCGC TGCTCGGCAC GATCGACCGG CCGGGCGGCT TCAGGCACAA GTCGCCCTAT
CCGCGCGCGG TGCCGCCGTC GGCGAAACCG CCGAACGGCC CCGACGCGGT GCGCCCGAAC
ACGCCGCTCG CGGCCGGCCC GCTCGGCTGG CCGGCCGCGC CGGAGGACTT GTTCGTCGAC
GAGCAAGGCG GCCCGGTACG CATCGACAAG GCGTTCTCGT GGGAATATCC GCTCGCCGTG
CACGGCCTGA TGCACAGCGT GATCACGAAC GCATGGCGCG GCGATCCGTA TCCGATCGAT
ACGCTGATGA TCTTCATGGC GAACATGGCG TGGAATTCGT CGATGAACAC GGTCGAGGTG
CGCCGGATGC TCGCGGACAG GCACGACAAC GGCGACTACA AGATCCCGTT CATCGTCGTG
TGCGACGCGT TCCAATCCGA GATGACCGCG TTCGCCGATC TGATCCTGCC CGACACGACG
TATCTCGAAC GGCACGACGC GATGTCGATG CTCGACCGGC CGATCTCCGA GTTCGACGGC
CCCGTCGATT CGGTGCGCAT TCCGGTCGTG CCGCCGACGG GCGAATGCAA GCCGTTCCAG
GAAGTGCTGA TCGAGCTCGC GAGCCGGCTG AAGCTGCCCG CGTTCACGAA CGCCGACGGC
ACGCGCAAGT TCCGCGACTA TCCGGACTTC GTCATCAACT ATCAGACCGC GCCCGATTCG
GGCGTCGGCT TCCTGATCGG CTGGCGCGGC GAGGATGGCG GCGACGCGCT CGTCGGCGCG
CCGAACCCGC GCCAGTGGGA CGAGTACGAG AAGCACGGCT GCGTGTTCCA CTACACGCTG
CCGGACACGC TGCAGTACAT GCGCGGCTGC AACGGCCCGT ATCTGAAATG GGCGGTCGAA
AAAGGTTTCC GGAAGTACGA CGCGCCGATC GTGATTCACC TCTACTCGGA CGTGCTGCAG
AAATTCCGAC TCGCCGCGCA GGGCAGGACG CGCGGCCGGC AGCCGCCCGA GCACCTGCGC
GCGCGTATCG CACGACATTT CGATCCGCTG CCGTTCTGGT ACGAACCGCT CGAGCTCGGC
GCGACCGATT TGCAACGCTA CCCGCTCGCG GCCGTCACGC AGCGGCCGAT GGCGATGTAT
CACTCGTGGG ATTCGCAGAA CGCGTGGCTG CGGCAGATTC ATGGGGAGAA CGCTCTGTTC
GTGAATCCGA AGGTGGCGCG CGACGCGGGC ATCGACGACG GCGGCTGGAT CTACGTCGAA
TCGCAATGGG GCAAGGTGCG CTGCCGCGCG CGCTACAGCG AAGTGGTCGA GCCGGGCACC
GTCTGGACGT GGAACGCGAT CGGCAAGGCA GCGGGCGCAT GGAATCTCGG CCCGGACGCG
AACGAATCGC AGCGCGCCTT CCTGTTGAAC CACGTGATCA CCGACGAGTT GCCCGGCGAA
GGCGCGCACG CGCCGCGCAT CTCGAACTCC GATCCGATCA CCGGCCAGGC CGCGTGGTAC
GACGTGCGCG TGCGCATCTA CCCGGCCGAG GCCGACGCGG ACCACACGCT GCCGCAATTC
GCGCCGATGC CTGCGCTGCC CGGTGTGACG GGCGCGGTGC GGCGCATCGT GCAAACCTAT
TTCGCGGGGC GCGGCGAATT CGCCGCGCGG CTGCGCGATG CGGCGAAACG CCGTTGA
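
The gene sequence is printed in groups of 10 bases, so the whitespace has to be removed before the sequence is used elsewhere. The following is a minimal sketch of that cleanup together with a GC-percentage calculation of the kind behind the "GC content" field. The function names `clean_sequence` and `gc_percent` are illustrative and are not part of any database tool. Only the first line of the sequence is used here as a demonstration.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (10-base grouping, line breaks) and uppercase."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used as a small demonstration.
first_line = "ATGCAGAGAG CGAACATGGA CGACCGTGCG CGATCCGGCG GCGAACCACG CGAAGTGAAG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Running `gc_percent` on the full cleaned sequence should give a value close to the 67% listed in the record, assuming that field was computed over the same 2937 bases.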
 
Protein sequence
MQRANMDDRA RSGGEPREVK TTTCYMCACR CGIRVHLRNG EVRYIDGNPD HPLNQGVICA 
KGASGIMKQY SPARLTQPLM RKAGAERGSA QFEPVSWDVA FSVLEQRLAH LRATDPKRFA
LFTGRDQMQA LTGLFAKQYG TPNYAAHGGF CSANMAAGMI YTVGGSFWEF GGPDLDRAKL
FFMIGTAEDH HSNPLKIAIS KFKRAGGRFV AINPVRTGYA AIADEWVPIR PGTDGALFMA
MIRELIETGG YDRDFVTRYT NAAELLDMRA EVDTFGLFVR DASRPERNPL FPQNHLWWDL
GSGRAVAHHT RGATPALDGR YALDDGTPVA PSFALLRERV AECTPQWAER ITGIPAATIR
RLAHEMADVA RDHKITLPIR WTDAWGETHD TVTGNPVAFH AMRGLAAHSN GFQSIRALAV
LMSLLGTIDR PGGFRHKSPY PRAVPPSAKP PNGPDAVRPN TPLAAGPLGW PAAPEDLFVD
EQGGPVRIDK AFSWEYPLAV HGLMHSVITN AWRGDPYPID TLMIFMANMA WNSSMNTVEV
RRMLADRHDN GDYKIPFIVV CDAFQSEMTA FADLILPDTT YLERHDAMSM LDRPISEFDG
PVDSVRIPVV PPTGECKPFQ EVLIELASRL KLPAFTNADG TRKFRDYPDF VINYQTAPDS
GVGFLIGWRG EDGGDALVGA PNPRQWDEYE KHGCVFHYTL PDTLQYMRGC NGPYLKWAVE
KGFRKYDAPI VIHLYSDVLQ KFRLAAQGRT RGRQPPEHLR ARIARHFDPL PFWYEPLELG
ATDLQRYPLA AVTQRPMAMY HSWDSQNAWL RQIHGENALF VNPKVARDAG IDDGGWIYVE
SQWGKVRCRA RYSEVVEPGT VWTWNAIGKA AGAWNLGPDA NESQRAFLLN HVITDELPGE
GAHAPRISNS DPITGQAAWY DVRVRIYPAE ADADHTLPQF APMPALPGVT GAVRRIVQTY
FAGRGEFAAR LRDAAKRR
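
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below shows that relationship on the first 30 bases. It is an illustration, not the database's own procedure. It assumes the codon assignments of the standard genetic code, which translation table 11 shares for everything except the set of permitted start codons. The gene starts with ATG, so that difference does not matter here. The `translate` helper is a hypothetical name.

```python
# Compact codon table: codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the coding sequence
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCAGAGAG CGAACATGGA CGACCGTGCG"))  # MQRANMDDRA
```

The output matches the first 10 residues of the protein sequence. Applying the same function to the final codons of the gene, `AAA CGC CGT TGA`, gives `KRR`, which matches the end of the protein.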