Gene BURPS1106A_2326 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_2326 
Symbol 
ID4900383 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp2301943 
End bp2302941 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content72% 
IMG OID640135555 
Productzinc-binding dehydrogenase family oxidoreductase 
Protein accessionYP_001066590 
Protein GI126454508 
COG category[R] General function prediction only 
COG ID[COG2130] Putative NADP-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0803814 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCACAGG TCAACCGGCG GGTGCTGCTC GTGTCGCGCC CCGAACGCGA GGCGCGCGTC 
GAGAACTTCG AACTCGTCGA GACGCCGCTT GCGCCGCTCG CCGTAGGCGA GGTGCGCGTG
CGCAATCATT TTCTGTCGAT CGATCCGTAC ATGCGCGGGC GGATGAACGC GGGGCGCTCG
TACGCCGAGC CGCAACCGCT CGGCGAGGTG ATGGGCGGCG GCACCGCCGG CGAGGTCGTC
GAGTCGCGCA ATCCGGCGTT CGCCCCCGGC GATCGCGTGA TCGGCGCGTA CGGCTGGCAG
GAGTACGGCA CGTCGGCGGG CAAGGAACTG CGCAAGGTCG ACACGACGCG CGTGCCGCTG
TCCGCGTATC TCGGAGCCGC CGGAATGCCC GGCGTGACCG CGTGGTACGG CCTGAACCGG
ATCATCCGGC CGCGCGCGGG CGAGACGCTC GTCGTCAGCG CGGCGAGCGG CGCGGTCGGC
AGCGTGGTCG GGCAGCTCGC GAAGCTCGCC GGGTGTCGCG CGGTCGGCAT CGCGGGCGGC
GCGGACAAGT GCCGCTACGT CGTCGATACG CTCGGCTTCG ATGCGTGCGT CGACTACAAG
GCGGGCCGGC TCGCCGACGA TCTCGCGGCC GCCGCGCCGG ACGGCGTCGA CGGCTGTTTC
GAGAACGTCG GCGGCGCGGT GCTCGATGCG ACGCTCGCGC GGATGAACCC GTTCGGGCGC
ATCGCGATGT GCGGGATGAT CGCCGCGTAC GACGGCGCGC CCGCGCCGCT CGCGAACCCG
GCGCTGATCC TGCGCGAGCG GCTGCTCGTG CAGGGCTTCA TCGTGTCCGA GCACTTCGAC
GTGTGGCCCG AGGCGCTCGC GCAGCTCGCG TCGCTCGTCG CGAACAGGCA GCTGCATTAT
CGGGAGACGA TCGCGCAGGG CCTCGAGCGC GCGCCCGACG CGCTGCTCGG GCTGCTGAAA
GGGCGCAATT TCGGCAAGCA GCTCGTCGCG CTCGTCTGA
 
Protein sequence
MSQVNRRVLL VSRPEREARV ENFELVETPL APLAVGEVRV RNHFLSIDPY MRGRMNAGRS 
YAEPQPLGEV MGGGTAGEVV ESRNPAFAPG DRVIGAYGWQ EYGTSAGKEL RKVDTTRVPL
SAYLGAAGMP GVTAWYGLNR IIRPRAGETL VVSAASGAVG SVVGQLAKLA GCRAVGIAGG
ADKCRYVVDT LGFDACVDYK AGRLADDLAA AAPDGVDGCF ENVGGAVLDA TLARMNPFGR
IAMCGMIAAY DGAPAPLANP ALILRERLLV QGFIVSEHFD VWPEALAQLA SLVANRQLHY
RETIAQGLER APDALLGLLK GRNFGKQLVA LV