Gene BURPS1106A_3053 details

Gene Information

Locus tag: BURPS1106A_3053
Symbol: zwf
ID: 4901937
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 2982213
End bp: 2983682
Gene Length: 1470 bp
Protein Length: 489 aa
Translation table: 11
GC content: 68%
IMG OID: 640136279
Product: glucose-6-phosphate 1-dehydrogenase
Protein accession: YP_001067292
Protein GI: 126454033
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0364] Glucose-6-phosphate 1-dehydrogenase
TIGRFAM ID: [TIGR00871] glucose-6-phosphate 1-dehydrogenase
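
The coordinates and lengths listed above are mutually consistent, and a short check makes the arithmetic explicit. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the gene record.
start_bp = 2982213
end_bp = 2983682

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Each codon is three bases; the final codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1470
print(protein_length)  # 489
```

Both results match the Gene Length (1470 bp) and Protein Length (489 aa) fields of the record.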


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGCATACCG ATTCGAGCTT CACCTTCGTT CTCTTCGGCG GCACCGGCGA TCTGTCGATG 
CGCAAGATCC TCCCCGCGCT CTTCGAAGCG CATCGCGCGA ACATGCTGTC GGAAGCCGGC
AGGATCGTCG CCGTGGCCCG CCACGCGGCG GACCGCGAAG GCTACCTGCA GTGGGTCGAG
GAGCACGTGA AGCCGCACGC GGCGAAGGCG GCGGGCGGCG CGTTCGACGA AGCGGTCTGG
CGGAGCTTTC TCGAGCGCAT CGTCTACGTG AAGCTCGACC TCGGCCGCGC GGAAGATTAC
GCGCTGCTGC GCGACACGGT CGGCGGGCTC TCGGGCATCC GCGTGTTCTA CCTGGCGACG
GGCCCGTCGC TGTTCGTGCC GATCTGCAAG GCGCTCGCCG CGGTGGGCCT GAACGAAGGC
GCGCGCATCG TGCTCGAGAA GCCGCTCGGC TACGACCTGC GCTCGTCGAA CGCGATCAAC
GACGCGGTGG GCGAGATCTT CGCCGAAGAC CAGATCTACC GGATCGATCA CTACCTCGGC
AAGGAGCCGG TGCAGAACCT GCTCGCGCTG CGCTTCGGCA ACGCGCTCTT CGAGCCGCTG
TGGCGCCGCG AATGGGTGGA GAGCATCCAG ATCACGATCG CCGAGGAACT CGGCGTCGAG
GCGCGCGGCG ATTTCTACGA CAATACCGGC GCGCTGCGCG ACATGGTGCA GAACCACCTG
CTGCAGCTGC TGTCGATCGT CGCGATGGAG CCGCCGCACT CGATGGATTC CGATTCGGTG
CGCGACGAGA AGCTGCGCGT GCTGCGCGCG TTGAAGCCCG TCGATCCGCG CGACATCGGC
AAGGTCGCGG TGCGCGGCCA GTACCACGCG GGCGTGATCA AGGGCGCGCA GGTGCCCGCG
TACGCGACCG AGCCCGGCGT GAAGGCGGAC AGCCAGACCG AGACGTTCGT CGCGCTGAAG
GTCGAGATCG AGAACTGGCG CTGGGCCGGC GTGCCGTTCT TCCTGCGCAC CGGCAAGCGC
CTCGCCGACC GCGTCGCGGA GATCGTCGTC AACTTCCGGC CGGTGCCGCA CTCGGCGCTC
GGCCCCACCG CGCTGCGCGC GGGCGCGAAC CGTCTCGTGA TCCGGCTGCA GCCGAACGAA
TCGATCCGCC TGTACTGCCT CGCGAAGCAG CCGGGCGAAG GGATGAACCT GGCAAGCGTG
CACCTCGACC TCGCGTTCGA CCAGTTCTTC AAGGAAGGCC AGATGGAGGC GTACCAGCGC
CTGCTGCTCG ACGTGATCAA CGGCCGCCTC GCGCTCTTCG TCCGGCGCGA CGAACAGGAA
GCCGCATGGC GCTGGGTCGA GCCGATCCTG AACGAATGGG CGCGCACGAC GAAGCCGCCG
AAGCCGTACG CGGCCGGCAC CTGGGGCCCG GCCGCGGCGA GCGCGATGCT CGCGCAGCAC
GGCACCTGCT GGCTCGAAGA AGAAAACTGA
 
Protein sequence
MHTDSSFTFV LFGGTGDLSM RKILPALFEA HRANMLSEAG RIVAVARHAA DREGYLQWVE 
EHVKPHAAKA AGGAFDEAVW RSFLERIVYV KLDLGRAEDY ALLRDTVGGL SGIRVFYLAT
GPSLFVPICK ALAAVGLNEG ARIVLEKPLG YDLRSSNAIN DAVGEIFAED QIYRIDHYLG
KEPVQNLLAL RFGNALFEPL WRREWVESIQ ITIAEELGVE ARGDFYDNTG ALRDMVQNHL
LQLLSIVAME PPHSMDSDSV RDEKLRVLRA LKPVDPRDIG KVAVRGQYHA GVIKGAQVPA
YATEPGVKAD SQTETFVALK VEIENWRWAG VPFFLRTGKR LADRVAEIVV NFRPVPHSAL
GPTALRAGAN RLVIRLQPNE SIRLYCLAKQ PGEGMNLASV HLDLAFDQFF KEGQMEAYQR
LLLDVINGRL ALFVRRDEQE AAWRWVEPIL NEWARTTKPP KPYAAGTWGP AAASAMLAQH
GTCWLEEEN
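
The sequences above are displayed in blocks of ten characters, so whitespace has to be stripped before they can be used. The following is a minimal sketch of cleaning a gene sequence block and translating it. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons, not in amino acid assignments). The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical and do not come from any database tool.

```python
# Standard codon assignments, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate a cleaned DNA sequence, stopping at the first stop codon.

    Raises KeyError for codons containing anything other than A, C, G, T.
    """
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# The first 30 bases of the gene sequence, as displayed in the record.
first_bases = clean_sequence("ATGCATACCG ATTCGAGCTT CACCTTCGTT")
print(translate(first_bases))  # MHTDSSFTFV
```

The translated fragment matches the first ten residues of the protein sequence listed in the record. Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare against the GC content field.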