Gene BURPS668_3211 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_3211 
Symbol 
ID4883050 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp3141713 
End bp3142648 
Gene Length936 bp 
Protein Length311 aa 
Translation table11 
GC content66% 
IMG OID640129139 
ProductKpsF/GutQ family sugar isomerase 
Protein accessionYP_001060222 
Protein GI126438820 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0794] Predicted sugar phosphate isomerase involved in capsule formation 
TIGRFAM ID[TIGR00393] KpsF/GutQ family protein 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACCATC ACAACTATCT TGATTCCGCG CGGCAAGTCT TCGATATCGA GTCGCGCGCG 
CTCGCGAGTC TCTCGGCACG CGTGGGCGAC TCGTTCGGCG ACGCGGTCGA CGCGATCCTG
CGCTCGAGTG GCCGCGTGGT CGTTTGCGGC ATGGGCAAAT CCGGGATCAT CGGGCGCAAG
ATCGCGGCGA CGTTCGCGAG CACCGGAACG CCGAGCTTTT TCATGCATCC GGGCGAGGCG
TACCACGGCG ACCTCGGCAT GGTGACGTCG GCCGACACGT TTCTCGCGAT CTCGTATTCG
GGCGAGACGG ACGAAGTGAT CAAGCTGATT CCGTTTCTGA AAAGCAACCG GAATTATCTC
GTCGCGCTGA CGGGCAACGC GCGATCGACG CTCGCGCAAG CGGCGCACAG CCATCTCGAC
GCGGGCGTCG AGCAGGAAGC GTGTCCGCTT CAGCTCGCAC CGACTTCGTC GACCACGGCC
GCGCTCGCGA TGGGCGACGC ACTCGCCGTC ACGCTGATGA AGGCACGCGG CTTTCGCCCG
GAGAACTTCG CGCGCTTCCA CCCGGGCGGC TCGCTCGGCC GCCGCCTGCT GTCGAAGGTC
GACGACGAAA TGACCGTCGA CGGCCTGCCG TTCGTCGACG AGCGCGCGCC GGCGATCGAC
GTGCTGCAGG CGATGACGCG GGGCCGGCTC GGGCTCGCGA TCGTGCGGCG CGAAACGGGC
TTCGGAATCG TCACCGACGG CGATGTCCGT CGCGCGATCG AGGCGTATGG AGACACGCTG
TTCCGGCGCG CCGCGTCGGA TCTGATGTCG GCGGATCCGG CGATGGTGCC GCTCGGCACG
CGCGTGGAAG ACGCGTTGCT GATGATGGAG GCGCGGCGCA TCAACGCGCT GCTCGTATTC
GACGGCGAAG ACGTCGTCGG CGTGTTCAAG AAATAG
 
Protein sequence
MNHHNYLDSA RQVFDIESRA LASLSARVGD SFGDAVDAIL RSSGRVVVCG MGKSGIIGRK 
IAATFASTGT PSFFMHPGEA YHGDLGMVTS ADTFLAISYS GETDEVIKLI PFLKSNRNYL
VALTGNARST LAQAAHSHLD AGVEQEACPL QLAPTSSTTA ALAMGDALAV TLMKARGFRP
ENFARFHPGG SLGRRLLSKV DDEMTVDGLP FVDERAPAID VLQAMTRGRL GLAIVRRETG
FGIVTDGDVR RAIEAYGDTL FRRAASDLMS ADPAMVPLGT RVEDALLMME ARRINALLVF
DGEDVVGVFK K