Gene BURPS668_0294 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_0294 
Symbol 
ID4882838 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp277892 
End bp279139 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content72% 
IMG OID640126222 
Productputative glycosyltransferase 
Protein accessionYP_001057347 
Protein GI126439262 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID[TIGR03472] hopanoid biosynthesis associated glycosyl transferase protein HpnI 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGCGC TCGCGTGGCT CACCGGCGCG GTCCTTGCGC TGCTCGCGGC CGAGCTGATC 
TCGTTCGGCG ACATCGGCCG CTTCGTCACG CTGCTCGCCG CGTTGCTCGC CGCGCCATGC
GCGCTCGCGG CCGCGTTCGG CTGCGTGTAC ACGCTCGTCG CCGCGGCGCT CACGCACCGT
TTTTTCGCGC GTGCGCCACG CGAGCCGCAC GCGTGCCCGC CCGTCACGAT CGTCAAGCCG
TTGCACGGCA TCGAGCGGAC GCTGTTCGCG AACCTCGCGA GCTTTTGCGA GCAGCGCTAC
GACGGGCCGA TCCAGTTCCT GTTCGGCGTG CACGATCGCG ACGATCCCGC GCTGCGCGCC
GTCGACGCGC TGCGCACCGC GTTTCCCCGC GCGCACGTGA CGATCGTCGC CGACGCCCGG
CTGTACGGGC CGAACCGCAA GATCGCGAAC CTCGTCAACA TGCTGCCCGC CGCCGCGCAT
GACGTGCTGA TCTTCGCGGA CAGCGACGTG AGCGTCGGCC CCGACTACGT GCGGCATATC
GTCGGCGAGC TCGGCGAGCC GGGCGTCGGG CTCGTGACCT GCGTCTATCG CGGCCGCCCG
GACCCGGGCT TCTGGCCGCG CGTCGAGGCG CTCGTCACCA GCCATCAGTT CCTGCCGGGC
GTGGTGACGG GCCTCGCGCT GAAGCTCGCG CGGCCGTGTT TCGGCCAGAC GATCGCGATG
CGCCGCGCCA TGCTCGACGC GATCGGCGGC CTCGCGCAGT TCGCCCATCA CCTCGCCGAG
GATCACGCGA TCGGCGAAGC CGTGCGCGCG CGCGGCGCGC GCGTCGTCGT GCCGCCGTTC
GCGGTCGAGC ACGGCTGCGT CGAGACGCGC GTCGCGCAGC TCGTCGAACA CGAATTGCGC
TGGAGCCGCA CGATCCGTGC GGTCGACCCG CGCGGCCATC TGGGCTCGCT GCTCACGCAT
CCGCTCGCGC TCGCGCTGCT CGCCGGCGTG CTATCGAGCG GCGCCGCGTG GGCGTGGCCG
CTCGTGCCTG CCGCGCTCGT CGCGCGCGTC GCCGCGAAAC GCATCGTCGA TCGCGCGACG
AAGCGGCCGG TGCGCGACCT GTGGCTGCTG CCGCTCGCGG ATCTGATCGC CTTCGGCATC
TTCGTCGCGA GCTTCTCGTC GTCGCGCGTG ATCTGGCGCG GCTTCAGCTT CGACGTCGAT
CGCGACGGCC GCCTGTGCCC CACGCCGGAA AAACGCCCGA ATGCCTGA
 
Protein sequence
MKALAWLTGA VLALLAAELI SFGDIGRFVT LLAALLAAPC ALAAAFGCVY TLVAAALTHR 
FFARAPREPH ACPPVTIVKP LHGIERTLFA NLASFCEQRY DGPIQFLFGV HDRDDPALRA
VDALRTAFPR AHVTIVADAR LYGPNRKIAN LVNMLPAAAH DVLIFADSDV SVGPDYVRHI
VGELGEPGVG LVTCVYRGRP DPGFWPRVEA LVTSHQFLPG VVTGLALKLA RPCFGQTIAM
RRAMLDAIGG LAQFAHHLAE DHAIGEAVRA RGARVVVPPF AVEHGCVETR VAQLVEHELR
WSRTIRAVDP RGHLGSLLTH PLALALLAGV LSSGAAWAWP LVPAALVARV AAKRIVDRAT
KRPVRDLWLL PLADLIAFGI FVASFSSSRV IWRGFSFDVD RDGRLCPTPE KRPNA