Gene BURPS1106A_A2926 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2926 
Symbol 
ID4905471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2857232 
End bp2858422 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content73% 
IMG OID640146029 
Productglycosyl transferase, group 2 family protein 
Protein accessionYP_001076955 
Protein GI126456417 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID[TIGR03469] hopene-associated glycosyltransferase HpnB 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.896752 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCTCA TCGTCGTGTT TCTGCTGTCT TGCCTGTCGC TCGTGATCTG GCTCGTGCTG 
CTGTTCGGGC GCGGCGGCTT CTGGCGCGCG CGTGCCGCGC GGCGGCTGCC GCCCGACGCG
CGCGGCGCGG CCGCGGCCGC CGGCTGGCCG GCCGTCGCGA CCGTCGTGCC CGCCCGCAAC
GAGGCGGACG TGATCGGCGA GGCGGTTCGC TCGCTCGTCG AGCAAGCGTA CGAAGGCGCG
TTTCACCTGA TCGTCGTCGA CGACCACAGC ACCGACGGCA CCGCCGAGGC CGCGCGCGCG
GCCGCGGCGG CCGTCGGCTG CGCCGACCGG CTGACCGTGC TCGCCGCGCA GCCGCTGCCC
GCCGGCTGGT CGGGCAAGGT GTGGGCGCAG TCGCAGGGGA TCGCCGCGGT GCGCTCGCTC
GGGCTGCCCG CCGACTACCT GCTGCTGACG GACGCCGACA TCGGTCATCC GCCGGACGCG
GTCGCGCAGC TCGTCACGCG CGCGCAGGCG GAGCAGCGCG ATCTCGTATC GCTGATGGTG
CGGCTGCGCT GCGATTCGTT CTGGGAAAAG GCGCTGATTC CGGCGTTCGT GTTCTTCTTC
GCGAAGCTCT ACCCGTTCTC GTGGATCAAC GATCCGCGCA ACCGGACGGC GGGCGCGGCG
GGCGGCTGCA TGCTCGTGCG CCGCGACGCG CTCGAGGAGG CGGGCGGCAT CGAATCGATC
CGCGGCGCGC TGATCGACGA TTGCAGCCTG GCCGCGCAGA TCAAGCACCG CGGCGCCGGC
CGCCACCCGA TCCGGCTCGA TCTCGCCGAT CGCAGCGTGT CGTTGCGGCC GTACGACAGC
TGGCGCGACA TCTGGAACAT GATCGCGCGC ACCGCGTTCA CGCAGTTGCG GTATTCGCCG
GTGCTGCTGC TCGGCACGCT CGTCGGGATG ACGATCCTCT ACCTGGTGCC GCCCGTCGCC
GCGCTCGCGT ACGGCGCGCG CGCGTGGCCG GCATGGCTCG CGTGGGCGTC GATGTGCACT
GCCTATGCGC CGATGCTCAG CTACTACCGC CGCTCGCCGT GGTGGGCGCC GGCGCTGCCG
CTCGTCGCGC TGTTCTATGT CGGCGCGACG TTCGCGTCGG CCGTGCGCTA CTGGCGCGGC
AAGGGCGGAC AGTGGAAGGC GCGCGTGCAG GCGCCGGTGC GGGATCGTTG A
 
Protein sequence
MTLIVVFLLS CLSLVIWLVL LFGRGGFWRA RAARRLPPDA RGAAAAAGWP AVATVVPARN 
EADVIGEAVR SLVEQAYEGA FHLIVVDDHS TDGTAEAARA AAAAVGCADR LTVLAAQPLP
AGWSGKVWAQ SQGIAAVRSL GLPADYLLLT DADIGHPPDA VAQLVTRAQA EQRDLVSLMV
RLRCDSFWEK ALIPAFVFFF AKLYPFSWIN DPRNRTAGAA GGCMLVRRDA LEEAGGIESI
RGALIDDCSL AAQIKHRGAG RHPIRLDLAD RSVSLRPYDS WRDIWNMIAR TAFTQLRYSP
VLLLGTLVGM TILYLVPPVA ALAYGARAWP AWLAWASMCT AYAPMLSYYR RSPWWAPALP
LVALFYVGAT FASAVRYWRG KGGQWKARVQ APVRDR