Gene BURPS1710b_A1283 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A1283 
Symbol 
ID3693608 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp1595497 
End bp1596687 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content73% 
IMG OID637731537 
Productglycosyl transferase, group 2 family protein 
Protein accessionYP_336440 
Protein GI76819071 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID[TIGR03469] hopene-associated glycosyltransferase HpnB 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCTCA TCGTCGTGTT TCTGCTGTCT TGCCTGTCGC TCGTGATCTG GCTCGTGCTG 
CTGTTCGGGC GCGGCGGCTT CTGGCGCGCG CGTGCCGCGC GGCGGCTGCC GCCCGACGCG
CGCGGCGCGG CCGCGGCCGC CGGCTGGCCG GCCGTCGCGG CCGTCGTGCC CGCCCGCAAC
GAGGCGGACG TGATCGGCGA GGCGGTTCGC TCGCTCGTCG AGCAAGCGTA CGAAGGCGCG
TTTCACCTGA TCGTCGTCGA CGACCACAGC ACCGACGGCA CCGCCGAGGC CGCGCGCGCG
GCCGCGGCGG CCGTCGGCTG CGCCGACCGG CTGACCGTGC TCGCCGCGCA GCCGCTGCCC
GCCGGCTGGT CGGGCAAGGT GTGGGCGCAG TCGCAGGGGA TCGCCGCGGT GCGCTCGCTC
GGGCTGCCCG CCGACTACCT GCTGCTGACG GACGCCGACA TCGGTCATCC GCCGGACGCG
GTCGCGCAGC TCGTCACGCG CGCGCAGGCG GAGCAGCGCG ATCTCGTATC GCTGATGGTG
CGGCTGCGCT GCGATTCGTT CTGGGAAAAG GCGCTGATTC CGGCGTTCGT GTTCTTCTTC
GCGAAGCTCT ACCCGTTCTC GTGGATCAAC GATCCGCGCA ACCGGACGGC GGGCGCGGCG
GGCGGCTGCA TGCTCGTGCG CCGCGACGCG CTCGAGGAGG CGGGCGGCAT CGAATCGATC
CGCGGCGCGC TGATCGACGA TTGCAGCCTG GCCGCGCAGA TCAAGCACCG CGGCGCCGGC
CGCCACCCGA TCCGGCTCGA TCTCGCCGAT CGCAGCGTGT CGTTGCGGCC GTACGACAGC
TGGCGCGACA TCTGGAACAT GATCGCGCGC ACCGCGTTCA CGCAGTTGCG GTATTCGCCG
GTGCTGCTGC TCGGCACGCT CGTCGGGATG ACGATCCTCT ACCTGGTGCC GCCCGTCGCC
GCGCTCGCGT ACGGCGCGCG CGCGTGGCCG GCATGGCTCG CGTGGGCGTC GATGTGCACC
GCCTATGCGC CGATGCTCAG CTACTACCGC CGCTCGCCGT GGTGGGCGCC GGCGCTGCCG
CTCGTCGCGC TGTTCTATGT CGGCGCGACG TTCGCGTCGG CCGTGCGCTA CTGGCGCGGC
AAGGGCGGAC AGTGGAAGGC GCGCGTGCAG GCGCCGGTGC GGGATCGTTG A
 
Protein sequence
MTLIVVFLLS CLSLVIWLVL LFGRGGFWRA RAARRLPPDA RGAAAAAGWP AVAAVVPARN 
EADVIGEAVR SLVEQAYEGA FHLIVVDDHS TDGTAEAARA AAAAVGCADR LTVLAAQPLP
AGWSGKVWAQ SQGIAAVRSL GLPADYLLLT DADIGHPPDA VAQLVTRAQA EQRDLVSLMV
RLRCDSFWEK ALIPAFVFFF AKLYPFSWIN DPRNRTAGAA GGCMLVRRDA LEEAGGIESI
RGALIDDCSL AAQIKHRGAG RHPIRLDLAD RSVSLRPYDS WRDIWNMIAR TAFTQLRYSP
VLLLGTLVGM TILYLVPPVA ALAYGARAWP AWLAWASMCT AYAPMLSYYR RSPWWAPALP
LVALFYVGAT FASAVRYWRG KGGQWKARVQ APVRDR