Gene BURPS668_A3157 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A3157 
Symbol 
ID4888190 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp2983729 
End bp2985060 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content73% 
IMG OID640133093 
Productglycosyl transferase, group 1 family protein 
Protein accessionYP_001064148 
Protein GI126442881 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGAAAA TCGCGTTGAT CAGTGAGCAC GCATCGCCGC TCGGCGTCAT CGGAGGCGTC 
GACGCGGGCG GCCAGAACAT CTATGTCGCG AACGTCGCCA AGCAGCTCGC GCGGCTCGGC
GTCGACGTCG ACGTGTTCAC GCGCTGCGAC AATCCGCACC TGCCCGACGT CGCGCACATC
GGCGCGGGCA TCCGCGTGAT CCACGTACCG GCCGGCCCGC CGTCGAACGT ACCGAAGGAA
GCGCTGCTGC CGTACATGAA GGCGTTCTCG GCATTCCTCA TCGACTGGTT CCGGCGCGAG
CCGACGCCTT ACGACGCGAT GCACGCGAAC TTCTTCATGT CCGGCGACGC GGCGCTGCGC
GTGAAGGCGC GCCTCGGCGT GCCGCTCGTG ATGACGTTCC ATGCGCTCGG CCGCGTGCGC
CGCCGGCATC AGGGCGCGGC CGACGGCTTT CCGGACGCGC GCTTTCCGAT CGAGGACGCG
CTCGCGAAGC GCGCCGATCG CGTGATCGCC GAGTGCCCGC AGGACGCGGC CGATCTGCGC
GCGCTGTACC GCGCCGATCC GGGCCGCATC GAGATCGTGC CGTGCGGCTT CGACGAAGAA
GAGTTTCGCC CGGTGCTGCG GCGCGCCGCG CGCGCGCGGC TCGGCTGGCG CGACGACGAA
TTCGCGGTGC TGCAGCTCGG GCGCCTCGTG CCGCGCAAGG GCATCGACAA CGTGATCGAG
GCGCTCGCGC GCGTGCCGCG CGACGCGGGC GCGCGGCCGG CCCGTCTCTA TGTGGTGGGC
GGCAGCGACT ACGAGCCGGA CCCGTCGCGC TGCGCGGAGC TCGCGCGCCT CGCCGGCATC
GCGCGCGAAG CCGGCGTGGC CGATCGCGTG ACGTTCGTCG GCCGGCGCGA TCGCGACGCG
CTGCATCTCT ACTACGGCGC GGCCGACGTG TTCGTGACGA CGCCGTGGTA CGAACCGTTC
GGGATCACGC CCGTCGAGGC GATGGCGTGC GCGACGCCCG TGATCGGCAG CGACGTCGGC
GGCATCCGCA CGACAGTCGA GCATGGCGTG ACGGGCTATC TCGTCGCGCC GCGCGATCCG
GGCGCGCTCG CCGCGCGGCT CGACGAACTG CGGCGCGACC CCGAGCGCGC GCAGCAGTTG
GGCTGGGCCG GCTACCGGCG CGCGCATCGC CATTACACGT GGCGCGGCGT GGCCGAGCGG
CTCGCGGCGA TCTATCGCGA CGTCGCCGCG TGCGCGCGGC GCGGCGCGCG CGCGGGCACG
GCGGCGCACG TGCGGCGCTC GCCCGTCGCG CCCTCGGCAA CGGTTGCGAA CCAGAAGGAG
AACGGATCAT GA
 
Protein sequence
MQKIALISEH ASPLGVIGGV DAGGQNIYVA NVAKQLARLG VDVDVFTRCD NPHLPDVAHI 
GAGIRVIHVP AGPPSNVPKE ALLPYMKAFS AFLIDWFRRE PTPYDAMHAN FFMSGDAALR
VKARLGVPLV MTFHALGRVR RRHQGAADGF PDARFPIEDA LAKRADRVIA ECPQDAADLR
ALYRADPGRI EIVPCGFDEE EFRPVLRRAA RARLGWRDDE FAVLQLGRLV PRKGIDNVIE
ALARVPRDAG ARPARLYVVG GSDYEPDPSR CAELARLAGI AREAGVADRV TFVGRRDRDA
LHLYYGAADV FVTTPWYEPF GITPVEAMAC ATPVIGSDVG GIRTTVEHGV TGYLVAPRDP
GALAARLDEL RRDPERAQQL GWAGYRRAHR HYTWRGVAER LAAIYRDVAA CARRGARAGT
AAHVRRSPVA PSATVANQKE NGS