Gene BURPS1106A_1274 details


Gene Information

Locus tag: BURPS1106A_1274
Symbol:
ID: 4900608
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 1248603
End bp: 1249637
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 69%
IMG OID: 640134504
Product: glycosyl transferase, group 1 family protein
Protein accession: YP_001065553
Protein GI: 126454366
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
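
The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record; function names are illustrative.
START_BP = 1248603
END_BP = 1249637


def gene_length(start_bp, end_bp):
    """Feature length, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1035 344
```

Under those assumptions the computed values agree with the listed gene length (1035 bp) and protein length (344 aa).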


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATCA TGATCGTCAC CGATGCGTGG GAGCCGCAAG TCAACGGCGT CGTGCGCACG 
CTCAAGAGCA CGGCGCGCGA GCTCACCGCG CTCGGCCACC GCGTCGAGCT CGTCACGCCG
CTCGAATTCC GCACGGTGCC CTGCCCGACC TATCCCGAAA TCCGCCTGTC GATCCTGCCG
TACCGGCGGC TGCGCGAGCG CCTGGACGCG TTCGAGCCGC ACGCGCTGCA CATCGCGACC
GAAGGCCCGC TCGGGCTCGC CGCGCGCCGC TACGCGCGCG CGCGCAAGCT GCCGTTCACG
ACCGCGTACC ACACGCGCTT TCCGGAATAC GTGCAGGCGC GCTTCGGCGT GCCGCTCGCG
GCGACCTATC GCTTCCTGCG ATGGTTCCAC GGCGCGTCGC TCGCCGTGAT GGCGCCGACG
CCCGTCGTCA AGGACGACCT CGAGCGCTTC GGCTTCGACA ACGTCGTGCT GTGGACGCGC
GGCGTCGATC TCGACATCTT CCGGCCGATC GAATCGAAGG TGCTCAACAC CGCGCGGCCG
ATCTTCCTGT ATGTCGGCCG CGTCGCGATC GAGAAGAACG TCGAGGCGTT CCTGAAGCTC
GACCTGCCCG GCTCGAAATG GGTCGCGGGC GAGGGCCCCG CGCTCGCCGA GCTCAAATCG
CGCTATCCTG AGGCGAATTA CCTCGGCGTG CTGACGCAGG CGGAGCTCGC CAAGGTATAT
GCGGCAGCCG ACGTGTTCGT GTTCCCGAGC CGCACCGACA CGTTCGGGCT CGTGCTGCTC
GAAGCGCTCG CGTGCGGCAC GCCCGTCGCC GCCTATCCGG TGACGGGCCC CGTCGACGTG
CTCGGCGACG GCGGCGCGGG CGCGATGAAC GACGATCTGC GCGAGGCGTG CCTCGAGGCG
CTGAAGATCG ACCGGCGGCA TGCGCGCGCG TGGGCCGAGC GCTTCTCGTG GCGCGCGGCG
TCCGAGCAGT TCGCCTCGCA CCTGAAGCCG CTACAGAAAT CCGCCTGCCC ACACACCGAA
GGCGCAGCCG TTTGA
 
Protein sequence
MKIMIVTDAW EPQVNGVVRT LKSTARELTA LGHRVELVTP LEFRTVPCPT YPEIRLSILP 
YRRLRERLDA FEPHALHIAT EGPLGLAARR YARARKLPFT TAYHTRFPEY VQARFGVPLA
ATYRFLRWFH GASLAVMAPT PVVKDDLERF GFDNVVLWTR GVDLDIFRPI ESKVLNTARP
IFLYVGRVAI EKNVEAFLKL DLPGSKWVAG EGPALAELKS RYPEANYLGV LTQAELAKVY
AAADVFVFPS RTDTFGLVLL EALACGTPVA AYPVTGPVDV LGDGGAGAMN DDLREACLEA
LKIDRRHARA WAERFSWRAA SEQFASHLKP LQKSACPHTE GAAV
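
The relationship between the gene sequence and the protein sequence can be sketched in a few lines. This is a minimal illustration, not the method used to generate the record: it builds the standard codon assignments (which, as far as elongation codons go, translation table 11 shares; the tables differ in permitted start codons, which does not matter here because the gene begins with ATG), strips the 10-base block spacing, and translates up to the first stop codon. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (assumed adequate for table 11
# when the CDS starts with ATG, as this one does).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(seq.count(base) for base in "GC") / len(seq)


# First 30 bases of the gene sequence, copied from the listing.
first_30 = "ATGAAAATCA TGATCGTCAC CGATGCGTGG"
print(translate(first_30))  # prints: MKIMIVTDAW
```

The translated fragment matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the listed GC content.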