Gene BURPS1106A_0647 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_0647 
Symbol 
ID4899585 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp619517 
End bp620758 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content67% 
IMG OID640133877 
Productglycosyl transferase, group 1 family protein 
Protein accessionYP_001064929 
Protein GI126454998 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0914561 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAGGTAG CAATCGTTCA CGACTGGCTG GTGGTGTATG GCGGCGCGGA GCGTGTGCTC 
GCGCAGATGA TCGACTGCTT TGCGCAAGCC GACATCTACA GCCTCGTCGA TTTTCTCGAC
GACCGCTCGT GCCTGCGTGG CCGGCCGGTG CACACCTCGT TCATCCAGAA ATTGCCGTTC
GCGCGCAGCA AGTACCGCAG CTATTTGCCG CTCTTTCCGC TCGCGATCGA GCAGTTCGAT
CTGTCCGGCT ACGACCTGAT CCTGTCGAGC TCGTATGCGG TCGCCAAGGG CGTGCTGAAC
GGCCCGGACC AGTTGCATGC GAGTTACGTG CACTCGCCCG TGCGCTACGC GTGGGACCTG
CAGCATCAGT ACCTGAACGA AGCGGGGCTC GCGCGCGGCG TGAAATCGGC GCTCGCGCGC
ACGTTGCTGC ACTACATCCG CAACTGGGAT GCGCGCTCGG CGAACGGGGT CGACCTGCTC
GCGGCGAATT CGCGCTTCGT CGCGCGACGT ATCCGCAAGA CGTATCGGCG CGACGCGACG
GTCATCTATC CGCCCGTCGA CGTCGATCAT CTCGCGCTGC GCGACACGAA GGACGACTTC
TATCTGACGG CGTCGCGCCT CGTGCCGTAC AAGCGGATCG ATCTGATCGT CGAGGCGTTT
TCGCACATGC CGTCGCGCCG GCTCGTCGTG ATCGGCGACG GGCCGGAGGC GGCGAAGATC
CGCGCGCTCG CGGGCCCGAA CGTCACGCTG CTCGGCTACC AGCCGTTCGA CGTGCTGCAC
GATCATCTGC AGCGCGCGAA GGCGTTCGTG TTCGCCGCGG AAGAGGATTT CGGCATCTCG
CCCGTCGAAG CGCAGGCATG CGGCACGCCC GTGATCGCAT ACGGCAAGGG CGGCGTGTGC
GAATCGGTGC GCGCGGCGGG CGCGGCGCCG ACGGGCCTCT TCTATGCGAA GCAAACGTGC
GACGCGCTGA TCGATGCGAT CGACCGGTTC GAGGCGATGC CGGCGGGCAC ATTCGATCCG
CACGCGTGCC GCGCGAACGC GGAGCGCTTC AGCGCCGCGC GCTTTCGCTC GACGTTCTCG
CGCTTCGTGC TCGAGGGCTA CGCCGCGTTG CAGGCGGAAA TGGGCGAGAC GATGCAGGAC
GCGCCGCTCG AGCCGGGTGG CGCGCCGGAC GGCGCGCCTG TCGAGCGCGA CGCGGCGGCG
CCGCACGGCG CCTGCCGGAA CGAAACGCTC GCGCGCATCT GA
 
Protein sequence
MKVAIVHDWL VVYGGAERVL AQMIDCFAQA DIYSLVDFLD DRSCLRGRPV HTSFIQKLPF 
ARSKYRSYLP LFPLAIEQFD LSGYDLILSS SYAVAKGVLN GPDQLHASYV HSPVRYAWDL
QHQYLNEAGL ARGVKSALAR TLLHYIRNWD ARSANGVDLL AANSRFVARR IRKTYRRDAT
VIYPPVDVDH LALRDTKDDF YLTASRLVPY KRIDLIVEAF SHMPSRRLVV IGDGPEAAKI
RALAGPNVTL LGYQPFDVLH DHLQRAKAFV FAAEEDFGIS PVEAQACGTP VIAYGKGGVC
ESVRAAGAAP TGLFYAKQTC DALIDAIDRF EAMPAGTFDP HACRANAERF SAARFRSTFS
RFVLEGYAAL QAEMGETMQD APLEPGGAPD GAPVERDAAA PHGACRNETL ARI