Gene BURPS668_3094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_3094 
Symbol 
ID4885642 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp3036010 
End bp3037851 
Gene Length1842 bp 
Protein Length613 aa 
Translation table11 
GC content61% 
IMG OID640129022 
Productglycosyl transferase, group 2 family protein 
Protein accessionYP_001060106 
Protein GI126438818 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis
[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGTTTG ATATATCCGT CGTGGTGTAT CGAGAGACGG AGGAAACCCT CGACGGTCTT 
CTCGACAGCC TCGCCGCTCA AGCATCATGC CCGGATACGG TGGTTCGTGT CTGGTTGCGC
AACAACGATC CCGCCGATGC CGATCGGTGG GATCGGTTCG TGCACGATCG GTCATGGTAT
CCATTCGAGA TTTCGATCTC GCATTCTCCG CAGAACGTGG GCTTCGGGCG CGCGCACAAT
GCGACGTTCG AAATGGCCGA CGCTCCGTTT TTCTTTGTCT TGAATCCGGA TACGCGACTG
CATTCGACTG CCGTCTCGGC ATTGCGGAAG GCGATAGACA CGTCTGCCGG CGACGTGGGC
GCCTGGGAGT TGCGGCAACT GCCGTACGAG CATCCGAAAC TGTACGATCC CGTTTCGCTG
AGCACGGACT GGGTGACAGG TGCCGCCGTC GTGTTCCGGC GTGCCGCGTT TGCGCAGGTG
CGCGGTTTCG AGCCGCGTAT CTTCATGTAT GGGGAAGATG TCGACCTATC GTGGCGAATG
CGCGCCGCGG GCTGGGTATT GCATTATGTG CCGCATGCTG TCGTCGTACA CCCCACGTAT
TCGAAACCGA TGGAGGCCAA ACCGCTCCAG ATCGCAGGCG GCGTGGTCGC ATCGTTGCAA
TTGCGTACGA GGTTCGGGTC CTGGCTCGAT ATCGCTCGCG GTCTAGGTTG CTGGGTTGCC
GAACTCGCGA GGCCGGCGCG CTTCCCGCAT GCGCGGCGCA CGCATCTGAT GGCACTGGCT
CGATATCTTC GGAGCGCAGC CTACTTCAGG CGCACGGGGG CACGGTATCG CAAAGGCGGT
TTTCGCCCCG GTTTTCGTTT TTGGGGATAT GGTGACCGAC GCGACGGCGC ATTTTTTGCC
TTCGCGGTCG AGGAACTCGA CGCGCGCACG GTGCCGCTTG TCTCGATCAT CGTGCGAACG
CATCGCCGAC CGGCATTGTT GCGGGAGGCG CTGATGTCGC TGTCGCATCA AACCTATCCG
CGTGTCGAAG TCATTGTCGT CGAGGACGGC GAGCCGAATA GCCGCGCGAT GATCGAACGC
GAATTTGCAG GGCGCCTTGA TATCCGCTAT GAGGCGACGG GCATGCCGGT AGGCAGGAGT
GCCGCCGGGA ATCTGGGCCT TTCGCTCGCC GCCGGGGAAT GGCTGGGATT TCTTGACGAC
GACGACCAGT TCTACGCAGA CCACGTCGAG GCAATGATGC AGGTCGCGCG AAGCGGTACG
AACCGGGCAG TCTATGGTGC GTCGCACGAG ATTCCGACCG AATTTGCACA ATTGACGGAC
GAGGCCGCGA CATATCGCGA AGAGCCCGCG TCGCTCAAGT ATCGGCCATA TTCCCGCCTG
GCGATGTGGC AGGAGAACCT TGCGCCGATT CAAGCAGTCC TCTTCCATCG AAGCTTGTAT
GACGAATTGG GCGGATTTGA CGAGGACCTC GATCAACTCG AAGACTGGGT GCTGTGGGTG
CGCTATTCGT GTGCGACTGA CTTCTCTTCG TTCCTGCGGG TGACATCACG CTACCGGGTG
CCCATGGCGG CCAAGGTTGC CGTTGAGCGT CAGGCCAAGC TGCATGAAGC CTATGCCGTC
GCCCTGGAGC GACAGCGAGC GATGCGAGTG ACGCTTAGCC CGTTCGACGT TGTCGCCATG
GCGGAAGAGC AGGCCCGTCG GCATGCTATC GTCCACGTTT CGAGGCAAAC CGCGCGAAAG
CTGATCGTGC GAGTGCCGTT CATGCGAACG TTGTTATCGA GCCAGGCGGG ATGGCGACGG
CGCATGAGAG CGCTATATCG TCGAATGTCG CCGCGCTCCT GA
 
Protein sequence
MRFDISVVVY RETEETLDGL LDSLAAQASC PDTVVRVWLR NNDPADADRW DRFVHDRSWY 
PFEISISHSP QNVGFGRAHN ATFEMADAPF FFVLNPDTRL HSTAVSALRK AIDTSAGDVG
AWELRQLPYE HPKLYDPVSL STDWVTGAAV VFRRAAFAQV RGFEPRIFMY GEDVDLSWRM
RAAGWVLHYV PHAVVVHPTY SKPMEAKPLQ IAGGVVASLQ LRTRFGSWLD IARGLGCWVA
ELARPARFPH ARRTHLMALA RYLRSAAYFR RTGARYRKGG FRPGFRFWGY GDRRDGAFFA
FAVEELDART VPLVSIIVRT HRRPALLREA LMSLSHQTYP RVEVIVVEDG EPNSRAMIER
EFAGRLDIRY EATGMPVGRS AAGNLGLSLA AGEWLGFLDD DDQFYADHVE AMMQVARSGT
NRAVYGASHE IPTEFAQLTD EAATYREEPA SLKYRPYSRL AMWQENLAPI QAVLFHRSLY
DELGGFDEDL DQLEDWVLWV RYSCATDFSS FLRVTSRYRV PMAAKVAVER QAKLHEAYAV
ALERQRAMRV TLSPFDVVAM AEEQARRHAI VHVSRQTARK LIVRVPFMRT LLSSQAGWRR
RMRALYRRMS PRS