Gene BURPS1106A_3131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_3131 
Symbol 
ID4903147 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp3051434 
End bp3053275 
Gene Length1842 bp 
Protein Length613 aa 
Translation table11 
GC content61% 
IMG OID640136357 
Productglycosyl transferase, group 2 family protein 
Protein accessionYP_001067369 
Protein GI126454769 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis
[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGTTTG ATATATCCGT CGTGGTGTAT CGAGAGACGG AGGAAACCCT CGACGATCTT 
CTCGACAGCC TCGCCGCTCA AGCATCATGC CCGGATACGG TGGTTCGTGT CTGGTTGCGC
AACAACGATC CCGCCGATGC CGATCGGTGG GATCGGTTCG TGCACGATCG GTCATGGTAT
CCATTCGAAA TTTCGATCTC GCATTCTCCG CAGAACGTGG GCTTCGGGCG CGCGCACAAT
GCGACGTTCG AAATGGCCGA CGCTCCGTTT TTCTTTGTCT TGAATCCGGA TACGCGACTG
CATTCGACTG CCGTCTCGGC ATTGCGGAAG GCGATAGACA CGTCTGCCGG CGACGTGGGC
GCCTGGGAGT TGCGGCAACT GCCGTACGAG CATCCGAAAC TGTACGATCC CGTTTCGCTG
AGCACGGACT GGGTGACAGG TGCCGCCGTC GTGTTCCGGC GTGCCGCGTT TGCGCAGGTG
CGCGGTTTCG AGCCGCGTAT CTTCATGTAT GGGGAAGATG TCGACCTATC GTGGCGAATG
CGCGCCGCGG GCTGGGTATT GCATTATGTG CCGCATGCTG TCGTCGTACA CCCCACGTAT
TCGAAACCGA TGGAGGCCAA ACCGCTCCAG ATCGCAGGCG GCGTGGTCGC ATCGTTGCAA
TTGCGTACGA GGTTCGGGTC CTGGCTCGAT ATCGCTCGCG GTCTAGGTTG CTGGGTTGCC
GAACTCGCGA GGCCGGCGCG CTTCCCGCAT GCGCGGCGCA CGCATCTGAT GGCACTGGCT
CGATATCTTC GGAGCGCAGC CTACTTCAGG CGCACGGGGG CACGGTATCG CAAAGGCGGT
TTTCGCCCCG GTTTTCGTTT TTGGGGATAT GGTGACCGAC GCGACGGCGC ATTTTTTGCC
TTCGCGGTCG AGGAACTCGA CGCGCGCACG GTGCCGCTTG TCTCGATCAT CGTGCGAACG
CATCGCCGGC CGGCATTGTT GCGGGAGGCG CTGATGTCAC TGTCGCATCA AACCTATCCG
CGTGTCGAAG TCATTGTCGT CGAGGACGGC GAGCCGAATA GCCGCGCGAT GATCGAGCGC
GAATTTGCAG GGCGCCTTGA TATCCGCTAT GAGGCGACGG GCATGCCGGT AGGCAGGAGT
GCCGCCGGGA ATCTGGGGCT TTCGCTCGCC GCCGGGGAAT GGCTGGGATT TCTTGACGAC
GACGACCAGT TCTACGCAGA CCACGTCGAG GCAATGATGC AGGTCGCGCG AAGCGGTACG
AACCGGGCAG TCTATGGTGC GTCGCACGAG ATTCCGACCG AATTTGCACA ATTGACGGAA
GAGGCCGCGA CATATCGCGA AGAGCCCGCG TCGCTCAAGT ATCGACCATA TTCCCGCCTG
GCGATGTGGC AGGAGAACCT TGCGCCGATT CAAGCAGTCC TCTTCCATCG AAGCTTGTAT
GACGAATTGG GCGGATTTGA CGAGGACCTC GATCAACTCG AAGACTGGGT GCTGTGGGTG
CGCTATTCGT GTGCGACTGA CTTCTCTTCG TTCCTGCGGG TGACATCACG CTACCGGGTG
CCCATGGCGG CCAAGGTTGC CGTTGAGCGT CAGGCCAAGC TGCATGAGGC CTATGCCGTC
GCCCTGGAGC GACAGCGAGC GATGCGAGTG ACGCTTAGCC CGTTCGACGT TGTCGCCATG
GCGGAAGAGC AGGCCCGTCG GCATGCTATC GTCCACGTTT CGAGGCAAAC CGCGCGAAAG
CTGATCGTGC GAGTGCCGTT CATGCGAACG TTGTTATCGA GCCAGGCGGG ATGGCGACGG
CGCATGAGAG CGCTATATCG TCGAATGTCG CCGCGCTCCT GA
 
Protein sequence
MRFDISVVVY RETEETLDDL LDSLAAQASC PDTVVRVWLR NNDPADADRW DRFVHDRSWY 
PFEISISHSP QNVGFGRAHN ATFEMADAPF FFVLNPDTRL HSTAVSALRK AIDTSAGDVG
AWELRQLPYE HPKLYDPVSL STDWVTGAAV VFRRAAFAQV RGFEPRIFMY GEDVDLSWRM
RAAGWVLHYV PHAVVVHPTY SKPMEAKPLQ IAGGVVASLQ LRTRFGSWLD IARGLGCWVA
ELARPARFPH ARRTHLMALA RYLRSAAYFR RTGARYRKGG FRPGFRFWGY GDRRDGAFFA
FAVEELDART VPLVSIIVRT HRRPALLREA LMSLSHQTYP RVEVIVVEDG EPNSRAMIER
EFAGRLDIRY EATGMPVGRS AAGNLGLSLA AGEWLGFLDD DDQFYADHVE AMMQVARSGT
NRAVYGASHE IPTEFAQLTE EAATYREEPA SLKYRPYSRL AMWQENLAPI QAVLFHRSLY
DELGGFDEDL DQLEDWVLWV RYSCATDFSS FLRVTSRYRV PMAAKVAVER QAKLHEAYAV
ALERQRAMRV TLSPFDVVAM AEEQARRHAI VHVSRQTARK LIVRVPFMRT LLSSQAGWRR
RMRALYRRMS PRS