Gene BURPS1710b_A0913 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A0913 
Symbol 
ID3692196 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp1171896 
End bp1175096 
Gene Length3201 bp 
Protein Length1066 aa 
Translation table11 
GC content69% 
IMG OID637731167 
Productputative glycosyltransferase 
Protein accessionYP_336071 
Protein GI76819588 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.193043 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATGGTG AATACGCAAG CGAAACATCC TTATGCCGAC CTCGCGGTGA AGACCGACGA 
GGAAGACGTC GTCCTGGGCC AGATGATCCA GGTGATTCTC GACGATATCT GGCTGCTCCT
CGGCATCGCG TTGGTCGTGG TCGCGCTCGC CGGGCTCTAC TGCTACGTCG CGAAGCCGCT
CTATTCGGCC GATGCGCAGG TGCGGGTGGA GGCGAGCGAC AACACGTCGC AGGCGCTTAC
GCAGACGCAG ACGGGCGCGA TGATCAACAG CGGGCCGCCG ACGCCGCCCA CCGATGCGGA
AATCGAGATC ATCAAGAGCC GCGGCGTCGT CGCGCCGGTC GTCGAGCAGT TCAAGCTGAA
CGCGTCGGTC ACGCCGAACA CGTTGCCGAT TCTCGGCGCG ATCGCCGCGC GGCTCGCGAC
GCCGGGCCAT CCGGGCAAAC CGTGGCTCGG CTTGTCGTCG TACGCGTGGG GCGGCGAGGA
GGCGAGCATC GATTCGATCG ACGTGACGCC CGTGCTCGAA GGCAAGCAGC TCACGCTCAC
GGCCGGCGCG GACGGCGGCT ACGCGCTCGC CGATCCGGAC GGCGCGGTGC TCGTGCGCGG
CAAGGTCGGC GAGCGCGAGC AGGGCGGCGG CGTGACGATC AACGTCTCGA AGCTCGTCGC
GCGCCCCGGC ACGCGCTTCA CGGTGGTCCG GCAGAACGAT CTCGATGCGA TCACCGCGTT
CCAGTCGGCG ATCCAGGTGG CCGAGCAGGG CAAGCAGACC GGCGTGATCC AGATCTCGCT
CGAAGGCAAG GACCCCGAAC AGACCGCGCA GATCGCGAAC GCGCTCGCGC AGTCGTATCT
GCATCAGCAC GTGACGAGCA AGCAGGCCGA AGCGACGAAG ATGCTCGAGT TCCTGAAGAA
CGAAGAGCCG CGCCTGAAAT CGGACCTCGA GCGCGCGGAG GCGGAGCTCA CCCAGTATCA
GCGCACGTCG GGCTCGATCA ACGCGAGCGA CGAAGCGAAG GTCTACCTCG AAGGCAGCGT
CCAGTACGAG CAGCAGGTCG CCGCGCAGCG GCTGCAGCTC GCGGCGCTCG CGCAGCGCTA
CACGGACGAG CATCCGCTCG TCGTCGCGGC GAAGCAGCAG CTCGGCCAGC TCGAGGCGGA
GCGCGCGAAG TACGACGGCA AGTTCCGCGG GCTGCCGGCG ACCGAAGTCA AGGCTGTCGC
GTTGCAGCGC AACGCGAAGG TTGCGGAAGA CATCTACGTG CTGCTGCTCA ACCGTGTGCA
GGAGCTGTCG GTGCAGAAGG CCGGCACGGG CGGCAACATC CGCCTCGTCG ATGCGGCGCT
GCGCCCGGGC GTGCCGGTCA AGCCGAAGAA GGTGCTGATC CTGTCGGCGG CGACGCTGCT
CGGCCTGATC CTCGGCACGA GCGTCGTGTT CCTGCGCCGC AACCTGTTCC ATGGCATCGA
GGATCCGGAT CGCGTCGAGC GCGCGTTCAA CCTGCCGCTG TACGGCCTCG TGCCGATGAG
CGCGGAGCAG GCGCGATTCG ATGCCGCCGA CAAGGGCAAT CGCGTGCGGC CGATTCTCGC
GTGCGCGCGG CCGAAGGATC TGAGCGTCGA AAGCCTGCGC AGCCTGCGCA CCGCGATGCA
GTTCGCGCTG ATGGATGCGA AGAACCGCGT GATCGTGCTG ACCGGACCGA CCCCCGGCAT
CGGCAAGAGC TTTCTCGCGG TCAACCTCGC CGCGCTCGTC GCGCATTCGG GCAAGCGCGT
GCTGCTGATC GACGCGGACA TGCGGCGCGG CTCGCTCGAT CGCCACTTCG GCACCGGGGG
AAGGCGCGGC CTGTCGGAAT TGCTGAGCGA TCAGGTCGCG CTCGAAGAGG CGATTCGCGA
AACGTCGGTG CCGGGGCTGT CGTTCATCCC GAGCGGCGCG CGCCCGCCGA ATCCGTCGGA
GCTGCTGATG TCGCCGCGCC TGTCGCAATA CCTCGACGGC CTCGCGAAGC GCTACGACAT
GGTGATCGTC GATTCGCCGC CGATCCTCGC CGTCACCGAC GCGACGATCT TCGGCGAACT
CGCCGGCTCG ACGTTCCTCG TGCTGCGCTC CGGCATGCAC ACCGAAGGCG AGATCGGCGA
CGCGATCAAG CGGCTGCGCA CCGCGGGCGT GCAACTGCAA GGCGGGATCT TCAACGGCGT
GCCGGCGCGC ACGCGCGGCT ACGGCCGCGG CTATGCGGCC GTGCACGAAT ATCTGAGCGC
ATGACGCGCG CTTGTTCGAT CGACGAGGTG ACGATGAAAA TTTCCGTACT CGTGCCGACC
TTCCGGCGTC CCGCGGATCT CGCCCGCTGT CTGATCGCGC TGCAGCGGCA GCGCGTCGCG
CCCGACGAGG TGATCGTCGT CGCGCGTCCC GACGACGACG TGACGCACGA GCGGCTTGCC
GATCCCGCCG TGCGCGGCGC GCTCGCGCTG CGGATCGTGA CGGTCGACGT GCCGGGACAG
GTGGCGGCGC TCAACCGCGG GCTCGACGCC GCGCGCGGCG ACGTGATCGC GATCACCGAC
GACGACGCCG CGCCGCGCGT CGACTGGATC GAGCGGATCG GCGCGATCTT CGCCGCCGAT
CCGCGCGTCG GCGCGGTGGG CGGCCGCGAC TGGGTGCACG AGAAGGGCCG GCTGCTCGAC
GGCGAGCAGC CGCTCGTCGG CAGGCTCACG GTGTCGGGCA AGATCGTCGG CAATCACCAT
CTCGGCGTCG GCGGCGCGCG CGAGGTCGAT ACCCTCAAGG GGGCGAACAT GAGCTATCGC
CGCACGGCGA TCGAGGGGCT GCGCTTCGAT ACGCGCCTGC GCGGCGCGGG CGCGCAGACG
CACAACGATA CGTCGTTCAG CATGTGCGTG AAGCGCGCGG GCTGGAAGCT CGTCTACGAC
CCGGCCGTCG CCGTCGACCA TTACCCGGCC GAACGCTTCG ACGACGATCG CCGCGACGCC
GCGTCGATGG CCGCGCTGTC GAACGCCGCC TACAACCTGC ATCTGACGTT GCGCGAGCAT
TTGTCGCCCG TGCGGCGCGA GATCGCGTGG TGGTGGTGGA CGTTCGTCGG CACCCGTGCG
TATCCGGGGC TCGTGCACGT CGTGTTGAGC GCCGCCGCGA AGAACGGTGC GAGCATGCGC
GCGCGCTGGC GCGCGGTGCG GCGGGGCGCG CGCGACGCGC GGCGTGTGAG CATCGCGCCT
CACGCCGGCA TCGCGCAGTG A
 
Protein sequence
MHGEYASETS LCRPRGEDRR GRRRPGPDDP GDSRRYLAAP RHRVGRGRAR RALLLRREAA 
LFGRCAGAGG GERQHVAGAY ADADGRDDQQ RAADAAHRCG NRDHQEPRRR RAGRRAVQAE
RVGHAEHVAD SRRDRRAARD AGPSGQTVAR LVVVRVGRRG GEHRFDRRDA RARRQAAHAH
GRRGRRLRAR RSGRRGARAR QGRRARAGRR RDDQRLEARR APRHALHGGP AERSRCDHRV
PVGDPGGRAG QADRRDPDLA RRQGPRTDRA DRERARAVVS ASARDEQAGR SDEDARVPEE
RRAAPEIGPR ARGGGAHPVS AHVGLDQRER RSEGLPRRQR PVRAAGRRAA AAARGARAAL
HGRASARRRG EAAARPARGG AREVRRQVPR AAGDRSQGCR VAAQREGCGR HLRAAAQPCA
GAVGAEGRHG RQHPPRRCGA APGRAGQAEE GADPVGGDAA RPDPRHERRV PAPQPVPWHR
GSGSRRARVQ PAAVRPRADE RGAGAIRCRR QGQSRAADSR VRAAEGSERR KPAQPAHRDA
VRADGCEEPR DRADRTDPRH RQELSRGQPR RARRAFGQAR AADRRGHAAR LARSPLRHRG
KARPVGIAER SGRARRGDSR NVGAGAVVHP ERRAPAESVG AADVAAPVAI PRRPREALRH
GDRRFAADPR RHRRDDLRRT RRLDVPRAAL RHAHRRRDRR RDQAAAHRGR ATARRDLQRR
AGAHARLRPR LCGRARISER MTRACSIDEV TMKISVLVPT FRRPADLARC LIALQRQRVA
PDEVIVVARP DDDVTHERLA DPAVRGALAL RIVTVDVPGQ VAALNRGLDA ARGDVIAITD
DDAAPRVDWI ERIGAIFAAD PRVGAVGGRD WVHEKGRLLD GEQPLVGRLT VSGKIVGNHH
LGVGGAREVD TLKGANMSYR RTAIEGLRFD TRLRGAGAQT HNDTSFSMCV KRAGWKLVYD
PAVAVDHYPA ERFDDDRRDA ASMAALSNAA YNLHLTLREH LSPVRREIAW WWWTFVGTRA
YPGLVHVVLS AAAKNGASMR ARWRAVRRGA RDARRVSIAP HAGIAQ