Gene BURPS1106A_1568 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_1568 
Symbol 
ID4903235 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp1523917 
End bp1527264 
Gene Length3348 bp 
Protein Length1115 aa 
Translation table11 
GC content74% 
IMG OID640134798 
Productglycosy hydrolase family protein 
Protein accessionYP_001065839 
Protein GI126454507 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCATGCGT GGCCGCCGCT GTTCGAGCGC GTCGCCGCAT GGGGCTTCGA TCATGTGCTG 
ATCGGCGGCC TCTGGGCGGC GAGCCGCCGC GGCTATCCGC GCCACGTCGC GGACCCCGAT
CGCCCCGCCG AATCGTTCGC GACGTCGCTC GACGCGACGA GCGCGCTCGC CCGCCTGTCG
GACAGCGCGC GCGAGCACGG GCTGCGCATC GCCGTCGAAG TCGTCGTCGA CCGCGTCGCG
CGCGAGCACC CGCTGCACGA CGCGCACCGC GACTGGTACG TCGTCGACGA ACGCGACGAC
GCGCTGATCG ATCCGCGCAC CGCGGCGCTC GCGCACGACG TCGCGCATGC GAACGTCGGC
AGCGCGGCCG CGCTCGACGC GCTCGCCGAC TGGTGGCGCG CCCGGCTCGG CGCGCTGGCC
GACGCGGGCG CGGCGGCCTT TCTCGTCGAC GCGCCGCAGC GGATGCCGGC GCACTGGTGG
GCCGCGCTGC TCCGCGCGCT GCGGCAGGCC CGCGCGGATC TGCCGGTGAT CGCCGGCGTG
CCGGGCCGGG AGCGCGAGGC GCTCGCGCAG CTCGCCTGCG CCGGCTTCGA CGCGGCGTTC
TCGTCGCTGC GCTGGTGGGA CCTGCGCGCG CCGTGGTTCG TCGAAGAACA CCGGCTGCTG
CGGCGCGTCG GCGCGCCGAT CGCGTTCCCG GACGCGTTCG ACGGCCCGCG GCTCGCGCAC
GATTGGCGGC AGGCCGCGCC CGAGACGATC GAGCGCGCGC ATCGCCGCGC GCTGTGGACC
GCCGCCGCGC TCGGCACCGG CTGGCTCGTG CCGATGGGCT TCGAGCGCGG CGTGGCCGTC
GAACTGATGG CGCGCGAGCC GCGAGCCGAC GCGTACCGCG CCGCGCTCGA CAGCGCGCCG
TTCGACTTGT CGGGCGCGAT CGCCGAAGCG AACGCGCTGC GCCGCGCGGC GCCCGCGCTG
CGCGGCAACG GCGAGATCGC GCAGCTGACG GGTGCCGATG CGCCGGCGAC CGTGCTGCTG
CGCGGCGCGC GCACCGCGCT CGAATACGAC GACGAGGCCG CGCTGATCGC GGTCAATCCG
GACCTCGCGC ACCCCGCGGC GATCGTGCCG TGCGCGGCGC TCGCCGGCGT GCCGGGCGGC
TTCACGCGCT TCGCGCCGTT CGCCGACGGC CGCCGCCCGC GTATGGGCGC GCTCGAACCG
TTCGCGCTCG CCGCCGGTGC CTGCACGTTG CTGCGCGCGC AGCGCGCGCG CCCGGTCACG
ACGGCCCCCG CCGAGGATCG GCGCGGCAAT CGGCCCGGCA CCCGTGCGTC GGTCACGGCC
GCGCTCGCCG GCGAGCGCAT CGCGATCGAG CGCATCGAGC CCGTCGTCGA CGATGGCCGC
TTCGCCGTCA AGCGCGTGAT CGGCGAGCGG CTCGCCGTGC GCGCGGCGAT TTTCGCCGAC
GGTCACGCAC GCCTCGCGGC GGCCGTCCAA TGGCGCGCCG CGGACGAGAA CGGCTGGCAC
GAGGCCCGAT GCGCCGCCGA AGGCAACGAT GCGTGGCGAG CGGACATTCC GCTCGAACGG
CTCGGCCGGC ATCTGTTCCG CGTGATCGCG TGGCGCGACG ATTGGGCGAC GCTCGTCGAC
GAGATCGGCA AGAAGCACGC GGCGGGTCAG GCGGTGGCGC TCGAGCTGGA AGAAGCGCGG
CGACTCGCCG CCGACGTGCT CGCGCGCGCG CCGGAGGCGA ACCCCGCCGC GCTCGCCGTG
CTGCGCGAAT TCGCGGCGGC CCTCGACGCC GCGCCGCCCG ACCAGCGGCT CGCGCTGATC
GGCGCGCCGC ACGTCGCCGA CGCGTTCGCG GCGCTGCGCG AGCGAGCGTT CGCCACGCGC
GACGCGCCCG TCTTCCCGGT CGACGTCGAG CGGCGCGCGG CCCGCTTCGC CAGCTGGTAC
GAGATGTTTC CCCGCTCGGC GAGCGACGAT GTCCGCCGCC ACGGCACGTT CGACGACGTC
GTCGCGCATC TGCCGCGCAT CCGCGACATG GGCTTCGACG TGCTGTACTT CCCGCCGATC
CATCCGATCG GCACGACCGC GCGCAAGGGC CGCAACAACA GCCTGCAGGC CGCGCCCGAC
GACGTCGGCA GCCCGTATGC GATCGGCTCG CCGGCGGGCG GCCACACCGC CGTCCATCCG
CAGCTCGGCT CGCTCGATGC GTTCCGCCGG CTCGTCGCTG CGGCGCGCGC GCACGATCTC
GAGATCGCGC TCGACTTCGC GGTTCAATGC TCGCCGGACC ATCCGTGGCT CACCGAGCAT
CCCGGCTGGT TCGCATGGCG GCCGGACGGC TCGCTGCGCT ACGCGGAAAA TCCGCCGAAG
CGCTATCAGG ACATCGTGAA TCCCGACTTC TACGCGCGCG ACGCGATGCC CGCGCTGTGG
ATCGCGCTGC GCGACGTCGT GCTGTTCTGG ATCGACGCGG GCGTGCGCAT CTTCCGCGTC
GACAATCCGC ACACGAAGCC GCTGCCGTTC TGGGCATGGA TGATCGCCGA CGTGCGCGCG
CGACACCCGG ACACGGTGTT CCTGTCCGAG GCGTTCACGC GGCCGAGCAT GATGTACCGG
CTCGCGAAGC TCGGCTTCTC GCAGTCGTAT ACCTACTTCA CGTGGCGCGA GTCGAAGCGC
GAGTTCATCG ATTATCTGAC CGAGCTCGCC GACGGGCCGG CGCGCGAATA CTTCCGGCCG
AACTTCTTCG TCAACACGCC GGACATCAAT CCGCGCCACC TGCAGCAGGC GCCGCGCACG
CAGTTCGTGA TCCGCGCGGC GCTCGCCGCG ACGCTCTCGG GCCTCTGGGG AATGTATTCG
GGCTTCGAGC TGTGCGAGTC CGACGCGCTG CCCGACAGCG AGGAATATCG CGACGCGGAG
AAATACGAGC TGCGCGCGCG CGACTGGCGG CGGCCCGGCC ACATCGGCGA CGAAATCGCG
CGGCTCAACC GCGCGCGGCG CGACAACCCG GCGCTGCAGA CGCATCTCGG CATCCGCTTC
GCGCACGCGC CGAACGACGC GGTGCTGGTG TTCTCGAAGG CGACGCCCGC GCACGACAAC
GTCGTCGTCG TCGCGATCAG CCTCGATCCG TGGCATCCGC AGGCCACCGA TTTCACGCTC
GACGCGGCGC TGTACCGCGG CTGGGGCATC GCCGACGGCG AGCGGCTCGT CGCCGTCGAT
CAGACGGCCG ACCACGTCGA AACCTGGCAC GGGCGCCGGC ATTACGTCGC GCTCGACCCG
CACGTGCGCC CGTTCGCGAT CTGGCGCGTC GCGCCCGCGG CGGGCGTCGC GCGCGGCGCT
CGCGACGACG CGCGCGACGT CCCCGCACAG GAGGTGCACG AACGATGA
 
Protein sequence
MHAWPPLFER VAAWGFDHVL IGGLWAASRR GYPRHVADPD RPAESFATSL DATSALARLS 
DSAREHGLRI AVEVVVDRVA REHPLHDAHR DWYVVDERDD ALIDPRTAAL AHDVAHANVG
SAAALDALAD WWRARLGALA DAGAAAFLVD APQRMPAHWW AALLRALRQA RADLPVIAGV
PGREREALAQ LACAGFDAAF SSLRWWDLRA PWFVEEHRLL RRVGAPIAFP DAFDGPRLAH
DWRQAAPETI ERAHRRALWT AAALGTGWLV PMGFERGVAV ELMAREPRAD AYRAALDSAP
FDLSGAIAEA NALRRAAPAL RGNGEIAQLT GADAPATVLL RGARTALEYD DEAALIAVNP
DLAHPAAIVP CAALAGVPGG FTRFAPFADG RRPRMGALEP FALAAGACTL LRAQRARPVT
TAPAEDRRGN RPGTRASVTA ALAGERIAIE RIEPVVDDGR FAVKRVIGER LAVRAAIFAD
GHARLAAAVQ WRAADENGWH EARCAAEGND AWRADIPLER LGRHLFRVIA WRDDWATLVD
EIGKKHAAGQ AVALELEEAR RLAADVLARA PEANPAALAV LREFAAALDA APPDQRLALI
GAPHVADAFA ALRERAFATR DAPVFPVDVE RRAARFASWY EMFPRSASDD VRRHGTFDDV
VAHLPRIRDM GFDVLYFPPI HPIGTTARKG RNNSLQAAPD DVGSPYAIGS PAGGHTAVHP
QLGSLDAFRR LVAAARAHDL EIALDFAVQC SPDHPWLTEH PGWFAWRPDG SLRYAENPPK
RYQDIVNPDF YARDAMPALW IALRDVVLFW IDAGVRIFRV DNPHTKPLPF WAWMIADVRA
RHPDTVFLSE AFTRPSMMYR LAKLGFSQSY TYFTWRESKR EFIDYLTELA DGPAREYFRP
NFFVNTPDIN PRHLQQAPRT QFVIRAALAA TLSGLWGMYS GFELCESDAL PDSEEYRDAE
KYELRARDWR RPGHIGDEIA RLNRARRDNP ALQTHLGIRF AHAPNDAVLV FSKATPAHDN
VVVVAISLDP WHPQATDFTL DAALYRGWGI ADGERLVAVD QTADHVETWH GRRHYVALDP
HVRPFAIWRV APAAGVARGA RDDARDVPAQ EVHER