Gene BURPS1106A_1564 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_1564 
SymboltreZ 
ID4899423 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp1514220 
End bp1516124 
Gene Length1905 bp 
Protein Length634 aa 
Translation table11 
GC content74% 
IMG OID640134794 
Productmalto-oligosyltrehalose trehalohydrolase 
Protein accessionYP_001065835 
Protein GI126452473 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0296] 1,4-alpha-glucan branching enzyme 
TIGRFAM ID[TIGR02402] malto-oligosyltrehalose trehalohydrolase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0166062 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGATT CGACCGCCAC GCATGCGTAT GCGCTGCCCT TCGGCGCGAC GTGCGTCGAC 
GCGCGGCACA CGCGCTTTCA CCTCTGGGCG CCCGCCTGCC GCAGCGCCGC CGTCGAACGG
CAAGACGGCG AGACCGTCGC GATGACGCCC GACGGCGGCG GCGGCTTCGA CGCGATCGTG
CGATGCGGGC CCGGCACGCT GTACCGCTAC CTGCTCGACG GCACGCTGCG CGTGCCCGAT
CCCGCGTCGC GCTTCCAGCC GCACGGCGTG CACGGCCCGA GCGAGGTCGT CGATCCGGCC
GCCTACCGCT GGCGCCACGA CGACTGGCGC GGACGGCCGT GGCGCGAAAC CGTGCTGTAC
GAGGTGCACG TCGGCGCGTT CGGCGGCTAT GCGCGGATCG CGCGCCGGCT CCCGGCGCTC
GCCGCGCTCG GCATCACCGC GATCGAGCTG ATGCCCGTCA ACGCGTTCCC CGGCGCGCGC
AACTGGGGCT ACGACGGCGT GCTGCCGTTC GCGCCCGACG CATCCTACGG GCCGCCCGAC
GCGCTGAAGT CGCTCGTCGA CGCCGCGCAC GGGCTCGGGC TTCAGGTGCT GCTCGACGTC
GTCTACAACC ACTTCGGCCC CGACGGCAAC CTGCTGCCCC GCTACGCGCC CGCGTTCTTC
CGCCGCGACC GCGATACCGC ATGGGGGCCG GCCATCGATT TCTCGTGCCC GCAGGCGGGC
GCGTTCTTTC TCGAGAACGC GCTGTACTGG CTCGACGAAT ACCGGTTCGA CGGACTGCGG
ATCGACGCCG CGCACGCGAT CGACGACGAC GCGTGGCTCG CCGGCCTCGC CCGCCGCGTG
CGCGCCTACG CGGGCGACGC GCGCCACGTG CATCTCGTGC TCGAAAACGA ACGCAACACC
GCGAGCCTGC TCGCCGGCGG CCGCTTCGAC GCGCAATGGA ACGACGATTT CCACAACAGC
ATGCACGTGC TGCTGACGGG CGAGCAGGCG GGCTACTACC GCGCGTATGC CGACGCGCCG
ATCCGCCACC TCGCGCGCGT GCTCGGCGAA GGTTTCGCGT ATCAGGGCGA GCCGTCGCCG
CTGCACGGCG GCGCACCGCG CGGCGAACCG AGCGCGGACC TGCCGCCGAG CGCGTTCGTC
GCGTTCCTGC AGAACCACGA TCAGATCGGC AACCGCGCAT TCGGCGAGCG GCTGCGCGCG
CTCGTGAACG ACGACGCGCT GCGCGCGGCG AGCGCGCTCG CGCTGCTTGC GCCGCAGATT
CCGCTGCTGT TCATGGGCGA GGAATGCGGC ACGACGCAAC CGTTCCAGTT CTTCACCGAT
CATCGCGGCG CGCTCGCCGC AGCGGTGCGC GAAGGCCGCC GCCGCGAATT CGCCGCCTTC
CCCGCGTTCG CCGATCCCGC CCATCGCGAC GCGATTCCCG ACCCGAACGA CCCGGCGACG
TTCGCCCGCT CGTCGCTCGC CGCGCCGGGC GCCGAGCCGC CCGACGCGAA CGCGTGGCGG
CGCTTCTACC GCGGCGCGCT CGCCGTGCGC GCGCGCTTCG TCACGCCGTG GCTCGACGGC
GCGCGCGCGC TCGGTGCGAC GGTGCTCGCG CGCGCGGACG GCGGCCACGC GAACGCGCTC
GTCGCGCGCT GGCGCCTCGG CGACGGCAAC ACGCTCGCGA TCGCGTTGAA TCTGGACGCC
CGGCCGGCCG CGCTCGCCGC GCCGCCCGAC GGCAAGATCG TGTTCGAGAC TCCGCCGCGC
GCACGCGACG CGCTCGCCGA CGCACGGCTC GCCGCGCACG CGTGCATTGC CTGGCGCAGC
GGGAACGTGA ACGGCGTCGC GCGGCGCGGC CGCGCCGTCG ACGCGAAGCA CATGCCGGCC
GTGAAGCACG CGAACGGCGT GAACGGCCAG GACGGCGCGC CATGA
 
Protein sequence
MNDSTATHAY ALPFGATCVD ARHTRFHLWA PACRSAAVER QDGETVAMTP DGGGGFDAIV 
RCGPGTLYRY LLDGTLRVPD PASRFQPHGV HGPSEVVDPA AYRWRHDDWR GRPWRETVLY
EVHVGAFGGY ARIARRLPAL AALGITAIEL MPVNAFPGAR NWGYDGVLPF APDASYGPPD
ALKSLVDAAH GLGLQVLLDV VYNHFGPDGN LLPRYAPAFF RRDRDTAWGP AIDFSCPQAG
AFFLENALYW LDEYRFDGLR IDAAHAIDDD AWLAGLARRV RAYAGDARHV HLVLENERNT
ASLLAGGRFD AQWNDDFHNS MHVLLTGEQA GYYRAYADAP IRHLARVLGE GFAYQGEPSP
LHGGAPRGEP SADLPPSAFV AFLQNHDQIG NRAFGERLRA LVNDDALRAA SALALLAPQI
PLLFMGEECG TTQPFQFFTD HRGALAAAVR EGRRREFAAF PAFADPAHRD AIPDPNDPAT
FARSSLAAPG AEPPDANAWR RFYRGALAVR ARFVTPWLDG ARALGATVLA RADGGHANAL
VARWRLGDGN TLAIALNLDA RPAALAAPPD GKIVFETPPR ARDALADARL AAHACIAWRS
GNVNGVARRG RAVDAKHMPA VKHANGVNGQ DGAP