Gene BURPS668_1541 details

Gene Information

Locus tag: BURPS668_1541
Symbol: treZ
ID: 4881924
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 1503960
End bp: 1505864
Gene Length: 1905 bp
Protein Length: 634 aa
Translation table: 11
GC content: 74%
IMG OID: 640127469
Product: malto-oligosyltrehalose trehalohydrolase
Protein accession: YP_001058582
Protein GI: 126438378
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0296] 1,4-alpha-glucan branching enzyme
TIGRFAM ID: [TIGR02402] malto-oligosyltrehalose trehalohydrolase
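
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only and is not part of the database record: the function name `expected_lengths` is hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length.

```python
def expected_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the final codon is a stop codon that is not translated.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Subtract one codon for the terminal stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = expected_lengths(1503960, 1505864)
print(gene_len, protein_len)  # 1905 634
```

Under these assumptions the result agrees with the Gene Length (1905 bp) and Protein Length (634 aa) fields.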


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.465785
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGATT CGACCGCCAC GCATGCGTAT GCGCTGCCCT TCGGCGCGAC GTGCGTCGAC 
GCGCGGCACA CGCGCTTTCA CCTCTGGGCG CCCGCCTGCC GCAGCGCCGC CGTCGAACGG
CAAGACGGCG AGACCGTCGC GATGACGCCC GACGGCGGCG GCGGCTTCGA CGCGATCGTG
CGATGCGGGC CCGGCACGCT GTACCGCTAC CTGCTCGACG GCACGCTGCG CGTGCCCGAT
CCCGCGTCGC GCTTCCAGCC GCACGGCGTG CACGGCCCGA GCGAGGTCGT CGATCCGGCC
GCCTACCGCT GGCGCCACGA CGACTGGCGC GGACGGCCGT GGCGCGAGAC CGTGCTGTAC
GAGGTGCACG TCGGCGCGTT CGGCGGCTAT GCGCGGATCG CGCGCCGGCT CCCGGCGCTC
GCCGCGCTCG GCATCACCGC GATCGAGCTG ATGCCCGTCA ACGCGTTCCC CGGCGCGCGC
AACTGGGGCT ACGACGGCGT GCTGCCGTTC GCGCCCGACG CATCCTACGG GCCGCCCGAC
GCGCTGAAGT CGCTCGTCGA CGCCGCGCAC GGGCTCGGGC TTCAGGTGCT GCTCGACGTC
GTCTACAACC ACTTCGGCCC CGACGGCAAC CTGCTGCCCC GCTACGCGCC CGCGTTCTTC
CGCCGCGACC GCGATACCGC ATGGGGGCCG GCCATCGATT TCTCGTGCCC GCAGGCGGGC
GCGTTCTTTC TCGAGAACGC GCTGTACTGG CTCGACGAAT ACCGGTTCGA CGGACTGCGG
ATCGACGCCG CGCACGCGAT CGGCGACGAC GCGTGGCTCG CCGGCCTCGC CCGCCGCGTG
CGCGCCTACG CGGGCGACGC GCGCCACGTG CATCTCGTGC TCGAAAACGA ACGCAACACC
GCGAGCCTGC TCGCCGGCGG CCGCTTCGAC GCGCAATGGA ACGACGATTT CCACAACAGC
ATGCACGTGC TGCTGACGGG CGAGCAGGCG GGCTACTACC GCGCGTATGC CGACGCGCCG
ATCCGCCACC TCGCGCGCGT GCTCGGCGAA GGTTTCGCGT ATCAGGGCGA GCCGTCGCCG
CTGCACGGCG GCGCACCGCG CGGCGAACCG AGCGCGGACC TGCCGCCGAG CGCGTTCGTC
GCGTTCCTGC AGAACCACGA TCAGATCGGC AACCGCGCAT TCGGCGAGCG GCTGCGCGCG
CTCGTGAACG ACGACGCGCT GCGCGCGGCG AGCGCGCTCG CGCTGCTTGC GCCGCAGATT
CCGCTGCTGT TCATGGGCGA GGAATGCGGC ACGACGCAAC CGTTCCAGTT CTTCACCGAT
CATCGCGGCG CGCTCGCCGC AGCGGTGCGC GAAGGCCGCC GCCGCGAATT CGCCGCCTTC
CCCGCGTTCG CCGATCCCGC CCATCGCGAC GCGATTCCCG ACCCGAACGA CCCGGCGACG
TTCGCCCGCT CGTCGCTCGC CGCGCCGGGC GCCGAGCCGC CCGACGCGAA CGCGTGGCGG
CGCTTCTACC GCGGCGCGCT CGCCGTGCGC GCGCGCTTCG TCACGCCGTG GCTCGACGGC
GCGCGCGCGC TCGGCGCGAC GGTGCTCGCG CGCGCGGACG GCGGCCACGC GAACGCGCTC
GTCGCGCGCT GGCGCCTCGG CGACGGCAAC ACGCTCGCGA TCGCGTTGAA TCTGGACGCC
CGGCCGGCCG CGCTCGCCGC GCCGCCCGAC GGCAAGATCG TGTTCGAGAC CCCGCCGCGC
GCACGCGACG CGCTCGCCGA CGCACGGCTC GCCGCGCACG CGTGCATTGC CTGGCGCAGC
GGGAACGTGA ACGGCGTCGC GCGGCGCGGC CGCGCCGTCG ACGCGAAGCA CATGCCGGCC
GTGAAGCACG CGAACGGCGT GAACGGCCAG GACGGCGCGC CATGA
 
Protein sequence
MNDSTATHAY ALPFGATCVD ARHTRFHLWA PACRSAAVER QDGETVAMTP DGGGGFDAIV 
RCGPGTLYRY LLDGTLRVPD PASRFQPHGV HGPSEVVDPA AYRWRHDDWR GRPWRETVLY
EVHVGAFGGY ARIARRLPAL AALGITAIEL MPVNAFPGAR NWGYDGVLPF APDASYGPPD
ALKSLVDAAH GLGLQVLLDV VYNHFGPDGN LLPRYAPAFF RRDRDTAWGP AIDFSCPQAG
AFFLENALYW LDEYRFDGLR IDAAHAIGDD AWLAGLARRV RAYAGDARHV HLVLENERNT
ASLLAGGRFD AQWNDDFHNS MHVLLTGEQA GYYRAYADAP IRHLARVLGE GFAYQGEPSP
LHGGAPRGEP SADLPPSAFV AFLQNHDQIG NRAFGERLRA LVNDDALRAA SALALLAPQI
PLLFMGEECG TTQPFQFFTD HRGALAAAVR EGRRREFAAF PAFADPAHRD AIPDPNDPAT
FARSSLAAPG AEPPDANAWR RFYRGALAVR ARFVTPWLDG ARALGATVLA RADGGHANAL
VARWRLGDGN TLAIALNLDA RPAALAAPPD GKIVFETPPR ARDALADARL AAHACIAWRS
GNVNGVARRG RAVDAKHMPA VKHANGVNGQ DGAP
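
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of one way to do that and to recompute a GC percentage from the gene sequence. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example uses only the first row of the gene sequence rather than the full listing.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string.

    All whitespace (spaces between blocks, line breaks) is removed.
    """
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence as printed in this record.
first_row = "ATGAACGATT CGACCGCCAC GCATGCGTAT GCGCTGCCCT TCGGCGCGAC GTGCGTCGAC"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Applying `gc_percent` to the full cleaned gene sequence should give a value comparable to the GC content field, assuming that field was computed over the same 1905 bp.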