Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_1541 |
Symbol | treZ |
ID | 4881924 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 1503960 |
End bp | 1505864 |
Gene Length | 1905 bp |
Protein Length | 634 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 640127469 |
Product | malto-oligosyltrehalose trehalohydrolase |
Protein accession | YP_001058582 |
Protein GI | 126438378 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0296] 1,4-alpha-glucan branching enzyme |
TIGRFAM ID | [TIGR02402] malto-oligosyltrehalose trehalohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.465785 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGATT CGACCGCCAC GCATGCGTAT GCGCTGCCCT TCGGCGCGAC GTGCGTCGAC GCGCGGCACA CGCGCTTTCA CCTCTGGGCG CCCGCCTGCC GCAGCGCCGC CGTCGAACGG CAAGACGGCG AGACCGTCGC GATGACGCCC GACGGCGGCG GCGGCTTCGA CGCGATCGTG CGATGCGGGC CCGGCACGCT GTACCGCTAC CTGCTCGACG GCACGCTGCG CGTGCCCGAT CCCGCGTCGC GCTTCCAGCC GCACGGCGTG CACGGCCCGA GCGAGGTCGT CGATCCGGCC GCCTACCGCT GGCGCCACGA CGACTGGCGC GGACGGCCGT GGCGCGAGAC CGTGCTGTAC GAGGTGCACG TCGGCGCGTT CGGCGGCTAT GCGCGGATCG CGCGCCGGCT CCCGGCGCTC GCCGCGCTCG GCATCACCGC GATCGAGCTG ATGCCCGTCA ACGCGTTCCC CGGCGCGCGC AACTGGGGCT ACGACGGCGT GCTGCCGTTC GCGCCCGACG CATCCTACGG GCCGCCCGAC GCGCTGAAGT CGCTCGTCGA CGCCGCGCAC GGGCTCGGGC TTCAGGTGCT GCTCGACGTC GTCTACAACC ACTTCGGCCC CGACGGCAAC CTGCTGCCCC GCTACGCGCC CGCGTTCTTC CGCCGCGACC GCGATACCGC ATGGGGGCCG GCCATCGATT TCTCGTGCCC GCAGGCGGGC GCGTTCTTTC TCGAGAACGC GCTGTACTGG CTCGACGAAT ACCGGTTCGA CGGACTGCGG ATCGACGCCG CGCACGCGAT CGGCGACGAC GCGTGGCTCG CCGGCCTCGC CCGCCGCGTG CGCGCCTACG CGGGCGACGC GCGCCACGTG CATCTCGTGC TCGAAAACGA ACGCAACACC GCGAGCCTGC TCGCCGGCGG CCGCTTCGAC GCGCAATGGA ACGACGATTT CCACAACAGC ATGCACGTGC TGCTGACGGG CGAGCAGGCG GGCTACTACC GCGCGTATGC CGACGCGCCG ATCCGCCACC TCGCGCGCGT GCTCGGCGAA GGTTTCGCGT ATCAGGGCGA GCCGTCGCCG CTGCACGGCG GCGCACCGCG CGGCGAACCG AGCGCGGACC TGCCGCCGAG CGCGTTCGTC GCGTTCCTGC AGAACCACGA TCAGATCGGC AACCGCGCAT TCGGCGAGCG GCTGCGCGCG CTCGTGAACG ACGACGCGCT GCGCGCGGCG AGCGCGCTCG CGCTGCTTGC GCCGCAGATT CCGCTGCTGT TCATGGGCGA GGAATGCGGC ACGACGCAAC CGTTCCAGTT CTTCACCGAT CATCGCGGCG CGCTCGCCGC AGCGGTGCGC GAAGGCCGCC GCCGCGAATT CGCCGCCTTC CCCGCGTTCG CCGATCCCGC CCATCGCGAC GCGATTCCCG ACCCGAACGA CCCGGCGACG TTCGCCCGCT CGTCGCTCGC CGCGCCGGGC GCCGAGCCGC CCGACGCGAA CGCGTGGCGG CGCTTCTACC GCGGCGCGCT CGCCGTGCGC GCGCGCTTCG TCACGCCGTG GCTCGACGGC GCGCGCGCGC TCGGCGCGAC GGTGCTCGCG CGCGCGGACG GCGGCCACGC GAACGCGCTC GTCGCGCGCT GGCGCCTCGG CGACGGCAAC ACGCTCGCGA TCGCGTTGAA TCTGGACGCC CGGCCGGCCG CGCTCGCCGC GCCGCCCGAC GGCAAGATCG TGTTCGAGAC CCCGCCGCGC GCACGCGACG CGCTCGCCGA CGCACGGCTC GCCGCGCACG CGTGCATTGC CTGGCGCAGC GGGAACGTGA ACGGCGTCGC GCGGCGCGGC CGCGCCGTCG ACGCGAAGCA CATGCCGGCC GTGAAGCACG CGAACGGCGT GAACGGCCAG GACGGCGCGC CATGA
|
Protein sequence | MNDSTATHAY ALPFGATCVD ARHTRFHLWA PACRSAAVER QDGETVAMTP DGGGGFDAIV RCGPGTLYRY LLDGTLRVPD PASRFQPHGV HGPSEVVDPA AYRWRHDDWR GRPWRETVLY EVHVGAFGGY ARIARRLPAL AALGITAIEL MPVNAFPGAR NWGYDGVLPF APDASYGPPD ALKSLVDAAH GLGLQVLLDV VYNHFGPDGN LLPRYAPAFF RRDRDTAWGP AIDFSCPQAG AFFLENALYW LDEYRFDGLR IDAAHAIGDD AWLAGLARRV RAYAGDARHV HLVLENERNT ASLLAGGRFD AQWNDDFHNS MHVLLTGEQA GYYRAYADAP IRHLARVLGE GFAYQGEPSP LHGGAPRGEP SADLPPSAFV AFLQNHDQIG NRAFGERLRA LVNDDALRAA SALALLAPQI PLLFMGEECG TTQPFQFFTD HRGALAAAVR EGRRREFAAF PAFADPAHRD AIPDPNDPAT FARSSLAAPG AEPPDANAWR RFYRGALAVR ARFVTPWLDG ARALGATVLA RADGGHANAL VARWRLGDGN TLAIALNLDA RPAALAAPPD GKIVFETPPR ARDALADARL AAHACIAWRS GNVNGVARRG RAVDAKHMPA VKHANGVNGQ DGAP
|
| |