Gene Information |
Locus tag | BURPS668_1540 |
Symbol | malQ |
ID | 4882424 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 1501735 |
End bp | 1503963 |
Gene Length | 2229 bp |
Protein Length | 742 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | 640127468 |
Product | 4-alpha-glucanotransferase |
Protein accession | YP_001058581 |
Protein GI | 126441295 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1640] 4-alpha-glucanotransferase |
TIGRFAM ID | [TIGR00217] 4-alpha-glucanotransferase |
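
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch and is not part of the database record. The function names `gene_length` and `protein_length` are hypothetical, and the sketch assumes 1-based inclusive coordinates and a CDS that ends in exactly one stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


if __name__ == "__main__":
    length = gene_length(1501735, 1503963)   # values from the record above
    print(length)                  # 2229
    print(protein_length(length))  # 742
```

With the values listed for BURPS668_1540, the sketch gives 2229 bp and 742 aa, which agrees with the Gene Length and Protein Length fields. The strand does not affect the length calculation.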
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGCCG ATCCGCGTCA TCCGTCGATC GTCGAGCTCG CGGACGCGGC CGGGCTCGAC GCGCACTGGA TCGACGCATC GGGCTTCGCG CGGCGGGTCG GCGACGACAT GCTCGCGGCG TTGCTCGACG CGCTCGGCTA CCCGTGCGAC ACGCGCGCGG CGCGCGCCGA GAGCGCCGCG CGGCTCGCGG CGCACGCGCA GGCCGCGCCG CGGCTCGTGA CGGGCGACGT CGACGCGCCG CTCACGCTGC CCGCCTCGCT CGGACGCCCC GGCGCGTGCT ACCGGGTCAC CCTCGAAAAC GGCGACGCCG TCGCCGGCCG CTTCGCGCAG CCCGCCGGCG CAGCCGTGCT GCCGCCGCTC GCGACGCCCG GCTACCACGT GCTCGACACC GGCGAGCAGC GCGCAACGCT CGCCATCGCG CCGCCATCCG CATGGACGCC CGCCGACGCG ATGCGCGCCG CGGCGGCCGG CGGCCGCGAA CGCCCGCCGC CGTGGGGCCT CGCGGCGCAA CTCTACGGGC TGCGCCGCGA GGCCGACGGC GGCATCGGCG ATTTCACCGC GCTTGCCGCC TGCGCGCGCG CCGCCGCCCG GCGCGGCGCG CACGCGCTCG CGATCAGCCC GACGCAGGCG GCCTTCCCCG CGCTGCCCGA GCGCGACAGC CCGTACTCGC CTTCGTCGCG GCTTTGGCGC AACGCCGCCT ACATCGACGT CGAAACGGTG CTCGGCGCGC ACGCGGCGCG CGCGGCGATC GCCGATGCGG GGCTCGCCGC GCAATGGAGC GCGCTCACGC GCGCGCCGCT CGTCGATTGG CCCGGCGCGG TGCCGGCGAA GCTGCGCGTG CTGCGCCTGC TGTTCGACCG CTGGCGCGCG CAGGCGCCCA CGGGCGCGGA TGCGGGCCCG CGGGCGTTCG CGCGCTTTCG GGCACGGCAC GCCGGCGCGC TCGACGCACA TGCGACGTTC GACGCGTTGC AGGCGTGCTG CATCGACAAC GGCATCGGCG CGGACTGGCG GCGTTGGCCG CCCGCATGGC GCACGCCCGA TGCGCCGGAC GTCGCCGCGT TCGCGCGCGC GCACGCGGAC GACATCGCGT TCCACGCCTT CCTGCAATGG CAGGCCGCGC GCGGGCTCGC GGCCGCGCAA CGCGCCGCGC GCGGCCGCGG CATGGCGATC GGCCTGATCG CGGACTTGCC GGTCGGCTGC GACGCGGCGG GCAGCGACGC GTGGCGGGAT GGCGATGCGA TGCTGCGCGG CCTGTCGATC GGCGCGCCCG CCGATCCGTT CAACGCGCGC GGCCAGGCAT GGGGCGTCAC CACGTGGACG CCGACGGCGC TGCGCGCGCG GGGCTTCGCG CCGTTCGTCG AATGTCTGCG CGCGGGCTTC GCGCACGCGG GAGGCGTTCG CATCGATCAC GTGCTCGGCC TCGCGCGGCT GTGGGTGGTG CGCGACGGCG CGCCGCCGCG CGACGGCGCG TACCTGCGCT ATCCGCGCGG CGATTTGCTG CGGCTCGCCG CGCTCGAATC GTTCCGGCAT CGCGCGATCG TCATCGGCGA GGATCTCGGC ACCGTGCCGG CGGATTTCCG CGCGCGGATC GCCGCGCGCG GCATCGTCGG CCTGCGCGCG CTCTGGTTCG AACGGGACCC GGCGGGCGCG TTCCGCGCGC CGGGCGATTG GGATCGTCAC GCGGCCGCGA CGAGCTCGAC GCACGATCTG CCGACGGTCG CGGGCTGGTG GCGCGGCGTC GATCTCGGCT GGCGATGGCG CGCGGCCGCC TCCGCCTCGG CCTGCGCGCC CGCGTCCCCC 
CTCGCTCCCG CCTCCGACGC CGAGGCCGAG GCCGAGGCGA ACGCCCCGCC CGCGCGGCCC GGCGAATCGG AGGTCGCCGG GCCGGATGCG CTGCCGCCCG AGGTGCGCGA CATGCGCCGC GCCGAGCGCG CGGCGCTCTG GCGCGCGCTG CAGCAAGCCG GCGTCGCCGC GCGCGGGCAA AAGATGCCGC CGCGGGATGC GCCCCCCGTC GGTGCGATAC TCGCGTACGT CGCGCAAGCG CCCGCGCCGC TCGCGATCTT TCCGCTCGAG GATCTGCTCG CGCTCGAGGG TCAGCCGAAC GTGCCGGGCC CGCCGTGCGG GCACCCGAAC TGGCGGCGGC GCATGCCGCG CTCCGTCGAC GCGCTGTTCG ACGCGCCGGC GCGCACGCGC ATCGCCGCCG TGCGGCGCGC GAGGAAGCGC GCGCGATGA
|
Protein sequence | MTADPRHPSI VELADAAGLD AHWIDASGFA RRVGDDMLAA LLDALGYPCD TRAARAESAA RLAAHAQAAP RLVTGDVDAP LTLPASLGRP GACYRVTLEN GDAVAGRFAQ PAGAAVLPPL ATPGYHVLDT GEQRATLAIA PPSAWTPADA MRAAAAGGRE RPPPWGLAAQ LYGLRREADG GIGDFTALAA CARAAARRGA HALAISPTQA AFPALPERDS PYSPSSRLWR NAAYIDVETV LGAHAARAAI ADAGLAAQWS ALTRAPLVDW PGAVPAKLRV LRLLFDRWRA QAPTGADAGP RAFARFRARH AGALDAHATF DALQACCIDN GIGADWRRWP PAWRTPDAPD VAAFARAHAD DIAFHAFLQW QAARGLAAAQ RAARGRGMAI GLIADLPVGC DAAGSDAWRD GDAMLRGLSI GAPADPFNAR GQAWGVTTWT PTALRARGFA PFVECLRAGF AHAGGVRIDH VLGLARLWVV RDGAPPRDGA YLRYPRGDLL RLAALESFRH RAIVIGEDLG TVPADFRARI AARGIVGLRA LWFERDPAGA FRAPGDWDRH AAATSSTHDL PTVAGWWRGV DLGWRWRAAA SASACAPASP LAPASDAEAE AEANAPPARP GESEVAGPDA LPPEVRDMRR AERAALWRAL QQAGVAARGQ KMPPRDAPPV GAILAYVAQA PAPLAIFPLE DLLALEGQPN VPGPPCGHPN WRRRMPRSVD ALFDAPARTR IAAVRRARKR AR
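
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation routine. This is a minimal Python sketch, not the method used by the database. The names `clean`, `translate` and `gc_fraction` are hypothetical. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ only in permitted start codons), it assumes the input is already in the coding orientation, and it ignores ambiguous bases.

```python
from itertools import product

# Standard codon assignments, bases ordered T, C, A, G at each position.
# Translation table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the blocks-of-ten spacing used in the record and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence, copied from the record above.
    head = "ATGACCGCCG ATCCGCGTCA TCCGTCGATC"
    print(translate(head))  # MTADPRHPSI
```

The first 30 bases translate to MTADPRHPSI, the first ten residues of the listed protein sequence. Passing the complete gene sequence to `translate` and `gc_fraction` would allow the Protein sequence and GC content fields to be checked in the same way.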
|
| |