Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A2145 |
Symbol | |
ID | 4903901 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | - |
Start bp | 2102657 |
End bp | 2104219 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 640145250 |
Product | endo-1,4-D-glucanase |
Protein accession | YP_001076178 |
Protein GI | 126455862 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3405] Endoglucanase Y |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.246354 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCGGC AAGCGGGGCG AGACGAAGGC GTCGGGCGCA TCGGGCGCAT CGGGCGCATC GGGCGCATCG GGCGCATCGG GCGTGAAGCG GGCGACGCGC GCGAGGCGGC CGGGGCGGGC GAGGCGTGCG GGAGGCGCCA CGCGCGGGCC GACGCGTGGG CGCGGCGGGC GTCGCGAGCA TCGCGCGGGT GGCGCACGTT TTGGCGCGGG GGCGGGCGTG GGCGCGTGCG CGCCGTCGTG GCGAGCGTCG TGGCGAGCTT CTCGGTGATG GCGTTCGCGG CGGCGACGTT GCCGGTGTCG TGGCGCGTCG CCGCGGCGGC GGAGCGTACG CGAAGCGGCG GCGAACGCGC GGGCGGGCTG CGCGACACGG CGGGCTTGAT CGAGATCTCG GCGGCGGCGC CGGCCTCGAC GCCGATCCCG GCGGCGCCGC GGCGGTTCGC GCAGCCGTTC GCGCAGCCGG CTCGCGCGTT CGCCGTCGCG AGCGCCTGCG CGCCGTCCTG GCCGCGCTGG GACCGTTTCA AGCGTGACTT CGTATCGGCC GACGGCCGCG TGATCGACGT CGGCTCGGCC GACGAGCGGA CCGTATCCGA GGGGCAGGCG TACGGCCTTT TCTTCGCGCT CGTCGCGAAC GACCGCGCGG CGTTCGACGC GCTGCTGCGC TGGACCGAGG ACAATCTCGC GCAGGGCGAT CTGAGCGCGC GTCTGCCCGC GTGGCTGTGG GGCCGCGCGG CCGACGGCGC GTGGCGCGTG CTCGATGCGA ACGCCGCGTC CGACGCCGAT CTGTGGCTTG CGTACGCGCT GCTCGAAGCG GGGCGCTTGT GGCGCGAGCG CAGCTACACG GCGCGCGGCG CGTTGCTCGC GAAGCGCGTG CTCGACGAGG AGACCGCGAC GCTGCCGGGG CTCGGTCTCG TGCTGCTGCC GGGCCCGACG GGTTTTCGGC CGGCGCGCGA CGCGTGGCGG CTGAATCCGA GCTATTCGCC GCCGCAGGCG ATTCGCGGGA TCGGCGCGCA TGTGCCCGAC GACGCGCGCT GGGCGCGGCT CGCGGCGGGC GTCGGCCGCG TGCTGACCGA CAGCGCGCCG CGCGGCTTCG CGCCGGACTG GGCGCTGTAT CGCGCGGGCC GCGGCTTCGA GCCGGACGCC GAAACGCATG CGGCGAGCGC GTACAACGCG ATTCGCGTCT ATCTGTGGGC GGGCATGCTC GATGCGGGCG ACCCGTTGGC GCGGCCGCTC GTCGCGCATT TCGCGCCGTT CGCCGAGCAT GTCGCCGCGC ATGGCGCGCC GCCGGAGGCG GTCGATGCGA CGACGGGCGC GGCCGCCCCG CGCGACGGCA ATGCCGGGTT TTCCGCGGCG GCCGTGCCGT TTCTCGAGGC GCGCGGCGAG CGGGCGAGCG CCGACGCGCA GCTCGCGCGC GTCGCGCGGC TCGAGCGCGA GACGGCGAGC GGCTATTACG CGAACGTGCT GACGCTGTTC GGGCTCGGCT GGCGCGACGG GCGCTACCGG TTCGCGGCCG ACGGCACGCT GCGGGTGCGA TGGAGCGAGC CGTGCTCGAC GCCCGCGCGT TGA
|
Protein sequence | MARQAGRDEG VGRIGRIGRI GRIGRIGREA GDAREAAGAG EACGRRHARA DAWARRASRA SRGWRTFWRG GGRGRVRAVV ASVVASFSVM AFAAATLPVS WRVAAAAERT RSGGERAGGL RDTAGLIEIS AAAPASTPIP AAPRRFAQPF AQPARAFAVA SACAPSWPRW DRFKRDFVSA DGRVIDVGSA DERTVSEGQA YGLFFALVAN DRAAFDALLR WTEDNLAQGD LSARLPAWLW GRAADGAWRV LDANAASDAD LWLAYALLEA GRLWRERSYT ARGALLAKRV LDEETATLPG LGLVLLPGPT GFRPARDAWR LNPSYSPPQA IRGIGAHVPD DARWARLAAG VGRVLTDSAP RGFAPDWALY RAGRGFEPDA ETHAASAYNA IRVYLWAGML DAGDPLARPL VAHFAPFAEH VAAHGAPPEA VDATTGAAAP RDGNAGFSAA AVPFLEARGE RASADAQLAR VARLERETAS GYYANVLTLF GLGWRDGRYR FAADGTLRVR WSEPCSTPAR
|
| |