Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_0540 |
Symbol | |
ID | 4885556 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 511967 |
End bp | 514468 |
Gene Length | 2502 bp |
Protein Length | 833 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640126468 |
Product | glycosy hydrolase family protein |
Protein accession | YP_001057593 |
Protein GI | 126440590 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3525] N-acetyl-beta-hexosaminidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.152646 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCGAA TTTCGCATTC CCTGTGCGCC GCGCTATTGG CCGCCGCGAC GCTGTTGCCC ACGGCCTCGC GCGCGCAACT GCCCGCGCGC CCCACGGCCG GCGCGGCCGC GCCCGCGACG GCCGCGCCCG TGCGGCCCGC GTCCACGCCG GCCGAGCTCG CCGCGCGGCT CGCCGACGGC CTCGCGGTGC GCGTGGCCGT CGACAACAAT CACGCGGCAT CGGCCGGCGT GCCGTGCGCC GATCTCGGCG CGGATTGGGC GAGCTGCGCG ACGGGCCGCC TGATCCTGCA GAATCGCGGC CACTCGCCCC TCACCGACGG CGGCTGGAAG CTCTATCTGC ACAGCATCCG CCGGCTGCTC CGAATCGACC GCCCCGGCTT CACGCTGCGC CATCTGACGG GCGATCTGTA CGAGCTGACG CCGCAGCCCG GCACGGTAAG GCTCGCGCAG GGCGAGCGCA TCGAGCTGCC GTTCGTCGCC GAATACTGGC TGCGCCGCTA CAGCGACGTG ATCCCGCGCC CGTACGTGGT CGTCGACGGC GCGGCGCCCG CGGTGCTGCG CTACGACGAT ACCGACGACG AGCTGCGCTA CGTCGAAACG CTGCCCGCCG ACGCGCAGAA CAATTCGCCC GGCAATGCGC CGCCCGCCGC CGCGCAGCCG GTGGCGAACC GCGCGCTGCC GAGCGTGAAG CGGCAGCGCG CGCTGCCCGG CGCGCTCGAT CTGCGCGGCG TCGAGCTGAC GCTGCCGGAG CTGCCGTCCG CGCAGGTCGC GGCGCTGCGC GAACGCGCGG GCACGCTCGG CCTGGACGGC GCGCGCGTGC CGGTGTGGGG CGTCGTCGCG CCGCGCCGGC TGCCCGCCGA CATCGCGGTG CCGGGCGGCT ACCGGCTCGC GATCGGCCCG CGCGGCGCGT TCATCGAGGG GGCCGATCGC GCGGGCCTCT ACTACGGCGT GCAGACGCTC TTCTCGCTCG TGCCGGCCGG CGGCGCGACG GTGCCCGCGA TGCTGATCGA AGACGCGCCG CGCTTCACGC ACCGCGGGAT GCACGTCGAT CTCGCGCGCA ACTTCAAGCC GCCCGCCACG CTGCGCCGGC TGATCGACCA GATGAGCGCG TACAAGCTCA ACCGGCTGCA TCTGCACCTG TCCGACGACG AGGGCTGGCG CATCGAGATT CCCGGCCTGC CCGAGCTGAC CGACGTCGGC GCGCGCCGCT GCCACGACCC GAGCGAGACG CGCTGCCTGC TGCCGCAGCT CGGCTCGGGG CCCGACGATC GTTCGGGCGG CGGCTACCTG ACGCGCGACG ACTACGTCGC GCTGTTGCGC TACGCGGCCG AGCGCTTCGT CGAAGTGATC CCCGAGATCG ACATGCCCGC GCACTCGCGC GCGGCCGTCG TATCGATGGA GGCGCGCTAT CGCCGCCTGC ACGCGGCGGG CCGCGAGCGG GAAGCGAACG CGTATCGGCT GCTCGATGCG CAGGACACGT CGAACCTGCT GACCGTGCAG TTCTACGACC GGCGCAGCGA TCTGAACCCG TGCATGCCGG GCGCGCTGAA CTTCGCGTCG AAGGTGCTCC GCGAGATCGC GTCGATGCAC GCGGACGCGC AAGCGCCGCT GCGGATCTGG CACTTCGGCG GCGACGAGGC GAAGAACATC CTGCTCGGCG CGGGCTTCCA GCCGCTCGAC GGCGCCGATC CCGGCAAGGG CCGCGTCGAT CTCGCCGCGC AGGACAAGCC GTGGGCGCGC TCGCCCGCCT GTACGGCGCT GCTTCGGCGC GGCGAGATCA AATCGATCGA TGAATTGCCG ACGCGCTTCG CGAAGCAGGT CAGCGCGATC GTGAACGCGA ACGGAATCGG CACGATGGCC GCGTGGCAGG ACGGCATCAA GCACGCGAGC GGGCCGCGGG AGTTCAGCAC GCGGCACGTG ATGGTGTCGC TGTGGGACAC CATCTTCTGG GGCGCGTCCG ACAGCGCGCG CGATCTGAGC GCGAAGGGCT ACCGGACCGT GCTCGCGCTG CCCGATTACC TGTACTTCGA TTTCCCGTAC ACGCGCAATC CGCGCGAGCG CGGCTATTAC TGGGGCTCGC AGGCGACGGA CGAGTACAAG GTGTTCTCGC TCGCGCCGGA GAACCTGCCG CAGAACGCCG AGGTGTTCGG CGATCGCGAC GGCAACCCGT TCGAGGTGAC GAGTGCGGGC GCGGCGCCGA GCATCGAGGG CATCCAGGGG CAGGCGTGGG GCGAGGTGAT GCGCAACGGG CAACTGCTCG AATACATGGT GTATCCGCGC CTTCTCGCGC TCGCCGAGCG CGCGTGGCAC AAGGCTGACT GGGAACTGCC CTACGCGGCC GGCGTGCGCT ACAAGCTCGG CGACACGCAT CACGTCGACA CGGCCGCGCT CGAGCGCGAC TGGGCGGGCT TCGCGACGGT GCTCAAGCAG CGCGAACTGC CGAAGCTCGA GCGTGCGGGC ATCGGGTATC GCAAGCCGAC GTTTACGCTG ACGGGCGAAT GA
|
Protein sequence | MNRISHSLCA ALLAAATLLP TASRAQLPAR PTAGAAAPAT AAPVRPASTP AELAARLADG LAVRVAVDNN HAASAGVPCA DLGADWASCA TGRLILQNRG HSPLTDGGWK LYLHSIRRLL RIDRPGFTLR HLTGDLYELT PQPGTVRLAQ GERIELPFVA EYWLRRYSDV IPRPYVVVDG AAPAVLRYDD TDDELRYVET LPADAQNNSP GNAPPAAAQP VANRALPSVK RQRALPGALD LRGVELTLPE LPSAQVAALR ERAGTLGLDG ARVPVWGVVA PRRLPADIAV PGGYRLAIGP RGAFIEGADR AGLYYGVQTL FSLVPAGGAT VPAMLIEDAP RFTHRGMHVD LARNFKPPAT LRRLIDQMSA YKLNRLHLHL SDDEGWRIEI PGLPELTDVG ARRCHDPSET RCLLPQLGSG PDDRSGGGYL TRDDYVALLR YAAERFVEVI PEIDMPAHSR AAVVSMEARY RRLHAAGRER EANAYRLLDA QDTSNLLTVQ FYDRRSDLNP CMPGALNFAS KVLREIASMH ADAQAPLRIW HFGGDEAKNI LLGAGFQPLD GADPGKGRVD LAAQDKPWAR SPACTALLRR GEIKSIDELP TRFAKQVSAI VNANGIGTMA AWQDGIKHAS GPREFSTRHV MVSLWDTIFW GASDSARDLS AKGYRTVLAL PDYLYFDFPY TRNPRERGYY WGSQATDEYK VFSLAPENLP QNAEVFGDRD GNPFEVTSAG AAPSIEGIQG QAWGEVMRNG QLLEYMVYPR LLALAERAWH KADWELPYAA GVRYKLGDTH HVDTAALERD WAGFATVLKQ RELPKLERAG IGYRKPTFTL TGE
|
| |