Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_1728 |
Symbol | |
ID | 4899653 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | - |
Start bp | 1686197 |
End bp | 1687297 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640134958 |
Product | polysaccharide deacetylase family protein |
Protein accession | YP_001065999 |
Protein GI | 126451454 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0726] Predicted xylanase/chitin deacetylase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.169494 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGACCA ACATGTCCGA CGCGGCCGGC GCACCGGCCA CTTCATTCGC GCCCGAGCGG CGCGCGTTCC TGGCCCGTGC GGGCGGCGGG CTCGGCCTCG CGGCGGCCGG GCTCGCGCTC GGCGCGGCAG CGGCGCCGGG CCGGGCGCTC GCGGCCGGCG CCACGGCCAC CGCCGATACC GGCGCGGCGT CACCCGCCGG CGGCTCGCCG CGGCGCTCCC CCGCCGACGA ACCCGAGACG GCGCACGGCG CGTTCTGGCC GAACGGCGCG CGGCTCGTGA TCTCGATCTC GATGCAGTTC GAGGCGGGCG GCCAGCCGCC GACGGGCGCC GACGGCCCGT TCCCGCCCGT CGTCTTTCCG CCGCAGGTGC CCGTCGATCT CGCGTCCGCG ACGTGGTTCG CCTACGGCTA TCGCGAAGGC ATCCCGCGCA TGCTCGATTT GTGGGACCGG CACGGCGTGA AGGTCACCTC GCACATGATC GGCGAGGCCG TGCACCGCCG GCCGGATCTC GCCCGAGAGA TCGTCGCGCG CGGCCACGAG GCGGCCGGAC ACGGGCCGCG CTGGAGCGCG CAGTACGCAC TCCCCCGCGA CGAAGAGCGG CGCTTCCTGA TCGCCGCCCG CGAGATGGTC GAAACCGCGA CGGGCGCGCG GCCCGTCGGC TACAACTGCA ACTGGCTCAG ACGCGGGCCG AACACACTGC CGCTGCTGCA GGAGCTCGGC TATCTGTACC ACATCGACGA CGTGAGCCGC GACGAGCCGT TCATCGAGCA GGTGAACGGC CAGGATTTCG TCGTCGTGCC CTACACGCTG CGCAACAACG ACATCCTGCT GATCGAAGGT CGCAACTATT CGCCCGGGCA ATTCCTCGAG CAGATCAAGC TCGACTTCGA TCAGTTGTAC GACGAAGCCG CCACGCGGCG GCGCATGATG TCGATCAGCG CGCACGACCG GATCAGCGGC ACGCCGCAGA TGGTGCGCGC GTGGGACGCG TTCCTGCGCT ACGCGCAATC GCATCCGGGC GTCGCGTTCA TGCGCAAGGA CGACATCGCC CGCCATGCAC GGCGTAGCCC GCTCACGCTG CGCGAACCCG AAACCCTCTG A
|
Protein sequence | MTTNMSDAAG APATSFAPER RAFLARAGGG LGLAAAGLAL GAAAAPGRAL AAGATATADT GAASPAGGSP RRSPADEPET AHGAFWPNGA RLVISISMQF EAGGQPPTGA DGPFPPVVFP PQVPVDLASA TWFAYGYREG IPRMLDLWDR HGVKVTSHMI GEAVHRRPDL AREIVARGHE AAGHGPRWSA QYALPRDEER RFLIAAREMV ETATGARPVG YNCNWLRRGP NTLPLLQELG YLYHIDDVSR DEPFIEQVNG QDFVVVPYTL RNNDILLIEG RNYSPGQFLE QIKLDFDQLY DEAATRRRMM SISAHDRISG TPQMVRAWDA FLRYAQSHPG VAFMRKDDIA RHARRSPLTL REPETL
|
| |