Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_3486 |
Symbol | |
ID | 4902669 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | - |
Start bp | 3391532 |
End bp | 3392527 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640136712 |
Product | carbohydrate ABC transporter periplasmic sugar-binding protein |
Protein accession | YP_001067723 |
Protein GI | 126455234 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACGCA GAACGTTCAT CACGCTGGCG GCAGCGGCGA CGGTCGCGGC GGCGGGCCTG CCCGCGCAGG CGGCCGAGCC CGTGAAGATC GGCTTCCTCG TCAAGCAGCC CGAGGAGCCG TGGTTCCAGG ACGAATGGAA ATTCGCCGAG CTCGCCGCGA AGGACAAGGG CTTCACGCTC GTGAAGATCG GCGCGCCGTC CGGCGAGAAG GTGATGAGCG CGATCGACAA TCTCGCCGCG CAGAAGGCGC AGGGCTTCAT CATCTGCACG CCGGACGTGA AGCTCGGGCC GGGCATCGTC GCGAAGGCGA AGTCGCACGG CCTGAAGATG ATGACGGTCG ATGACCGGCT CGTCGACGGC GCGGGCAAGC CGATCGAATC GGTTCCGCAC ATGGGCATTT CCGCGTACGA CATCGGCAAG CAGGTCGGCG GCGGGATCGC GGCCGAGATC AAGAGGCGCG GCTGGAACAT GAACGAAGTC GGCGCGATCG ACATCACGTA CGAGCAGTTG CCGACCGCGC ACGACCGCAC GACGGGCGCG ACCGACGCGC TCGTCGCCGC AGGCTTTCCG AAGGCGAACG TGATTGCCGC GCCGCAGGCG AAGACCGACA CCGAGAACGC GTTCAACGCG GCGAACATCG CGCTCACGAA GAATCCGAAG TTCAAGCACT GGGTCGCCTA CGGCCTGAAC GACGAAGCGG TGCTCGGCGC GGTGCGCGCG GCCGAAGGGC GCGGCTTCAA GGCGGCCGAC ATGATCGGCA TCGGCATCGG CGGCTCGGAC TCGGCGCTCA GCGAGTTCAA GAAGCCGCAG CCGACCGGCT TCTTCGGCAC CGTGATCATC AGCCCGAAGC GGCACGGCGA AGAGACTTCC GAGCTGATGT ACGCGTGGAT CACGCAAGGC AAGGCGCCGC CGCCGCTCAC GCTGACGACG GGCATGCTCG CGACGCGCGA GAACGTCGCG CAGGTGCGCG AGACGATGGG GCTCGCGGCG AAGTGA
|
Protein sequence | MKRRTFITLA AAATVAAAGL PAQAAEPVKI GFLVKQPEEP WFQDEWKFAE LAAKDKGFTL VKIGAPSGEK VMSAIDNLAA QKAQGFIICT PDVKLGPGIV AKAKSHGLKM MTVDDRLVDG AGKPIESVPH MGISAYDIGK QVGGGIAAEI KRRGWNMNEV GAIDITYEQL PTAHDRTTGA TDALVAAGFP KANVIAAPQA KTDTENAFNA ANIALTKNPK FKHWVAYGLN DEAVLGAVRA AEGRGFKAAD MIGIGIGGSD SALSEFKKPQ PTGFFGTVII SPKRHGEETS ELMYAWITQG KAPPPLTLTT GMLATRENVA QVRETMGLAA K
|
| |