Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_0371 |
Symbol | |
ID | 4881995 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | + |
Start bp | 340058 |
End bp | 341071 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640126298 |
Product | sulfate/thiosulfate ABC transporter, periplasmic sulfate-binding protein |
Protein accession | YP_001057423 |
Protein GI | 126440512 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1613] ABC-type sulfate transport system, periplasmic component |
TIGRFAM ID | [TIGR00971] sulfate/thiosulfate-binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.597645 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCACA ACAAGGGAAT CGGCCGTCGG CTCGCGGCCG GCGTCGCCGC GGCGCTGCTC GCGACGGCCG CGCATGCGGA CACAGCGCTG TTGAACGTGT CCTACGACGT CACGCGCGAA CTGTACAAGG ACATCAACGC GAGCTTCGTC GCCGCATACA AGCAGAAGAC GGGCGAGACG GTATCGGTGC GGCAGTCGCA CGGCGCGTCG AGCGCGCAGG CGCTGTCGGT GCTGCAGGGG TTGCAGGCCG ACGTCGTGAC GATGAACCAG CCGAACGACA TCGATCTGCT CGCCGAGCGC GGCCAGTTGC TGCCGAAGAA CTGGCGCACG CGTCTGCCGA ACGGCAGCTC GCCGTACTCG ACGACGATGG TGTTCCTCGT GCGCCACGGC AACCCGAAGC AGATCAAGGA CTGGAGCGAT CTCGCGAAAC CCGGCGTGCA GGTGATCATC GCGAATCCGA AGACCTCGGG CAACGGCCGC TACGCGTATC TCGCCGCGTG GGGCTACCAG AAGCTCAAGG GCGCGACCGA TCAGCAGGCG CTCGATTTCG AGAAGGCGAT CTTCCGCAAC GTGCCGGTGC TCGATTCCGG CGGCCGGGGC GCGACGACGA CGTTCACGCA GCGCGGCATC GGCGACGTGC TCGTCACGTT CGAAAACGAG GTCGCGCTGA TCGATTCGGG CGCGGCGGGC GCGAGCTTCG ATGCGGTCTA TCCGTCTGCG AGCATTCTCG CGGAGCCGCC CGTGGCGATC GTCGACAAGG TCGTCGACAA GAAAGGCACG CGCCGCGCCG CGCAGGCGTA CCTCGACTAT CTGTATTCGC CCGCCGCGCA GGAGATCGTC GCGCAGCATC ATCTGCGTCC GCGCGATCCG AACGTGCTGA AGCAGCACGC GAGCGAGTTC AAGCCGCTCA AGACGTTCAC CGTCGAGCAG ATCTTCGGCA ACTGGGCGAA CGCGCAGAAG ACGCATTTCG CCGACGGCGG CACGTTCGAC AAGATCATCG TCGATCGCAA GTAG
|
Protein sequence | MKHNKGIGRR LAAGVAAALL ATAAHADTAL LNVSYDVTRE LYKDINASFV AAYKQKTGET VSVRQSHGAS SAQALSVLQG LQADVVTMNQ PNDIDLLAER GQLLPKNWRT RLPNGSSPYS TTMVFLVRHG NPKQIKDWSD LAKPGVQVII ANPKTSGNGR YAYLAAWGYQ KLKGATDQQA LDFEKAIFRN VPVLDSGGRG ATTTFTQRGI GDVLVTFENE VALIDSGAAG ASFDAVYPSA SILAEPPVAI VDKVVDKKGT RRAAQAYLDY LYSPAAQEIV AQHHLRPRDP NVLKQHASEF KPLKTFTVEQ IFGNWANAQK THFADGGTFD KIIVDRK
|
| |