Gene Information |
Locus tag | BURPS668_A1806 |
Symbol | |
ID | 4886907 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 1764254 |
End bp | 1765324 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640131744 |
Product | hypothetical protein |
Protein accession | YP_001062801 |
Protein GI | 126443047 |
COG category | [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism; [R] General function prediction only |
COG ID | [COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily |
TIGRFAM ID | |
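
The coordinate and length fields in this record are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length(1764254, 1765324)
print(length)                  # 1071
print(protein_length(length))  # 356
```

Both values agree with the Gene Length and Protein Length rows above.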
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGCCC GCACGAACGC GGCTTTCACC GCCCTCCTCG CCGCCGCGCT GTTCGGCGCC ACCACGCCGC TCGCGAAGAC GCTGCTCGGC TCGCTCACGC CGTTCATGGT CGCGGGCCTG TTCTATCTCG GCAGCGGCGT CGGCCTCGGG GCGTTCATGC TGATGCGCCG GCTCGCGCGC GGCGCCGGCG CCGGCGCATC GCCCGCCGGC CACGCGCGGC TGCCGCTTGC CGAGCTCCCG TGGCTCGCGG GCGCGGTCGC GGCGGGCGGC ATCGCGGGCC CGGCGCTGCT GATGCTCGGC CTCGCGACGA CGCCCGCCGC GACGAGCGCG CTGCTGCTCA ATCTCGAAGG CGTGTTCACC GCGCTGATCG CGTGGGCCGT ATTCCGCGAG AACGTGGATG CGCAGATTTT CGCCGGCATG GCCGCGATCG TCGCGGGCGG CGTGCTGCTG TCGTGGCATC CGGGCGCGGC GGGCGTGCCG CTCGGCGCGC TGCTCGTCGC GGCCGCCTGC GCGTGCTGGG CGATCGACAA CAACCTGACG CGCAAGGTCT CGACTCACGA CGCCGCGGCG ATCGCGTGCG TCAAGGGCCT CGTCGCCGGC ACGGTCAACC TCGGCATCGC GCTCGCGCTC GGCGCGCGGC TGCCCGCCGC CGCCGACAGC GCGGCCGCGA TGCTCACGGG CTTCGCCGGC TATGGCGTGA GCCTCGTGCT GTTCGTCGTC GCGCTGCGCA ATCTCGGCAC CGCGCGGACC GGCGCGTATT TCTCGGTCGC GCCGCTGTTC GGCGTCGGGC TGTCGCTCGC GCTGTGGCCC GAATGGCCGC CGCTGTCGTT CTGGGCCGCC GCGGCGCTGA TGGCGCTCGG CATCTGGCTG CACCTGCGCG AGCGCCACGA GCATCCGCAT ACGCACGAGG CGCTCGAGCA CAGCCATCGG CACCGGCACG ACACGCATCA TCAGCACGCG CACGACTTCG ACTGGGACGG CACGGAGCCG CACACGCACG CGCACCGGCA CACGCCGATC ACGCACACGC ATGCGCATTT CCCGGACATT CATCACCGGC ACTCGCACTG A
|
Protein sequence | MSARTNAAFT ALLAAALFGA TTPLAKTLLG SLTPFMVAGL FYLGSGVGLG AFMLMRRLAR GAGAGASPAG HARLPLAELP WLAGAVAAGG IAGPALLMLG LATTPAATSA LLLNLEGVFT ALIAWAVFRE NVDAQIFAGM AAIVAGGVLL SWHPGAAGVP LGALLVAAAC ACWAIDNNLT RKVSTHDAAA IACVKGLVAG TVNLGIALAL GARLPAAADS AAAMLTGFAG YGVSLVLFVV ALRNLGTART GAYFSVAPLF GVGLSLALWP EWPPLSFWAA AALMALGIWL HLRERHEHPH THEALEHSHR HRHDTHHQHA HDFDWDGTEP HTHAHRHTPI THTHAHFPDI HHRHSH
|
| |
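
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute a GC fraction, and translate a coding sequence. It is an illustration under stated assumptions, not the database's own pipeline: the helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical, the codon table is built by hand from the standard amino acid assignments (which translation table 11 shares with the standard code; the two differ in their permitted start codons), and only the first three blocks of the gene sequence are used as input.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T raise KeyError.
    Alternative start codons are not special-cased; this gene starts with ATG.
    """
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
excerpt = clean_sequence("ATGTCCGCCC GCACGAACGC GGCTTTCACC")
print(translate(excerpt))  # MSARTNAAFT
print(round(gc_fraction(excerpt), 2))
```

The translated excerpt matches the first block of the protein sequence. Running the same functions on the full gene sequence would give values to compare against the Protein sequence and GC content fields.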