Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A2325 |
Symbol | |
ID | 4887862 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | + |
Start bp | 2265749 |
End bp | 2266942 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640132262 |
Product | major facilitator superfamily permease |
Protein accession | YP_001063319 |
Protein GI | 126443897 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0197346 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTCGTCG TCTTCACCGC ATTGACGAAT CTCGCCGATG GCGTGACGAA GGTCGCGCTG CCGCTGATGG CGACCGCACT CACGCACTCG CCCGCGCGCA TCTCGGGCGT ATCACTGACA CTGACGCTGC CCTGGCTGCT CGTCGCGCTT CACGTCGGCG TGCTGGTGGA CCGCTTCGAT CGCCGCACGC TGCTGTGGCT CGCGAACGCG GCGCGCATGG CCGCCATGGC GCTGCTCATC GCGCTGCTGC CATCCGGCCG CGTCACGCTG CCGGTGCTGT ACGCGAGCGG CCTGACGCTC GGCCTCGCCG AGGTCGTCGC GCTGACTTCC GCGGCCGCCT TGATTCCGGA CGCCGTCGCC CCTTCGGGCC GCGAGCGCGC GAACGCATGG ATCGCCGGCG CGGAGACCGT CTGCAACGAA TTCTGCGGCC CGCTCACCGG CGGCATGCTG GTCGCGGCCG GCACGGCGAT CGCGCTCGGC GCCGTTGCCG TCGGCTACTT CGGCGGCGGC GTCGCGCTGT TTTTCCTGAT CGGGCGGTTC CGCGTCGCGC ATGCGCCGCA TGGGCGGCCG CCGCCCGTTC GCCTGCAGAT TGCCGAAGGG CTCGGATGCC TGTGGCACCA GCCGTTGCTC CGGCTGATGG CCGTCGCGCT GACGGTGCTC TGCATGTGCT GGGGCGCATG GCTCGCGCTG ATGCCGCTGT TCGCGACGAC GGTGCTCGGC CTCGACTCGC GCGGCTATGG CGTGACGGTC AGCGCGCTCG GCGTCGGCGG CTTCGTCGGC GCGCTGAGCG TCACCTTGCT GAACCGCCGC TTCGGGCGGC GCACCGTCAT GCTCACGGAT CTGCTCGGCA CCTTCGCGAT GATGGCCGTA CCGGTGCTGA GCACGAACCT ATGGGCCGTC GCGGCGAGCG CATTCGCGGG CGGCCTGGGC GGCACGCTGT GGACGGTCAA TGCGAGGACG ATCAGCCAGC ATCTCGTGCC GGGGCCGCTG CTCGGCCGCT ACAATGCGGC GGCCCGCCTG TTCAGTTGGG GAGCGATGCC GATCGGCGCG GGCTTTGCCG GCGCGATCGC GGAACTGCTG GGCATGCGCG CCGCGTTCGC GGCGCTCGCC GTCGCGGCCT TGATGTTGAT CGTGCCGTTC CTGCGCGTCG CTTCGGCGCA AGCGCTGCGA ATCGGCCCCG AACGCCGACA TTGA
|
Protein sequence | MLVVFTALTN LADGVTKVAL PLMATALTHS PARISGVSLT LTLPWLLVAL HVGVLVDRFD RRTLLWLANA ARMAAMALLI ALLPSGRVTL PVLYASGLTL GLAEVVALTS AAALIPDAVA PSGRERANAW IAGAETVCNE FCGPLTGGML VAAGTAIALG AVAVGYFGGG VALFFLIGRF RVAHAPHGRP PPVRLQIAEG LGCLWHQPLL RLMAVALTVL CMCWGAWLAL MPLFATTVLG LDSRGYGVTV SALGVGGFVG ALSVTLLNRR FGRRTVMLTD LLGTFAMMAV PVLSTNLWAV AASAFAGGLG GTLWTVNART ISQHLVPGPL LGRYNAAARL FSWGAMPIGA GFAGAIAELL GMRAAFAALA VAALMLIVPF LRVASAQALR IGPERRH
|
| |