Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_2112 |
Symbol | |
ID | 4882006 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | + |
Start bp | 2102559 |
End bp | 2103776 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640128040 |
Product | major facilitator superfamily permease |
Protein accession | YP_001059147 |
Protein GI | 126439977 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.954954 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCAAG CAAAGGCAAG ACACCCGCTC CTCGTCTGGC TGCTGATCGT CGGCACCGGC TTCGTCGTGA TGGCGCGCGC GATGAGCCTG CCGTTTCTGG CGATCTACCT GCACGAACGG ATGGGGCTCG ACGCGGCGAC GATCGGCCTG CTGCTCGGCA CGGGCGCACT CGTCGGCACG TTCGGCGGCT TCTTCGGCGG CCATCTGTCC GACGTGCTCG GCCGGCGCAA GGTGCTGACC GGCTGCCTGC TCGTATCGAG CCTGTCGTTC GCCGCGCTTC ATTTCGCGGC CGACGCGTGG CAGGTCTTCG TGATCAACCT CTTCATCAAT CTCGCGAGCT CGTTCTACGA TCCGGTCTCG AAAGCGACCA TCAGCGACAA TCTGCCGCCC GAGCAGCGGC TGCGCGCATT CGCGCGGCGC TACGTGGCGA TCAACATCGG CTTCGCGATC GGGCCGCTGC TCGGCGCGTC GCTCGGCCTG CTCGACAAAT CCCCCGTGTT CCTCATCACG GGCGCCGTCT ACCTGCTGTT CTCGATCGCG ATCTACGCGA TCACGGCCCG GCTCGTGTTC GGCCGCGCGC CGCACGAAGC GGCCGCCTCC GAGTTGCCGC TCGCCGCGAA GCTGCGCGTC ATCGGCACCG ACCGGCGCCT CGTCCTCTTC ACCGCGGGTA GCATGCTCGC GATCGCCGTG CACGGCGAAA TGTCGGTCAC GTTCTCGCAA TACCTGATCG GCGCGTTCGA CGACGGGCTC AAGATGTTCG CCTGGCTGAT GAGCACGAAC GCGATCACGG TCGTCTTGAG CCAGCCGTTG CTGAACCGCA TCGGCGAACG GCGCGGGCCG TTCACGTCGC TCACGCTCGG CGCGATCCTG CTCGCGATCG GCGCGGCCGG CTTCGCGAAT TCGCCGAACA TGATCGCGCT CGTCGTGTCG ATGGTCGTCT TCACGTGGGG CGAAGTGCTG CTGATCCCGT CGGAATACGC GGTCCTCGAC AGCATCACGC CCGAGCCGCT GCGCGGCATC TATTACGGCG CGCATTCGCT CAGCAACGTC GGCAACCTGC TCGGGCCCTG GCTTGGCGGC CTCGTGCTGC TGCACTACGG CGGCGCCGCG ATGTTCTACG GCATGGGCTT CATCGCGCTG CTCAGCCTGC TCACGTTCGC CGTCGGCTCG CAGATCAAGC CCGCGCCGGC GGGCCGGCTC GAAGTCCAGA ACCGCTGA
|
Protein sequence | MSQAKARHPL LVWLLIVGTG FVVMARAMSL PFLAIYLHER MGLDAATIGL LLGTGALVGT FGGFFGGHLS DVLGRRKVLT GCLLVSSLSF AALHFAADAW QVFVINLFIN LASSFYDPVS KATISDNLPP EQRLRAFARR YVAINIGFAI GPLLGASLGL LDKSPVFLIT GAVYLLFSIA IYAITARLVF GRAPHEAAAS ELPLAAKLRV IGTDRRLVLF TAGSMLAIAV HGEMSVTFSQ YLIGAFDDGL KMFAWLMSTN AITVVLSQPL LNRIGERRGP FTSLTLGAIL LAIGAAGFAN SPNMIALVVS MVVFTWGEVL LIPSEYAVLD SITPEPLRGI YYGAHSLSNV GNLLGPWLGG LVLLHYGGAA MFYGMGFIAL LSLLTFAVGS QIKPAPAGRL EVQNR
|
| |