Gene Information |
Locus tag | BURPS668_2212 |
Symbol | |
ID | 4883929 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | + |
Start bp | 2203525 |
End bp | 2205081 |
Gene Length | 1557 bp |
Protein Length | 518 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640128140 |
Product | major facilitator transporter |
Protein accession | YP_001059247 |
Protein GI | 126440688 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
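The coordinate and length fields above can be cross-checked against each other. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(2203525, 2205081)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the Gene Length (1557 bp) and Protein Length (518 aa) rows.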
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.508956 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGCCG CGCCGGGCCG ACCGCCGCTC TGGAGCGCCG CGAACCTGCG CGGCGATTTC TTTCCATGGG TGCTCGCGAT CGTCACCGGC CTCGATTACT TCGACAACGC CGCGTTCTCG TTCTTCGCGA GCTACATCGC GGGCGGAATC AACGCGTCGC CGGACGAGCT CGTGTGGGCG TCGAGCGCTT ACGCGGTGAC GGCCGTGCTC GGCATCCTGC AGCAGCAATG GTGGGTCGAC CGGCTCGGTC ACCGGCGTTA CGTCGCCGGC TGCATGCTGA TGTTCTCGCT TGGCGCGATG GCCGCGGCGG CGGCCGACAC GTCGCTGCAG CTCGCGTTCG CGCGCGGCTT TCAGGGCTAT TTCATCGGTC CCATGATGGG CGCGTGCCGG ATCCTGATCC AGGTCAGCTT CGCGCCGAAG GATCGCCCGC CCGCGACGCG CGCATTCCTC ATCATGCTGC TGCTCGGCAG CGCGCTCGCG CCGATCGCGG GCGGCCTGCT CGTCGCGCAC TTCACATGGC GCGCGCTGTT CGCCTGCACG GCGCCGGCCG GCATCCTGTT CGCGGCGCTC GCGTTCGTCG CGCTGCCCGA TTCCGGCCAC ACGCCGCCCG ACGAACGCGG CGGCGCGCAT TTCTGGCCGT ACGTGATCTT CGCGCTCGCG CAAGGCGCGC TGCAGATCGT CATGCAGCAG GTGCGCTTCC AGCTCTTCGC CGGCTCGCCG CTGCTCGTGC TGCTCGCCGT CGGCGGCCTC GCGGCGCTCG CGTGGTTCGG CCATCATCAG TGGCATCATC CGGCGCCGCT CGTGCGGCTG CACGCGCTTC GCGAGCGCAC GTTCCGGGTC GGCCTGCTGC TCTACCTGTT CTATTACTAC GAGACGACGG GCTACAGCTA TCTGATCTCC CGCTTCCTCG AAACCGGGCT CGGCTATCCG ATCGAGAACG CCGGGCGGCT CGTCGGCACG ATGTCGCTGA TCTCCGCGAG CGCGCTCTTC GTCTACCTGC GCTACGCGAA GCTTCTCACG CACAAGAAAT GGATCATCGT GCCCGGCTTC GCGCTCGCCG CGTTCGCCGC GCTATGGATG ACGCGGATGT CGCCCGAGGT CGGCGAAGCG GCGCTCGTCG CGCCGCTCCT GATGCGCGGG CTGCTGCTCC TGTTCATCGT GCTGCCCGTC GCGAACCTGA CGTTTCGCGT GTTCGCGATC GACGAGTATT CGCACGGCTA CCGGCTGAAG AACATCGTCC GGCAATTGAC GATTTCGTTT GCGACCGCCT CCGTCATCAT CGTCGAGCAG CATCGGCTCG CCGTACATCA GACGCGGCTT GTCGAGCAGG CGAACGTCTA CAATCCGCTG TTCCGGCAAA CCCTCGACAC GCTCACGCGC GGCTTCGCGG CCGCGGGCCA CGCGTTCTCC GACGCGCACG CGCTCGCGCT CGTCACCGTG AGCCGCACCA TCGCGCAACA GGCGTCGTTC CTGGCGTCGC TCGACGGCTT CTACTTCCTC GCGGGCGTCG CGATCTGCGG CGGCCTGTTC GCCGCCTGGC AAAAAGAGAT CGATTGA
|
Protein sequence | MSAAPGRPPL WSAANLRGDF FPWVLAIVTG LDYFDNAAFS FFASYIAGGI NASPDELVWA SSAYAVTAVL GILQQQWWVD RLGHRRYVAG CMLMFSLGAM AAAAADTSLQ LAFARGFQGY FIGPMMGACR ILIQVSFAPK DRPPATRAFL IMLLLGSALA PIAGGLLVAH FTWRALFACT APAGILFAAL AFVALPDSGH TPPDERGGAH FWPYVIFALA QGALQIVMQQ VRFQLFAGSP LLVLLAVGGL AALAWFGHHQ WHHPAPLVRL HALRERTFRV GLLLYLFYYY ETTGYSYLIS RFLETGLGYP IENAGRLVGT MSLISASALF VYLRYAKLLT HKKWIIVPGF ALAAFAALWM TRMSPEVGEA ALVAPLLMRG LLLLFIVLPV ANLTFRVFAI DEYSHGYRLK NIVRQLTISF ATASVIIVEQ HRLAVHQTRL VEQANVYNPL FRQTLDTLTR GFAAAGHAFS DAHALALVTV SRTIAQQASF LASLDGFYFL AGVAICGGLF AAWQKEID
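The relationship between the gene sequence and the protein sequence can be illustrated with a short sketch. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, not in amino-acid assignments). The helper names are hypothetical, and only the first 30 bases of the gene are used here; the same functions could be applied to the full sequence, for example to compare against the GC content row.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate complete codons from the first base, stopping at a stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
PREFIX = "ATGAGTGCCG CGCCGGGCCG ACCGCCGCTC"
print(translate(PREFIX))    # MSAAPGRPPL
print(gc_fraction(PREFIX))  # GC fraction of this prefix only, not the whole gene
```

The translated prefix matches the first ten residues of the protein sequence listed above.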