Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A1163 |
Symbol | |
ID | 4906110 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | + |
Start bp | 1110583 |
End bp | 1111923 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640144269 |
Product | major facilitator family transporter |
Protein accession | YP_001075198 |
Protein GI | 126457457 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.151971 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGCGTC GCGCCGATGC GTCGGCTTCG GCGTATCGCT TCGAGACACT GCTTCGAGAC ACCGCTTCGG CGCATCGCCT GCGCGCATCG CACGCGTGTC CTTCCTCCGT CCCGTTCACC CAGAGTCTGC CCATGCCCGC ACGTTCCGCT TCCTCGCCCC GTCCGCCGAT TCCGCGCACC GTATGGGCGC TCGGCTTCGT CAGCCTGTGC ATGGATGTGT CGTCGGAGCT GATCCACGCG CTGCTGCCGA TCTATCTCGT GACGACGATG GGCATGAGCG TCGCGGCGCT CGGCGTGCTC GAAGGCGCGG CCGAGGCGAC CGCGATGATC GTCAAGATCT TCTCCGGCGC GCTCAGCGAT TGGCTGGGCC GGCGCAAGGC GCTGCTGCTG CTCGGCTACG GGCTCGCCGC GCTGACGAAG CCGCTCTTTC CGCTCGCGGC AGGGCCGGCG ACGGTCGCCG CCGCGCGGCT GCTCGATCGC GTCGGCAAGG GCATTCGCGG CGCGCCGCGC GATGCGCTCG TCGCCGATGT CGCGCCGCCC GAGATCCGCG GCGCGTGCTT CGGGCTGCGC CAGTCGATGG ACACCGTCGG CGCGTTCGCG GGGCCGCTGC TCGCGATCGC GCTGATGCTC GCGTTCGCCG ATCACATCCG CGCGGTGCTG TGGTTCGCGG TCGTGCCGGC GTTCGCCGCG GTCGCGCTGA TCCTGTTCGG CGTCGAAGAG CCCGCGTCCG CGCCCGCCGC CGCGCGGGCG TTCCGCTCGC CGCTGCACTG GCGCGCGCTG CGCGCGTTTT CCGGTCGCTA CTGGTTCGTC GTGCTGATCG GCACCGCGTT CACGCTCGCG CGCTTCAGCG AGGCGTTCCT CGTGTTGCGC GCGCAGCAGG TGGGGCTCGA CATCGCATGG ATCCCGGCCG TGATGGTCGT GATGAGCATA GCGTACGCGG CGTCCGCGTA TCCGGTCGGC ATCGTGTCCG ACAAGTTCGG CGCGCGCGCG CCGCTCGCGG CCGGCATGCT GCTGCTGATC GCGGCCGATC TGCTGCTGGG CGCGAGCGCG TCGCGCACGG CGCTGTTCGC GGGCGTCGCC GTTTGGGGGC TGCACATGGG TTTCACGCAG GGCATGCTCG CCGCGCTCGT CGCGCAAACC GCGCCGGCCG CGCTGCGCGG CACCGCGTTC GGCGTGTTCA ATCTCGCGGG CGGGATCGCG ATGCTCGCGG CGAGCGCGCT CGCCGGCTGG CTGTGGGAAC ACCACGGCGC GCCGACGACG TTCTTCACCG GCGCGGCGCT CGCGGCCGTC GCACTCGCGA TGTGCGGATT CGTTCGGCGG CGCCCGGGGC CTGCGGCATG A
|
Protein sequence | MPRRADASAS AYRFETLLRD TASAHRLRAS HACPSSVPFT QSLPMPARSA SSPRPPIPRT VWALGFVSLC MDVSSELIHA LLPIYLVTTM GMSVAALGVL EGAAEATAMI VKIFSGALSD WLGRRKALLL LGYGLAALTK PLFPLAAGPA TVAAARLLDR VGKGIRGAPR DALVADVAPP EIRGACFGLR QSMDTVGAFA GPLLAIALML AFADHIRAVL WFAVVPAFAA VALILFGVEE PASAPAAARA FRSPLHWRAL RAFSGRYWFV VLIGTAFTLA RFSEAFLVLR AQQVGLDIAW IPAVMVVMSI AYAASAYPVG IVSDKFGARA PLAAGMLLLI AADLLLGASA SRTALFAGVA VWGLHMGFTQ GMLAALVAQT APAALRGTAF GVFNLAGGIA MLAASALAGW LWEHHGAPTT FFTGAALAAV ALAMCGFVRR RPGPAA
|
| |