Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_2398 |
Symbol | |
ID | 4884086 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | + |
Start bp | 2365042 |
End bp | 2366262 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640128326 |
Product | major facilitator family transporter |
Protein accession | YP_001059430 |
Protein GI | 126438519 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCAAA CCGCGCTGCC GCGCGGCACG GTTGCGCTGT TCGCCGGTGC GAGCGGCTTG AGCGTGGCGA ACGTCTACTA TGCGCAGCCG CTTCTCGACG CGCTTGCCGC GGATTTCACG ATCGGCCGCG CGGCGATCGG CGGCGTCGTG ACCGCCACGC AAATCGGCTG CGCACTTGCG CTGCTGTTGC TCGTGCCGCT CGGCGACCTC GTCGACCGCC GCCGGCTGAT GCTCGTGCAA TCGCTCGCGC TCGCGGCGAC GTTGATCGCC GTCGGCTTCG CGTCGACCAG CGCCGTGCTG ATCGCCGGCA TGCTTGGCAC AGGGCTGCTC GGCACGGCGA TGACCCAGGG GCTCGTATCG TACGCGGCGA GCGCCTCGGC CTCGCACGAG CGCGGGCGCG TGGTCGGCGC AGCGCAAGGC GGCGTCGTGA TCGGGCTGTT GCTCGCGCGC GTGCTGGCGG GCTTCGTCGG CGACGTGGCG GGATGGCGCG GCGTCTATTT CCTGTCGGCG GCGACGATGC TCGCGCTCGC GGCGCTGCTC GCGCGCAAGC TGCCCGCGCT CGCGCCGGCA TCGCCGCGCA TCGGCTATCC GCGGCTGATC GCATCGCTGT TCGGCCTGCT GCGCGACGAG CACGTCTTGC AGATCCGCGG GATGCTCGCG ATGCTGATGT TCGCCGCGTT CAACATTTTC TGGAGTGCGC TCGCGCTGCC GCTCAGCGCG CCGCCCTATA CGCTTTCGCA CACCGCGATC GGCGCATTCG GGCTCGTCGG CGCATTGGGC GCGTTCGCCG CCGCGCGCGC CGGGCATTGG GCCGATCGCG GCTTCGGACA ACCGACGAGC GCCGCGGCGC TCGCGCTGCT GCTCGCATCG TGGCTGCCGC TCGCCTTCAT GCCGATGTCG CTATGGGCGC TCGTGCTCGG CATCGTGATG CTCGATGTCG GCGGACAGGC GATTCACGTG ACGAATCAGA GCATGATCTT CCGCTCGCGG CCGGATGCGC ACAGCCGGCT CATCGCCGCC TACATGCTGT TCTATTCGGT CGGCAGCGGG CTCGGCGCGA TCGCGTCGAC GGCCGTCTAC GCAACGCACG GATGGCGCGG CGTCTGCATG CTGGGCGCGG CCGTCAGCGC GGCGGCGCTC ATATTCTGGG CGGCCACGGT GCGGCCGACG CCGAACGAAG CCGCGTCGGC GCATACGGCA AACGGGCGGC TCCGGCGGTG A
|
Protein sequence | MTQTALPRGT VALFAGASGL SVANVYYAQP LLDALAADFT IGRAAIGGVV TATQIGCALA LLLLVPLGDL VDRRRLMLVQ SLALAATLIA VGFASTSAVL IAGMLGTGLL GTAMTQGLVS YAASASASHE RGRVVGAAQG GVVIGLLLAR VLAGFVGDVA GWRGVYFLSA ATMLALAALL ARKLPALAPA SPRIGYPRLI ASLFGLLRDE HVLQIRGMLA MLMFAAFNIF WSALALPLSA PPYTLSHTAI GAFGLVGALG AFAAARAGHW ADRGFGQPTS AAALALLLAS WLPLAFMPMS LWALVLGIVM LDVGGQAIHV TNQSMIFRSR PDAHSRLIAA YMLFYSVGSG LGAIASTAVY ATHGWRGVCM LGAAVSAAAL IFWAATVRPT PNEAASAHTA NGRLRR
|
| |