Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A2070 |
Symbol | |
ID | 4887998 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 2008056 |
End bp | 2009474 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640132008 |
Product | major facilitator transporter |
Protein accession | YP_001063065 |
Protein GI | 126444242 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.376866 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTTCAA CGCAAAGCGC GCCGCGCGCA ACGGCGCCGG CAACGACGAT CGACGCCGGC GTCATCTCGG CGCGCCTCGA TCGCCTGCCG CCCACGCGCA GCGTCTGGAA ACTCGTCGCG CTGCTGAGTC TCGGCTTCTT CTTCGAGCTC TACGATCTGC TGTACAGCGG CTACGTCGCG CCCGGCCTCG TGAAGGGCGG CATCCTGAGC GCGACGACGC GCGGGCTGTT CGGCACGACG GGCGTCGCGA GCTTCATCGC CGCGCTGTTC GCGGGGCTCT TCATCGGCAC GATCGCGTGC GGCTTTCTCG CCGACCGCTT CGGCCGCCGC GCGGTGTTTA CGTGGTCGCT GCTGTGGTAC ACGGCCGCGA ACCTCGTGAT GGCGTTCCAG GATACCGCCG GGGGCCTCAA TTTCTGGCGC TTCGTCGTCG GGCTGGGGCT CGGCGTCGAA ATGGTGACGA TCGGCACATA TATCTCGGAG TTGGTCCCGA AACAGATTCG CGGCCGCGCG TTCGCGTGCG AGCAGGCGGT CGGCTTCGTC GCGGTGCCCG TGGTGGCGCT GCTCGCGTAT CTGCTGGTGC CGCATGCGCC GTTCGGCCTC GACGGCTGGC GCTGGGTCGT GCTGATCGGC GCGCACGGCG CGATCTTCGT CTGGTGGATT CGCCGCCAGT TGCCGGAAAG CCCGCGCTGG CTCGCGCAGC AGGGCCGGCT TGACGAAGCC GAGCGCGTGC TCGCCGCGCT CGAGGCGAAG GTCGAGGCCG AGTACGGCCG GCCGCTGCCG CCGCCCGCGC CCGCCGAGCC CATCGCGCCG CGCGGCCGGT TCGCCGACAT GTGGGTGCCG CCGTACCGCA GGCGCACGGT GATGATGACG ATCTTCAACG TGTTTCAGAC GGTGGGCTTC TACGGCTTCG CGAACTGGGT GCCGACGCTG CTGATCAAGC AGGGGATCAC CGTCACGACG AGCCTCATGT ATTCGAGCGT GATCGCGCTC GCCGCGCCGA TCGGGCCGTT GATCGGCCTT GCGATCGCCG ACCGCTTCGA GCGCAAGACG GTGATCGTCG CGATGGCGGG CGCGGCGATG ATCGCGGGGC TGCTGTTCAG CCACGCGTCG GCCGCGTGGC TGCTCGTCGC GCTCGGCGTA TGCCTCACGC TCGCGAACAA CATCATGTCG TACAGCTTCC ATGCCTATCA GGCCGAGCTG TTTCCGACCG CGATCCGCGC GCGCGCGGTC GGCTTCGTCT ATTCGTGGAG CCGCTTTTCG GCGATCTTCA CGTCGTTCGC GATCGCGGCC GTGCTGAAGG GATTCGGCAC GCCCGGCGTG TTCGTGTTCA TCGCGGGCGC GATGGCGATC GTGATGGCGT CGATCGGGCT GATGGGGCCG CGCACGAAAG GCGTCGCGCT CGAAGCGATA TCGCGTTGA
|
Protein sequence | MASTQSAPRA TAPATTIDAG VISARLDRLP PTRSVWKLVA LLSLGFFFEL YDLLYSGYVA PGLVKGGILS ATTRGLFGTT GVASFIAALF AGLFIGTIAC GFLADRFGRR AVFTWSLLWY TAANLVMAFQ DTAGGLNFWR FVVGLGLGVE MVTIGTYISE LVPKQIRGRA FACEQAVGFV AVPVVALLAY LLVPHAPFGL DGWRWVVLIG AHGAIFVWWI RRQLPESPRW LAQQGRLDEA ERVLAALEAK VEAEYGRPLP PPAPAEPIAP RGRFADMWVP PYRRRTVMMT IFNVFQTVGF YGFANWVPTL LIKQGITVTT SLMYSSVIAL AAPIGPLIGL AIADRFERKT VIVAMAGAAM IAGLLFSHAS AAWLLVALGV CLTLANNIMS YSFHAYQAEL FPTAIRARAV GFVYSWSRFS AIFTSFAIAA VLKGFGTPGV FVFIAGAMAI VMASIGLMGP RTKGVALEAI SR
|
| |