Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_0078 |
Symbol | |
ID | 5707083 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 91940 |
End bp | 93550 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641269604 |
Product | major facilitator transporter |
Protein accession | YP_001535004 |
Protein GI | 159035751 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00711] drug resistance transporter, EmrB/QacA subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0195486 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCACGACA CACACCCGGG CGACATGGGC CAGCGAGCGA CCCGCCGGAG CTGGTTCGGA CTCGCGGTGC TCGGGCTGCC CACCCTCCTG CTGTCGCTCG ACCAGAGCGT GCTCTACCTC GCGCTGCCGC ACCTGAGCGC CGATCTGGGG GCGGGCAGCA TCGAGTCGCT CTGGATTCTG GACATCTACG GCTTCATGCT CGCGGGCTTC CTGGTCACCA TGGGGACTCT CGGTGACCGG ATCGGCCGGC GCAAACTGCT GCTCATCGGG GCTGCTGCCT TCGGCGTCAG CACGGTCGCG GCGGCGTACT CGGACAGCGC ACAGATGTTG ATCGTGACCC GCGCGGCGAT GGGTGTGGCA GGCGCGACCC TGATGCCGTC CACACTGGCG CTGATCAGCA ATGTGTTCCA CGACGCCAAG CAGCGCGGCG TGGCCATCGC GGTGTGGTTC AGCTGCCTCA TGGTCGGCGG CGCGCTCGGT CCGGTGGTCG GAGGCGCCCT GCTGGAGCAC TTCTGGTGGG GTTCGGTCCT CCTGCTGGGC GCGCCGATCA TGATGCTGCT GCTGGTTCTC GGGCCCGTAC TGCTACCCGA GTACCGCGAC CCCTCGGCCG GACGAATCGA CCTACTCAGC GTACTGCTCT CCCTACTGAC GGTCCTACCG ATCATCTACG GGATCAAGGA ACTCGCCTAC GACGGCTGGA CGGCGGAACC GCTGGTTGTC ATGTCGGCCG GCGTGGTGTT CGGCGCGGTC TTCGTCACCC GTCAGCACCG GCTGGCGGAA CCGCTGGTCG ACATCCGGCT CTTCCGGACC CGCGCGTTCA GCGCCGCGTT GGTGATCCTG CTGTTCGGCT CGGTCACCAC CGGCGGGATC TACCTGCTGG TCAACCTGTA CCTACAGATG GTCGAAGGGC TCTCGCCCCT GCGGACCGGA CTCTGGCTGC TGCCGTCCAC ACTGGCCATC GTCGTGGGCT CGATGACGGC GCCGGCGCTG GCGCCGAGGG TACGGCCGGC GTACCTCATC TCGAGCGGGT TGGCGGTGAC CACCTTCGGT TACCTACTAC TTACCCAGGT AGACCCGACA GGTGGGCTGC CACTGCTGGT AACTGGTTTC GTGTTGGCGT TCCTGGGCGC CGGTCCGATG GGCGCGCTCG GCACCGACCT GGTCGTCGGA TCCGCGTCGC CGGAGCAGGC CGGTTCTGCG GCATCCCTGT CGGAGACCGG CAACCACCTC GGTATCGCGA TCGGAATCGC GGTGATGGGC AGTATCGGGA CCACCGTCTA CCGGGACCGG ATCGACACCA CCGTGCCGGA CGGAATCGCC GCCGACGCTG CCGAAGCGGC ACGGGAGAAC GTCACCGGGG CGGTCACCGC GGCGGAAGGG CTGCCCGCCG GACCGGCGGC ACAGCTCCTC GATGCCGCGG CGACCGCCTT CACACACGGC CTGAACACCG CCGCATACAT TGGCGCCGGG TTGTTTCTCA CGCTCGCCAT CGTCGCGGCG GTATCGCTGC GCGAGACCCG GATGCCGGCA GGCGAAACGG CGAGCGGCGT CGCCGACGAG TCCGCCCCGA CCGACACTGC CGCTCGGGCC GCCCAGAAGC CGACCGAGTG A
|
Protein sequence | MHDTHPGDMG QRATRRSWFG LAVLGLPTLL LSLDQSVLYL ALPHLSADLG AGSIESLWIL DIYGFMLAGF LVTMGTLGDR IGRRKLLLIG AAAFGVSTVA AAYSDSAQML IVTRAAMGVA GATLMPSTLA LISNVFHDAK QRGVAIAVWF SCLMVGGALG PVVGGALLEH FWWGSVLLLG APIMMLLLVL GPVLLPEYRD PSAGRIDLLS VLLSLLTVLP IIYGIKELAY DGWTAEPLVV MSAGVVFGAV FVTRQHRLAE PLVDIRLFRT RAFSAALVIL LFGSVTTGGI YLLVNLYLQM VEGLSPLRTG LWLLPSTLAI VVGSMTAPAL APRVRPAYLI SSGLAVTTFG YLLLTQVDPT GGLPLLVTGF VLAFLGAGPM GALGTDLVVG SASPEQAGSA ASLSETGNHL GIAIGIAVMG SIGTTVYRDR IDTTVPDGIA ADAAEAAREN VTGAVTAAEG LPAGPAAQLL DAAATAFTHG LNTAAYIGAG LFLTLAIVAA VSLRETRMPA GETASGVADE SAPTDTAARA AQKPTE
|
| |