Gene Sare_0078 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0078 
Symbol 
ID5707083 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp91940 
End bp93550 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content69% 
IMG OID641269604 
Productmajor facilitator transporter 
Protein accessionYP_001535004 
Protein GI159035751 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0195486 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCACGACA CACACCCGGG CGACATGGGC CAGCGAGCGA CCCGCCGGAG CTGGTTCGGA 
CTCGCGGTGC TCGGGCTGCC CACCCTCCTG CTGTCGCTCG ACCAGAGCGT GCTCTACCTC
GCGCTGCCGC ACCTGAGCGC CGATCTGGGG GCGGGCAGCA TCGAGTCGCT CTGGATTCTG
GACATCTACG GCTTCATGCT CGCGGGCTTC CTGGTCACCA TGGGGACTCT CGGTGACCGG
ATCGGCCGGC GCAAACTGCT GCTCATCGGG GCTGCTGCCT TCGGCGTCAG CACGGTCGCG
GCGGCGTACT CGGACAGCGC ACAGATGTTG ATCGTGACCC GCGCGGCGAT GGGTGTGGCA
GGCGCGACCC TGATGCCGTC CACACTGGCG CTGATCAGCA ATGTGTTCCA CGACGCCAAG
CAGCGCGGCG TGGCCATCGC GGTGTGGTTC AGCTGCCTCA TGGTCGGCGG CGCGCTCGGT
CCGGTGGTCG GAGGCGCCCT GCTGGAGCAC TTCTGGTGGG GTTCGGTCCT CCTGCTGGGC
GCGCCGATCA TGATGCTGCT GCTGGTTCTC GGGCCCGTAC TGCTACCCGA GTACCGCGAC
CCCTCGGCCG GACGAATCGA CCTACTCAGC GTACTGCTCT CCCTACTGAC GGTCCTACCG
ATCATCTACG GGATCAAGGA ACTCGCCTAC GACGGCTGGA CGGCGGAACC GCTGGTTGTC
ATGTCGGCCG GCGTGGTGTT CGGCGCGGTC TTCGTCACCC GTCAGCACCG GCTGGCGGAA
CCGCTGGTCG ACATCCGGCT CTTCCGGACC CGCGCGTTCA GCGCCGCGTT GGTGATCCTG
CTGTTCGGCT CGGTCACCAC CGGCGGGATC TACCTGCTGG TCAACCTGTA CCTACAGATG
GTCGAAGGGC TCTCGCCCCT GCGGACCGGA CTCTGGCTGC TGCCGTCCAC ACTGGCCATC
GTCGTGGGCT CGATGACGGC GCCGGCGCTG GCGCCGAGGG TACGGCCGGC GTACCTCATC
TCGAGCGGGT TGGCGGTGAC CACCTTCGGT TACCTACTAC TTACCCAGGT AGACCCGACA
GGTGGGCTGC CACTGCTGGT AACTGGTTTC GTGTTGGCGT TCCTGGGCGC CGGTCCGATG
GGCGCGCTCG GCACCGACCT GGTCGTCGGA TCCGCGTCGC CGGAGCAGGC CGGTTCTGCG
GCATCCCTGT CGGAGACCGG CAACCACCTC GGTATCGCGA TCGGAATCGC GGTGATGGGC
AGTATCGGGA CCACCGTCTA CCGGGACCGG ATCGACACCA CCGTGCCGGA CGGAATCGCC
GCCGACGCTG CCGAAGCGGC ACGGGAGAAC GTCACCGGGG CGGTCACCGC GGCGGAAGGG
CTGCCCGCCG GACCGGCGGC ACAGCTCCTC GATGCCGCGG CGACCGCCTT CACACACGGC
CTGAACACCG CCGCATACAT TGGCGCCGGG TTGTTTCTCA CGCTCGCCAT CGTCGCGGCG
GTATCGCTGC GCGAGACCCG GATGCCGGCA GGCGAAACGG CGAGCGGCGT CGCCGACGAG
TCCGCCCCGA CCGACACTGC CGCTCGGGCC GCCCAGAAGC CGACCGAGTG A
 
Protein sequence
MHDTHPGDMG QRATRRSWFG LAVLGLPTLL LSLDQSVLYL ALPHLSADLG AGSIESLWIL 
DIYGFMLAGF LVTMGTLGDR IGRRKLLLIG AAAFGVSTVA AAYSDSAQML IVTRAAMGVA
GATLMPSTLA LISNVFHDAK QRGVAIAVWF SCLMVGGALG PVVGGALLEH FWWGSVLLLG
APIMMLLLVL GPVLLPEYRD PSAGRIDLLS VLLSLLTVLP IIYGIKELAY DGWTAEPLVV
MSAGVVFGAV FVTRQHRLAE PLVDIRLFRT RAFSAALVIL LFGSVTTGGI YLLVNLYLQM
VEGLSPLRTG LWLLPSTLAI VVGSMTAPAL APRVRPAYLI SSGLAVTTFG YLLLTQVDPT
GGLPLLVTGF VLAFLGAGPM GALGTDLVVG SASPEQAGSA ASLSETGNHL GIAIGIAVMG
SIGTTVYRDR IDTTVPDGIA ADAAEAAREN VTGAVTAAEG LPAGPAAQLL DAAATAFTHG
LNTAAYIGAG LFLTLAIVAA VSLRETRMPA GETASGVADE SAPTDTAARA AQKPTE