Gene Sare_0417 details

Gene Information

Locus tag: Sare_0417
Symbol:
ID: 5708231
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 477160
End bp: 478422
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 70%
IMG OID: 641269942
Product: major facilitator transporter
Protein accession: YP_001535337
Protein GI: 159036084
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
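
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed values) and that the last codon of the CDS is a stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 477160
end_bp = 478422

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the final codon is assumed
# to be a stop codon and therefore does not add an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1263 bp
print(protein_length, "aa")  # 420 aa
```

Both results agree with the Gene Length and Protein Length fields listed in the record.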


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00934872
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGAGCAGTG TCGTCGAGGC GTTCGTGCCG GCCCGGCTTG GCAACAGGTT CCGTTGGTTG 
CTGGCCTCGT CGTGGGTGAC GAACCTGAGC GACGGAATCG CGGTGGCGGC CGGGCCGCTA
CTGGTGGCGT CGCTGACCAC CAACCCGATC CTGGTCTCGC TGGCAGCCCT GCTGCGCTGG
GCACCTCCGT TGGTGTTCGG CCTGTGGGCC GGCGTGCTCT CCGATCGGCT CGACCGGCGG
CGCATCGTGT TGGTGGCCAA CACAGTCCGA CTCGTCACCC TGGTCGTGCT GGCCGGGGCA
TTGGTGACCG ACCGGGTGTC GGTGTCGGCG GTGCTGCTGA TGCTGGCGCT GCTCGCCACC
GCCGAGGTGT TTGCGGACAA CACGACGGGC ACGCTGACGC CGATGCTGGT GCGTCGGGAG
GATCTGGCGC TCGCCAACGC CCGCGTCCTG GCCGGGTTCA TCACGCTGAA CACACTGGCC
GGGCCGGCAG TTGGGGCGGC ACTCTTCGCG GCCGGGCGGT CCTTGCCGTT CGCCACCAAC
GCGGTTCTCA TCGCGGCCGG GCTGGTGCTG GTGTCCCGGC TGTCCCTGCC GCCACGCGAG
CCGGCAGCGG AGAACCGCGG CGTCCGGCGG GACATCGTGG CGGGTATCCG ATGGACCGTC
CGTCATCCGG CCGTTCGGAC GCTCTGCCTG ACCACCCTGG TCTTCAACAT CACGTACGGT
GCCGCGTGGT CGATCCTGGT GCTCTACGCC ACCGAGCGGC TCGGTCTGGG CGCTGTCGGC
TTCGGCCTGA TCAGCACGGC GACGGCGGTC GGCGGCCTGC TCGCCACCGT CGGCTACGGG
TGGCTCACCC GTCGGATGAG CCTCGGGGGG ATCATGCGGG CCGGCCTGGT CATCGAGACT
CTCACCCACT TCGGTCTCGC CGTCACCACC GCACCCTGGG TCGCCTCGGC CATTCTCTTC
GTCTTCGGGG CGCACGCCTT CGCCTGGGGC ACCACCTCGA TGACGATCCG TCAGCGGGCG
GTTCCGGCCC ACCTCCTGGG CCGGGTCAAC AGCATCCACA CCATCAGTGC GTACGGTGGG
CTGGTCATCG GCTCGGCAAT CGGTGGCCCG CTGGTTGCCC TCCTGGGTGT GACCAGCCCG
TTCTGGTTCG CTTTCGCTGG TTCGGCCGTC CTCGTCGTGC TGCTGTGGCG CGAGTTCGCC
CACATCGCAC ACACCGACGA TCCCGCTCCG ACCCCGGCCC CGGCCGGTTC AACGGCGGCG
TGA
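
The GC content field in the record can be checked against the gene sequence itself. This is a minimal sketch, assuming GC content is defined as the fraction of G and C bases over the full sequence length. The function name `gc_content` is hypothetical. It ignores the spaces and line breaks used to group the sequence into blocks of ten.

```python
def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string.

    Whitespace (the 10-base grouping and line breaks) is discarded first.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Small illustrative input: 4 of 8 bases are G or C.
print(gc_content("GGCC ATAT"))  # 0.5
```

Passing the full gene sequence to the function gives a fraction that can be compared with the percentage listed in the record.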
 
Protein sequence
MSSVVEAFVP ARLGNRFRWL LASSWVTNLS DGIAVAAGPL LVASLTTNPI LVSLAALLRW 
APPLVFGLWA GVLSDRLDRR RIVLVANTVR LVTLVVLAGA LVTDRVSVSA VLLMLALLAT
AEVFADNTTG TLTPMLVRRE DLALANARVL AGFITLNTLA GPAVGAALFA AGRSLPFATN
AVLIAAGLVL VSRLSLPPRE PAAENRGVRR DIVAGIRWTV RHPAVRTLCL TTLVFNITYG
AAWSILVLYA TERLGLGAVG FGLISTATAV GGLLATVGYG WLTRRMSLGG IMRAGLVIET
LTHFGLAVTT APWVASAILF VFGAHAFAWG TTSMTIRQRA VPAHLLGRVN SIHTISAYGG
LVIGSAIGGP LVALLGVTSP FWFAFAGSAV LVVLLWREFA HIAHTDDPAP TPAPAGSTAA
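
The protein sequence can be derived from the gene sequence using the translation table named in the record (table 11). The sketch below assumes the NCBI ordering of codons (bases in the order T, C, A, G) and the table 11 set of alternative start codons. It also assumes that an alternative start codon in the first position is read as methionine, which is why the GTG at the start of this gene corresponds to M in the protein. The names `translate_cds`, `CODON_TABLE`, and `START_CODONS` are illustrative.

```python
from itertools import product

# Amino acids for the 64 codons in NCBI order (T, C, A, G at each position).
# Table 11 uses the same codon-to-amino-acid mapping as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Start codons allowed by table 11; in first position each is read as M.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, dropping one trailing stop codon."""
    seq = "".join(dna.split()).upper()  # remove block spacing and newlines
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    residues = [CODON_TABLE[c] for c in codons]
    if codons[0] in START_CODONS:
        residues[0] = "M"
    protein = "".join(residues)
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases of the gene sequence listed above.
print(translate_cds("GTGAGCAGTG TCGTCGAGGC GTTCGTGCCG"))  # MSSVVEAFVP
```

The ten residues produced from the first 30 bases match the first block of the protein sequence. The same function can be applied to the full gene sequence to compare the result with the complete protein listing.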