Gene Sare_2475 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2475 
Symbol 
ID5706080 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2831035 
End bp2832441 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content70% 
IMG OID641271940 
Productmajor facilitator transporter 
Protein accessionYP_001537310 
Protein GI159038057 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAACTCG TGGTGTTTCT TGAGGTGTCG ATCGTCAACG TCGCCCTGCC GGCGATGGGT 
GCAGCGCTGG CGTTGACCGA GTCCGGGCTG ACCTGGGTGG TCAACGCCTA CCAACTCACC
TTCGGCGGCT TGCAGCTGGT CGCCGGTAGG GCCGCGGACG TGGTCGGCCG GCGCCGCATG
TTCCAGGCCG GAATCGCGCT GTTCACCGGC GCGTCGCTAC TGGCGGGCCT CGCACCCGAC
GCGAGCACGC TGCTAGTGGG TCGAGCGATA CAGGGCGTTG GTGCGGCGAT CGTGGTCCCC
GCCGAGCTCG CGTTGATCGC CGCGATCTTC ACCGAACCCG CTGCCTACCG TCGGGCGTTC
GCGGTATGGA GCGCCATGGC CGCCGCCGGT GCGGGCTCGG GTGTGGCGTT GGGCGGCGTC
CTGACCCAGA CACTTGGTTG GCCCTGGATC TTTCTGATCA ACGTCCCGAT CGGTGTGGTC
GCCCTTGTCC TGAGCCCGCG CCTGCTCCCC GCCGACCCGC CGTGGGCCGG GCGTGCCAGC
GGCGCTCGGA CGCAGCTGGA CCTGATAGGT GGACTCACCG TCACCGGATC CGTGCTCGCC
ATGGTCCTGC TCGCCACCGA GTTGCCCACC TCCGGGTGGA CGCCGCTCAC CTACACGGCA
GCCGTCGCCA CCCTCGGGCT CGGTACGGTG TTCGCGGTCA ACCAGCGGCG CCATCCGGCC
CCGATCCTGC CGCCGCAACT GCTGCGGATA CGCCAGGTAC GGGCCGGCGT CGCGGCCAAC
GCTCTCGTGG GCGCCTCCCA CGTGCCAGCG TTCGTGCTGC TGTCGCTCAT GCTCCAGCAG
GCCATGGGAT ACTCCGCCCT CGCCGCCGGA TTCGCCGTAC TGCCTATCGC CGCGGTCAAC
ATGGTCACCG CGCGTACCGC GCTGCCCTGG GCGATCGGCC GCTTCGGGTC ACGCGTAGTT
CTCGCGGCCG GGATGTTCCT GGTCGCGCTC GGCCTGGCCG GATACGCGAT CCTGCTGCAC
CCCGGCGCCG GCTACCTGAC CGCCGTGTTG CCGGCCAGCC TGATCTTCGC AGCCGGCCTG
CCTGCGGTGT TCGTCGGTTC CACCGCCCCC GCCGTACGCA GCGCACCTGA AAATCAGCAA
GGAGCCGCCT CTGGCCTGGT CAACACCGCC CAACGCCTCG GCGCCGCCCT GGGCCTCACC
GCCTTACTGA TGGCCTCGGC AGCATGGACC GACAACCGCG GCAGCGGCGA CAGGGCCACC
GCGCTCGCCG ATGGTCTACG CGTGGGGTTT GCCGGCGCGG CCGCCATCGC CGTCCTGGGA
ATCGTCTGCG CGCTGTCCAC GGGCACCCGA GTACCGAAGA AGCCGGCGAG CGGCGATACA
CCACCACAGG AGGCGGTCGC CCGATGA
 
Protein sequence
MELVVFLEVS IVNVALPAMG AALALTESGL TWVVNAYQLT FGGLQLVAGR AADVVGRRRM 
FQAGIALFTG ASLLAGLAPD ASTLLVGRAI QGVGAAIVVP AELALIAAIF TEPAAYRRAF
AVWSAMAAAG AGSGVALGGV LTQTLGWPWI FLINVPIGVV ALVLSPRLLP ADPPWAGRAS
GARTQLDLIG GLTVTGSVLA MVLLATELPT SGWTPLTYTA AVATLGLGTV FAVNQRRHPA
PILPPQLLRI RQVRAGVAAN ALVGASHVPA FVLLSLMLQQ AMGYSALAAG FAVLPIAAVN
MVTARTALPW AIGRFGSRVV LAAGMFLVAL GLAGYAILLH PGAGYLTAVL PASLIFAAGL
PAVFVGSTAP AVRSAPENQQ GAASGLVNTA QRLGAALGLT ALLMASAAWT DNRGSGDRAT
ALADGLRVGF AGAAAIAVLG IVCALSTGTR VPKKPASGDT PPQEAVAR