Gene Sare_4030 details

Gene Information

Locus tag: Sare_4030
Symbol:
ID: 5706434
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4583214
End bp: 4584455
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 69%
IMG OID: 641273455
Product: major facilitator transporter
Protein accession: YP_001538811
Protein GI: 159039558
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
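
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: it assumes the Start bp and End bp values are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the record above.
START_BP = 4583214
END_BP = 4584455
GENE_LENGTH_BP = 1242
PROTEIN_LENGTH_AA = 413


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one stop codon; assumes an unspliced CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1242
print(expected_protein_length(GENE_LENGTH_BP))  # 413
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields listed in the record.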


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00161616
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGCCGTCCA TCCTCGCGGT CCTGCGTCGC AACCGAAACT TTCGCATGCT GCTCCTGGCC 
GAGCTGATGG TCTTCGGCGT CGACTGGTTC GTCATGGTGC CGCTGCTGGT GCTCCTGCCG
GCACTGACCG GCAGCGGCGT GTGGGGTGCG CTGGTCCTCG CCATGGACAC CGGAGTCGTG
GCGCTGCTGC TGCCGTACAC CGGGGCGGTG GCGGACCGGT TCGATCGCCG CCGGATCATG
ATCGGCGCCA ACATCGCCGC ACTGGTCGGC GTACTGCTGC TGCTGGGTGT GCGCGATGCC
GGCACGGCCT GGCTGGCCCT GGTCGGGATC GGGGTGGTGG CGGTGGCCAA GGCGTTCTAC
TCTCCGGCCG CGCAGGCCGC CTTGCCGAAC GTGCTCGACC CAGATGAGTT GGCCGCGGGT
AATGCGGTCG CAGGTTCGGC ATGGGGCACG ATGACGATCG TCGGGGCGTC GCTGGGGGGT
GTCCTGAGCA GCGCAGCTGG GCCATACGTC GCCTTCTGGG CGGCCGCTGG CGGCCTGGTT
CTGGCCGGGG TCCTGGCGGG GCTGATCCGT CGGCCGTTGC AGGCCCCACG GGACCAGGAC
CGACCGGTGC AGCAGACCTG GGCGGCCATC CGGGAGGCAC TCGGCTACAT CGGCCACCGG
CCGCGGGTGC TGGCGTTGGT GACCGTGAAG TCGGCGGTCG GCCTCGGCAA CGGCGTGTTG
ACGGTGTTTC CTTTGCTGGC GGTGGCCTAC GGGGTGGGTC CGATCGGCAC CGGGCTGCTC
TTCGGGGTGC GAGGCGCGGG TGCTCTGGTC GGTCCGATCC TGATGCGGCG GGTTCTGGGT
AACCGGTCCT GGCTGCTGCC CGGCCTGGCG GCATCCATGT CGTTGTATGG GCTGGCCTAT
CTGGGCACCT CGGCGGTGAA CTGGTTCCCG CTGGTGCTTG CGTTGGTCTT CGTGGCGCAC
TTCGCCGGGG GTAGTAACTG GGTCATGTCC AACTACGCCC TCCAGGGCGA GGTCCCGGAT
CGGTTACGGG GACGGGTCTT CGCCACCGAC ATGATGCTGG CGACCCTCGC CATCTCGGTG
AGTCAGCTGG TGGTGGCATC GGTGATCGAT GTGGTTGACG CGCGGGTGGT GTTGGCCGGT
GGTGGACTGG TCACCCTGGT CTACGCGGTT GGCTGGCGAA TCGCGACCCG CCGCCTGTCG
TTGACCGACC CGGTCGCGGC GCCGGAGTCG GTCGTTCGCT GA
 
Protein sequence
MPSILAVLRR NRNFRMLLLA ELMVFGVDWF VMVPLLVLLP ALTGSGVWGA LVLAMDTGVV 
ALLLPYTGAV ADRFDRRRIM IGANIAALVG VLLLLGVRDA GTAWLALVGI GVVAVAKAFY
SPAAQAALPN VLDPDELAAG NAVAGSAWGT MTIVGASLGG VLSSAAGPYV AFWAAAGGLV
LAGVLAGLIR RPLQAPRDQD RPVQQTWAAI REALGYIGHR PRVLALVTVK SAVGLGNGVL
TVFPLLAVAY GVGPIGTGLL FGVRGAGALV GPILMRRVLG NRSWLLPGLA ASMSLYGLAY
LGTSAVNWFP LVLALVFVAH FAGGSNWVMS NYALQGEVPD RLRGRVFATD MMLATLAISV
SQLVVASVID VVDARVVLAG GGLVTLVYAV GWRIATRRLS LTDPVAAPES VVR
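
The gene sequence starts with GTG while the protein sequence starts with M, which is consistent with the record's translation table 11, where GTG can act as an initiator codon. The sketch below is an illustration rather than the database's own method: it strips the spacing used in the sequence blocks, computes a GC fraction, and translates a CDS. The codon table is the standard genetic code (table 11 assigns the same amino acids to codons), and the initiator set used here (ATG, GTG, TTG) is a simplifying assumption. All function names are hypothetical.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a reduced set of initiator codons, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the sequence blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        codon = s[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An initiator codon in first position is read as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
first_bases = "GTGCCGTCCA TCCTCGCGGT CCTGCGTCGC"
print(translate(first_bases))  # MPSILAVLRR
```

The printed peptide matches the first ten residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and Protein Length fields.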