Gene Sare_3656 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3656 
Symbol 
ID5704620 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4218808 
End bp4220355 
Gene Length1548 bp 
Protein Length515 aa 
Translation table11 
GC content68% 
IMG OID641273081 
Productmajor facilitator transporter 
Protein accessionYP_001538445 
Protein GI159039192 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00788445 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.022371 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACTCCC GGCCTGGCCG GTCAACGCCG TCCCGACGGG CCGCCTTGGT CGGCCTGTGT 
ACCGCTGCCA CCCTGGTGTG GCTCGCGTTC TCCGACCTGG GTGTGGCGCT ACCGACGATC
GCCACCGAGT TGAGCGTCAA CCTGACCGAC ATGCAGCGGG CGAACAACGC CCTGAGTATC
GCCTGCGGCA CACTCCTACT GGCCGGCGGA CGCCTCACCG ACCTCTACGG CCATCGACGG
ATGCTGCTCC TCGGCCTACT GATCTTCGGT ATTGCCACCT TGGCGACCGC GTTCACACCC
AACCTCGCCG GCCTGGTCGC CGGTCGGGCG ATGATGGGCG TCGGCAGCGC ACTCATCCTG
CCCGCCTCGC TAGCCATGAT CCCGGCGCTG TTCGACCGGG CCAGGCAGCC GTCGGCATTC
GCCGCATGGG CAGCGACCAC CTGGGCGGGG CAGGCGGTCG GGCCCGCCAT CGGCGGAGGA
CTCACCACCC TGTTGGGCTG GCGGTCGCTG TTCTGGCTCA CCGCGCCGGT GGTGCTTGTC
GTGTACGTGA TAACCAGCCG ATACGCACCC AAGGCAAGCA GGCGCCGCGG GCGGGTCGAC
CTGGTCGGGC TGGCCACCGG CGCCGGAGCA GCGCTGTGTC TGCTCTTCGC GTTGACCGAG
AGCCAGCAGG TCGGCTTCAA CGATCCTTTG ATCATCGTTT TGTTCGCCGC GACGCCGGTA
CTCGGCGCAG CGTTCGTGTT CATCGAGACA AAGGCCGCTG ATCCGCTGGT GGATCTGCGG
CTGTTCCGCA CCCGCAGCTT CACTGCCGCT CTCATCGTCA ACCTGGCGAT GAGCATGTCC
TTCGCCGGCA CGCTGTTCGT GCTGTCCCTC TACCTCCAGG ATGTCCGCGG CTACACCGCG
TTCGTGGCCG GCCTGCTGCT GATCCCCGCC GCCGGAACGA TCCTGGGGTT CAACACTGTC
GGAGCGCGGC TGGTCACCCG ACACAACGCC CGCTCCCCCT CGCTCTGGGG CCTCGTCCTG
GTCGGTCTCG GCGGTTTCGC CATCAGCGCC CTGCTGCCCT CCCTGTCCGT CCTGGCAGTG
ATCCTGGGCC TGCTCATCAT CGGCGCCGGG CTGGGCCTGC TGTCCGTGCC CGTGGCCGAC
ACCGATGTCG GAGGTCCACC GGCCTCCCTC GCCGGCGCCG CGTCCGGGGC GTACAGGAGC
AGCAGCATGC TGGGTGGCGC ACTCGGCGTC GTCCTCCTGA CCACGGCGAC AACCCGCTTC
GGCCGCGCCG AGGCCGCACC GGTCAGCACC GCGGCCGGAC TCACCGAAGC GGAATCCAAC
CAGGTCGTCA ACGCACTGAC CAACTCCCAG ACCGCGAGCG CCATCCTCGA CAAACTGCCG
GCAAACGAAC GGTCCCTCGT CGTCGGTGTC TACAACCAGG CGTTCACGGA CGGAGTTTCG
ACCGCCCTCA TCCTCGGTGG TGTGATCGCG GTGGCGGGCA CGGTGCTGGC TGGTTGGATC
TGGCCCCGCA CCCACAGAGC CCGACACACG ACGAACCCCG GACCTTAG
 
Protein sequence
MYSRPGRSTP SRRAALVGLC TAATLVWLAF SDLGVALPTI ATELSVNLTD MQRANNALSI 
ACGTLLLAGG RLTDLYGHRR MLLLGLLIFG IATLATAFTP NLAGLVAGRA MMGVGSALIL
PASLAMIPAL FDRARQPSAF AAWAATTWAG QAVGPAIGGG LTTLLGWRSL FWLTAPVVLV
VYVITSRYAP KASRRRGRVD LVGLATGAGA ALCLLFALTE SQQVGFNDPL IIVLFAATPV
LGAAFVFIET KAADPLVDLR LFRTRSFTAA LIVNLAMSMS FAGTLFVLSL YLQDVRGYTA
FVAGLLLIPA AGTILGFNTV GARLVTRHNA RSPSLWGLVL VGLGGFAISA LLPSLSVLAV
ILGLLIIGAG LGLLSVPVAD TDVGGPPASL AGAASGAYRS SSMLGGALGV VLLTTATTRF
GRAEAAPVST AAGLTEAESN QVVNALTNSQ TASAILDKLP ANERSLVVGV YNQAFTDGVS
TALILGGVIA VAGTVLAGWI WPRTHRARHT TNPGP