Gene Sare_2597 details

Gene Information

Locus tag: Sare_2597
Symbol:
ID: 5707891
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2960986
End bp: 2962515
Gene Length: 1530 bp
Protein Length: 509 aa
Translation table: 11
GC content: 72%
IMG OID: 641272059
Product: major facilitator transporter
Protein accession: YP_001537429
Protein GI: 159038176
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
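
The start, end, gene length, and protein length values above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are hypothetical and used only for illustration.

```python
# Coordinates copied from the record above.
# Assumption: 1-based, inclusive start and end positions.
START_BP = 2960986
END_BP = 2962515


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumption: the last codon is a stop codon, so it is subtracted.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    n_bp = gene_length(START_BP, END_BP)
    print(n_bp)                  # 1530
    print(protein_length(n_bp))  # 509
```

Both results agree with the Gene Length (1530 bp) and Protein Length (509 aa) fields of the record.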


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0332017
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0660253
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTTTGA TCGACAGTCC GCCCAACGAT GCGGGCCGCG GCCTGTCCAC ACCCATGGAC 
CGGGCCGCAG CGGGCCGGGT GCCGCACCGC TGGCTGATTC TCGCGGTGCT CTGCCTGACC
CAGCTCGTCG TCGTGCTGGA CAACACCGTG TTGACGGTGG CGGTTCCGGT ACTCACGGTG
GAACTGAACG CCGGCACCGC CGACGTGCAG TGGATGATCA ACGCGTACGC GTTGGTGCTG
TCCGGGCTGT TACTGAGCGC TGGCAGTGCA GTCGACCGGT ACGGGCGGCG CCGGATGCTG
CTGGTCGGGC TGGTGCTGTT CGGCCTGGGC TCGCTGGCAG CCGGGCTGGC CCGCACCACC
GAGCAGCTGA TCGCCGCCCG GGCTGGCATG GGGGTGGGCG GCGCGCTGCT GGTCACCGCC
ACCCTGGCCG TCGCCATGCA GGTCTTCGAC TCCGGCGAGC GGTCCCGGGC AATCGGCATC
TGGGCGGCGA CCAGCGCGCT GGGCTTCGCC GCTGGGCCGC CGATCGGCGG GGCCGTCCTC
GCCCACCTGC CCTGGGGCGC AATCTTCTTG ATGAACATCC CGATCGTGCT GATCTGCCTC
CTCGCCGGCT GGACGCTGGT CCCGGAGTCA CGGGATCCGT CCGGCGGGCG GCTGGATCTG
GGCGGGGTGG CGCTGTCCAC TGCCGGCCTG ACCGCGATCG TCTGGGCGAT CATCTCCGGT
CCGGAGCTCG GGTGGGCCTC GACGAAGGTG CTTGGCGCGG GTGCCGCCGG CGTGCTGTTG
CTGGTGTCCT TCGTGCGCTG GGAACAGCGG GTCGCCCACC CGATGCTGGA CATGCACTTT
TTCCGGAATC GCCGCTTCGT CGGCGCGGTC TGCGGCGTCG TGCTGATCAC TTTCGGCGCC
ACCGGCGCAC TGTTCCTGCT CACCCAGCAA CTGCAGTTCG TCCGCGGCTT CTCGGCCTGG
GAGGCCGGTC TGCGGATGGT GCCGTTCGCC CTGTCCATCG TGCTGCTCAA CGTGAGCGGC
ATCGCCGCGA TGGTGATCCG CCGGCTTGGG CTACCGGCCG CCATCGCCAC CGGAATGACC
CTGCTGGCCG GTGGCCTGGC GCTGGTCACG CACGTGCGGT CCGAGGGCTA CGGCACGCTG
CTGGCCGGGC TACTCATCAT GGGTGCGGGG TGCGCGCTGG CGAACCCGGC CATCGTCGAG
GCGGTGATGA GCGCGATCCC GCCGAACAAG GCCGGCGCCG GGGCCGGCGT CGACGGCACG
ATGACCGAGG TCGGCGGCAG TCTCGGCATC GCCGTGCTGG GCGCGGTGCT CAACGCCCGG
TTCGCCGCGC TGCTGCCCGC GGCACTGGCC GGCGCCGGCT CGTTCCCCGC GGCCCTGGCC
GCCGCGGGCT CGGACCGGGA GGTGGTCACC GTCGCGTTCG CCGACGCGTT GCGGACCGGT
CAGACAGTGG GGGCGGTGGC GGTCCTGGTC GGTGGTTTCG TCGCCGCCGC GCTGCTGTAC
CGGGCCGACC GCTTGTCCGG GCCCGGCTGA
 
Protein sequence
MSLIDSPPND AGRGLSTPMD RAAAGRVPHR WLILAVLCLT QLVVVLDNTV LTVAVPVLTV 
ELNAGTADVQ WMINAYALVL SGLLLSAGSA VDRYGRRRML LVGLVLFGLG SLAAGLARTT
EQLIAARAGM GVGGALLVTA TLAVAMQVFD SGERSRAIGI WAATSALGFA AGPPIGGAVL
AHLPWGAIFL MNIPIVLICL LAGWTLVPES RDPSGGRLDL GGVALSTAGL TAIVWAIISG
PELGWASTKV LGAGAAGVLL LVSFVRWEQR VAHPMLDMHF FRNRRFVGAV CGVVLITFGA
TGALFLLTQQ LQFVRGFSAW EAGLRMVPFA LSIVLLNVSG IAAMVIRRLG LPAAIATGMT
LLAGGLALVT HVRSEGYGTL LAGLLIMGAG CALANPAIVE AVMSAIPPNK AGAGAGVDGT
MTEVGGSLGI AVLGAVLNAR FAALLPAALA GAGSFPAALA AAGSDREVVT VAFADALRTG
QTVGAVAVLV GGFVAAALLY RADRLSGPG
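
The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation routine. This is a minimal sketch, not the tool that produced the record: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons), and the names `clean` and `translate` are hypothetical helpers.

```python
from itertools import product

# Codon table in the conventional TCAG ordering.
# Table 11 shares these codon-to-amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate a DNA string in frame 1; optionally stop at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First and last 30 bases of the gene sequence listed above.
    print(translate("ATGAGTTTGA TCGACAGTCC GCCCAACGAT"))  # MSLIDSPPND
    print(translate("CGGGCCGACC GCTTGTCCGG GCCCGGCTGA"))  # RADRLSGPG
```

The two fragments reproduce the first ten and last nine residues of the protein sequence; the terminal TGA is the stop codon.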