Gene Sare_1372 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1372 
Symbol 
ID5707291 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1587066 
End bp1588505 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content69% 
IMG OID641270883 
Productmajor facilitator transporter 
Protein accessionYP_001536264 
Protein GI159037011 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0033883 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCGCGGT CGAGAGCTCC GACGGGGACG GGGCCAACCC AGCGGTGGCA CATCCTCAAC 
GTCGCGCTGG TCGTCTCGTT CATCAGCCTG TTCGATCTGA GTGTCATCAC CGTGATGCTT
CCCTCGATCG ACCGCGCGCT GGGCGTCGGC AACGGCGCCG TCCAATGGGT CGTCTCCGGC
TACTCGCTGA TGTTCGCGCT CGCTCTCGTC CCCGCGGGTC GGCTCGGCGA CGCCAAAGGA
CGACGGTACG TCCTGCTCCT CGGCCTCGCC TTGTTCGTCG GCGCCAGCAC GATGGCCGGA
CTGGCCGACT CCGGCACCAC ACTGGTGGTC GCCCGGCTCT TACACGGTGC CGCCGCCGGT
GTCCTCGTAC CCCAGATCAC CGCCCTGGTG GAGGAGTTGT TCCGCCCCGG AGAACGAGGG
CGCCCATTCG GAGTGCTCGG TGCCACAACC AGCGTGGCCA CCGCGGTCGG GCCGGTGCTC
GGCGGGCTGA TCATCGCCGT GGGCGGGGCG GAGAACGGCT GGCGGCTGGT CTTTCTCATC
AACGTCCCCG TCGGCATGGT CACGGCGGTG CTCATCTGGC GGCTGTTTCC GACGAAACCA
GTGGACAGCC AGGCAGACCA TCGACCCGAC CTGATCGGCA CGCTGCTGCT GGGCGCCGGG
ATCCTGTTGG TGATCCTGCC CGCGCTGCAA CGGGAACGGT GGACGGGGCT GCTGCCGTGG
CTGTTGGTGC CGGTCGGGAT CGGTGTCCTC GTCGCCTTCC GGTCCTGGGA AACCTCCACG
CTGCGACGCC ACCCGCCGGT CTTCGACTTC CGCCTGCTCA CCCACCGGAC GTACCGCCTG
GGGCTGCTAC TCGGGTTCCT CTACTACGCC GGCTTCACCT CACTACCGGT GATCTTCAGC
TTCCACCTGC AGTACGGCCT ACACCGCAGC CCCCTGCACA CCGGCCTCAC CATCGCGCCA
TTCGCGATCG GCTCCGCGAT CGGCGCCCTG CTAGGCGGAC GGAACGTCGA CCGTTACGGC
CGTCCACTGC TGGCCATCGG ACTCGCCATC GTCATGGCCA GCCTCACCAT GATGATGGCC
CTCACCGGCC TGCACGGGAC GACCTTCTCG ATCGCGAACG CGTTCCCGCT ACTGCTGGCC
GGGATCGGCA GCGGCCTCGT GATCACACCG AACAACACCC TGACCCTCCA GCAGGTGCCC
CGGGCTGACG CCGGCAGCGC CGCCGGCATG TACCGCACCA TGCGGCAGGT CGGATCCGCG
ATCGGGTTGG CGATGATCAC CGCGACCCTG CTCGCGGCGG TCGCGGCCAA CGACGGACGC
TGGCCGACCG CACTTCGCTA CGCGCTCGCG GTGGAGGTCG GCATCGTCGC CGCCGCCCTG
CTCACCAGCC TCGTCGACAT CCTCGGCAGC CGCCGCCACG GACGGAAGCG GGCCCCCTGA
 
Protein sequence
MSRSRAPTGT GPTQRWHILN VALVVSFISL FDLSVITVML PSIDRALGVG NGAVQWVVSG 
YSLMFALALV PAGRLGDAKG RRYVLLLGLA LFVGASTMAG LADSGTTLVV ARLLHGAAAG
VLVPQITALV EELFRPGERG RPFGVLGATT SVATAVGPVL GGLIIAVGGA ENGWRLVFLI
NVPVGMVTAV LIWRLFPTKP VDSQADHRPD LIGTLLLGAG ILLVILPALQ RERWTGLLPW
LLVPVGIGVL VAFRSWETST LRRHPPVFDF RLLTHRTYRL GLLLGFLYYA GFTSLPVIFS
FHLQYGLHRS PLHTGLTIAP FAIGSAIGAL LGGRNVDRYG RPLLAIGLAI VMASLTMMMA
LTGLHGTTFS IANAFPLLLA GIGSGLVITP NNTLTLQQVP RADAGSAAGM YRTMRQVGSA
IGLAMITATL LAAVAANDGR WPTALRYALA VEVGIVAAAL LTSLVDILGS RRHGRKRAP