Gene Sare_2343 details


Gene Information

Locus tag: Sare_2343
Symbol:
ID: 5707971
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2696938
End bp: 2698512
Gene Length: 1575 bp
Protein Length: 524 aa
Translation table: 11
GC content: 65%
IMG OID: 641271821
Product: EmrB/QacA family drug resistance transporter
Protein accession: YP_001537192
Protein GI: 159037939
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
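
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. Neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2696938
end_bp = 2698512
gene_length_bp = 1575
protein_length_aa = 524

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons, remainder = divmod(gene_length_bp, 3)

print(span == gene_length_bp)           # True
print(remainder == 0)                   # True
print(codons - 1 == protein_length_aa)  # True
```

Under these assumptions the three fields agree: 1575 bp is 525 codons, one of which is the stop codon.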


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.132405
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.210612
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGCGCTG CGGCCGAAGC CGAGACGAGT GAGGCCCGCC CGTGGACGCG AGCCCAGAAG 
TGGACGATCG TCGCGGCCGG TCTTGGCATG TTCATCACCC TTCACGACGT GCTGGTCGCG
AATGTGGCAC TGCCCCGAAT CCAGTCCTTC TATGGTTTGC GGGAGTCCGG GCTTCAGTGG
ATCGTAGCCG CATACAGCAT GGGCATGGCT GTTGCCATCA TGCCCGCAGC TACGATGGCC
GATCGATTCG GTCGGCGGCG ACTGTTTCTG ACCGCGGTTG TCGTGTTCAG CCTCGCCTCG
GTGGCCGCCG GGGCGACGTC CGTATTCGCC GTGATGCTGG CGGCGCGGGC GGTGCAGGGC
GTGGCCTCCG CCGTCATCAC CGTCTCCGCG CTGGCCCTGG TGTCGGCGAC TTTCCCACAC
AAGCGGCAGC GTTTCCGCGC TCTCGGCTTC TTCGTGGCCG TCGCCGATAT TGGTCTGGCA
CTCGGCCCAC CGCTCGGTGG GTTTCTCGCC GAGAACGCGA GTTGGCGGGT TGTCTTCTTC
GTCAATGTTC CGGTCGCCGT CGTGGCGGTC GGGCTCACCC TTCGATACGT CGCGGAGTCC
CGTGAGTCCA CCCGCCGTGT CGTCGACCTG TTCGGGCAAC TGTTGTTCGT GGTGACCGTG
GGCGCCTTCA CCTTCGCCGT GATCGATGGG CATGACCTGG GCTGGGGGTC TCCGATAATT
CTTGGGGCGT TCACCGTCTT CGCCGCCGGC CTTGTTGCCT TCATCGTCCG CGAACTTCGC
AGCCGGTCGC CAATGATGGA TCTGCGGCTC TTCGCCCACC GTCCCTATCG GTTGGGAATC
ATGGCGATAT TCTTCGGCAT GTTCACGGTG TACGGCACGC TGTTGATTGT CACCCAGTAC
TTTCAAAACG TTCGTTTGTA TTCGCCGTTA GAGGCCGGAC TGCTCATTCT GCCATCCTCA
CTCGCAGCGG TGGTGTTGTC CCCGGTTGCC GGCTTTGTCG CTGCGCGACG TGGCCCCCGC
CTACCTGCCC TCACCGGCCA GTTCCTCGTA GTGAGCGGGT TGGTGGTCAT GGTGGTCGGG
GTGTCGGTGA GTGTGCCTGC CGTCGTGGTC GGTTTCCTGC TGCTGGGCGC CGGGCTCTCC
CTGATCATCG CGCCCGTTCA GGGGTTGGCA CTGAATTCGG TGCCGGTGGA GCGGGCAGGA
ATGGCCTCGG GAATCGTGGC CACCCAGCGG GGTCTCGGGT CAACAGCCGG CTACGCCGTA
CTCGGCATGA TCGTCGCGGT GTGGGTCGGC ATCGCGTTGG ACAGTGACCT GTCCAGGACC
ATCCCGGACC GCGACGAGCG GGCCACCGTT GTCGCCCGAA TCGTCGACAA CACCAACCCC
AACGCGTTCG AGGCGGTAGT CGGGCCCGGT CGACCCATCG CCCACCCCGA TCCCGACCAA
GTGACGAAGA TCCGTGCGGT CGCGGACCGT ACCTTTGTCC GGGGCATGCA GCTCAGCCTG
GGATTTGCCG CCCTGGTGGC CCTGATCATA CTGGTGATCC TCTTCCGCAA CTTCCGTGCC
TCCCAAGACC GCTGA
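
The GC content field can in principle be recomputed from the gene sequence once the 10-base grouping and line breaks are stripped. The following is a minimal sketch; the function name `gc_content` is hypothetical, and the database may round or compute the value differently.

```python
def gc_content(formatted_seq: str) -> float:
    """Return the G+C fraction of a nucleotide sequence, ignoring whitespace."""
    # Remove the spaces and newlines used to group the sequence in blocks of 10.
    seq = "".join(formatted_seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "GTGGGCGCTG CGGCCGAAGC CGAGACGAGT GAGGCCCGCC CGTGGACGCG AGCCCAGAAG"
print(f"{gc_content(first_line):.0%}")
```

Passing the full gene sequence instead of `first_line` gives a value that can be compared with the reported GC content.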
 
Protein sequence
MGAAAEAETS EARPWTRAQK WTIVAAGLGM FITLHDVLVA NVALPRIQSF YGLRESGLQW 
IVAAYSMGMA VAIMPAATMA DRFGRRRLFL TAVVVFSLAS VAAGATSVFA VMLAARAVQG
VASAVITVSA LALVSATFPH KRQRFRALGF FVAVADIGLA LGPPLGGFLA ENASWRVVFF
VNVPVAVVAV GLTLRYVAES RESTRRVVDL FGQLLFVVTV GAFTFAVIDG HDLGWGSPII
LGAFTVFAAG LVAFIVRELR SRSPMMDLRL FAHRPYRLGI MAIFFGMFTV YGTLLIVTQY
FQNVRLYSPL EAGLLILPSS LAAVVLSPVA GFVAARRGPR LPALTGQFLV VSGLVVMVVG
VSVSVPAVVV GFLLLGAGLS LIIAPVQGLA LNSVPVERAG MASGIVATQR GLGSTAGYAV
LGMIVAVWVG IALDSDLSRT IPDRDERATV VARIVDNTNP NAFEAVVGPG RPIAHPDPDQ
VTKIRAVADR TFVRGMQLSL GFAALVALII LVILFRNFRA SQDR
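
The protein sequence begins with M although the gene sequence begins with GTG. Under translation table 11, GTG is one of several alternative initiation codons and is read as methionine at the first position. The sketch below illustrates this with a small pure-Python translator; the name `translate_cds` is hypothetical, and the code assumes an input containing only A, C, G and T.

```python
BASES = "TCAG"
# Amino acids of NCBI translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons of table 11; each is read as Met when it opens the CDS.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(formatted_seq: str) -> str:
    """Translate a CDS with table 11, stopping at the first stop codon."""
    seq = "".join(formatted_seq.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # alternative start codons still initiate with Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGGGCGCTG CGGCCGAAGC CGAGACGAGT"))  # MGAAAEAETS
```

The output matches the first ten residues of the protein sequence listed above.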