Gene Sare_4090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4090 
Symbol 
ID5704743 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4651097 
End bp4652668 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content67% 
IMG OID641273516 
ProductEmrB/QacA family drug resistance transporter 
Protein accessionYP_001538871 
Protein GI159039618 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.588022 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0353646 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCAGG CAACCCAGGC GGCCGTGCGA CCGAAGATTC GGATCGTGCT GTTCGGACTG 
ATGATCGCCA TGATGCTGGC GATGTTGGAC AACATGATCG TCAGCACCGC CCTGCCGAGG
ATCGTGGGTG AGTTCGGCGG GCTGGACCAC TTCACCTGGG TCGTTACCGC CTACGTTCTC
GGGACCACCG TCTCGACGCC GATCTGGGGC AAGCTCGGGG ATCTCTTCGG CCGGAAGTCG
ATCTTCCTGA CCTCGGTAGT GATCTTCCTG GTCGGGTCCG CGCTGTGTGG GATGGCGGGC
TCCAACCTGC TCGGCGGGCC GGACGATGGG ATGGCGGAGC TGATCGCCTT CCGGGCCGTA
CAGGGCCTTG GCGCCGGCGG TCTGTTGGTC GGTGTGTTGG CGATCATCGG CGACCTGGTG
CCGCCTCGTG AGCGGGGACG CTACCAAGGC ATGATCGCGG GGATCATGGC GATCGCGCTG
GTGGCCGGTC CGCTGGTCGG CGGCTTCATC ACCGACCATC TCTCCTGGCG TTGGGCGTTC
TACGTCAACC TGCCACTCGG CGGAGTCGCG CTGGTGCTGC TTGTCGCTAC CCTGCGGCTG
CCCCGGTACC GCACCGAACA CCGGATCGAC TGGCTTGGTG CCGCGCTGCT CGCGGTCGGC
ATCACCGCGA TGGTCCTGAT CACCACGTGG GGTGGCAACG AGTACGCCTG GAGGTCCCCG
CAGATCGTCG GTCTGGTCGG CCTGGCGGTG GCTGCGCTCG CCGTCTTCGC GGTCGTCGAG
CTACGGGCGG TCGAGCCGAT TCTGCCGCTG AAGCTCTTCG CCAACCGTAA CTTCGCGCTG
ATCTCCGCGA TCGGCTTCCT GCTCGGCTTC GCGATGTTCG GCGCGATGAG CTTCCTGCCG
CTCTACCAGC AGACCGTGCA GGGCGCCTCA GCCACCGAGT CCGGACTGCT CCTGTTGCCG
TTGATGTTCG GCATGCTGGT GGTCTCGCTG GCGGTGGGCC GGACGATCAC CAGGACCGGC
CGGTACCGGG TGTTCCCGAT CGTCGGTGGC GTGGTGATGA GCGCCGGCAT GGCTCTGCTG
ACGTACCTCG ACGCGCAGAC CGGCAGGACG GAATCCTCGC TCTACCTCTT CGTGCTCGGC
GTCGGCATGG GCTTTCTCAT GCAGACCACG ATGCTCATCG CACAGAACAG CGTCGACCAG
CGTGACTTGG GCGCGGCGAG CGGTGCGGCG ACCTTCTTCC GTTCGATCGG CGGCTCGTTC
GGCATCTCGC TCTTCGGAGC TGTCTTTGCC AGCCGGCTGG CCGGTTCGCC GGGCGGCGGT
GCGTTCGGGG GCGGTGAGGC CGGGACGGCG ATGGACCTGG CGAAGCTCCA GCAACTGCCG
GCGGCGGCCC GCGAACTGGT CTTTGGTGGC CTCGCCGACG CGATCTCGCA CGTGTTCCTG
TGGGCGCTGT TGTTCACCCT TGTGGTGCCG GTGCTCGCCT GGTTTATCAA GGAGATTCCG
CTGCGCACCG AGAACATCCC CGGACCGGCC GATGAGGCGG AGGACACCCT TGCCGGGACG
CCCACTCTCT GA
 
Protein sequence
MTQATQAAVR PKIRIVLFGL MIAMMLAMLD NMIVSTALPR IVGEFGGLDH FTWVVTAYVL 
GTTVSTPIWG KLGDLFGRKS IFLTSVVIFL VGSALCGMAG SNLLGGPDDG MAELIAFRAV
QGLGAGGLLV GVLAIIGDLV PPRERGRYQG MIAGIMAIAL VAGPLVGGFI TDHLSWRWAF
YVNLPLGGVA LVLLVATLRL PRYRTEHRID WLGAALLAVG ITAMVLITTW GGNEYAWRSP
QIVGLVGLAV AALAVFAVVE LRAVEPILPL KLFANRNFAL ISAIGFLLGF AMFGAMSFLP
LYQQTVQGAS ATESGLLLLP LMFGMLVVSL AVGRTITRTG RYRVFPIVGG VVMSAGMALL
TYLDAQTGRT ESSLYLFVLG VGMGFLMQTT MLIAQNSVDQ RDLGAASGAA TFFRSIGGSF
GISLFGAVFA SRLAGSPGGG AFGGGEAGTA MDLAKLQQLP AAARELVFGG LADAISHVFL
WALLFTLVVP VLAWFIKEIP LRTENIPGPA DEAEDTLAGT PTL