Gene Sare_0790 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0790 
Symbol 
ID5705033 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp885439 
End bp887064 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content72% 
IMG OID641270309 
ProductEmrB/QacA family drug resistance transporter 
Protein accessionYP_001535700 
Protein GI159036447 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.004165 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00652278 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCGGAGC AGGGACGCGG CCGCTGGTGG GGGCTGTTCG CCATCAGCCT CGGAGTGGCC 
ACGATCATCG TCGACGTGAC GATCGTCAAC GTCGCGGTTC CGGCGATCAT CGCCGACCTC
GGCGTCAGCT CCGCCGGGGC GCAGTGGGTC CAGGAGGCGT ACACCCTCGT TTTCGCGTCC
CTGCTCCTGG TCGTCGGCCG GGTCGCCGAC CGCACCGGGC GACGGCGAAT GTTCGTCGTC
GGCGTGCTGG TCTTCGTCGC CGCCAGCCTG CTCGCCGCGT CCGCCGGGTC TGGTGCGGAG
CTGATCGGGG CGCGGGTGCT CCAGGGACTC GGCGGCGCGA TGATGCTGCC CACCTCGTTG
TCCCTGCTCA ACGCGACCTT CGCCGGCCGA GAACGGGCGA TCGCGTTCGC GGTCTGGGGC
TCCACCATCG GCGGGTCCGC CGCTCTCGGC CCGCTCCTTG GCGGTTGGCT CACCACGGAG
CTCTCCTGGC GGTGGGCGTT CGGCATCAAC GTCCCGATCG GCGTGCTGGT CATCGCCGCC
ACGCTCGTGC TGGTCCCCGA GTCCCGGGAC GACCGTGTCG AGCGGGGCAT CGACCTGGTC
GGGGCGTTGC TGTCGGTGCT CGGCATGACC GGCATCGTCT TCGCCCTCAT CGAGGGGCGC
ACCTATGGCT GGTGGGAGTC AGCCACACCG ATCACCGTGT TCGGCCAGGA CTGGCCAGCC
ACGATCTCAC CGGTGCCGGT CGCTGGGATC GGCGGCCTGG CGGCCCTCGC GCTCCTCCTC
GCCCACCAGG TGCGCCGCAA CCGGCGCGGC CGACCGGCTC TGCTCGACCT CTCGTTGTTC
CGGATCGGCT CCTTCCGAAC CGGCGTCACC GCCGCCGGGA TCGTCAGCCT GGGTGAGTTC
GGTCTGCTCT TCGCCCTCCC GCTCTGGTAC CAGAACGTGC TCGGTTACAG CGCCTTTCAC
ACCGGCCTGG CGCTGCTGCC CCTCGCCGTC GGAAGCTTCT TGTCCAGCGC CCTCGGCGCG
TTGCTCGTCA AACGGTGGGG CGCCACGCGG GTGGTGCAGC TCGGAGTGGC GGCGGAGATC
GCCGGCATCG CCGGACTCGG CCTCGTCGTC GCCCCCGACA CCCGCTGGTG GGCGCCGCTC
GGCCTGCTCT TCGTCTACGG CGTCGGCGTC GGTCTGGCCA CCGCCCAGCT CACCGGGGTG
TCGCTGCGGG AGGTGCCGCT GCGGCGCAGC GGCCAGGGTT CCGGAGTGCA GAGCACCGCG
CGGCAGGTCG GATCCGCCCT CGGTATCGCC GTGCTCGGCA CGGTCCTCTT CGCCGGACTG
AGCGGCCTCC TCACCGCGCG GCTCGCTGAA CGCTCCGACA TCGAATCCAC CCAGCGGGAA
CAGATCGTGA CCACCGTTCG GGAGAGCGGC GGAGCGGCGA TCGCCGGGCT CGCCGCCGAC
CCACGCACCG CGCCGATCGC CGAGGACGCG AAGGCCGCCT TTGCCGATGC CACCCGGTAC
GCGGCCTTCG CCGCCGCCGG CTTCCTGCTC ATCGGCCTGA TCGCCTGTGC CCGGCTGCCC
CGGGACCGCC CCGAGCCAGT GGACCCGGCG CCGATGCCGC AGCCGGCGGC CAGCCCCGCC
AGCTAG
 
Protein sequence
MPEQGRGRWW GLFAISLGVA TIIVDVTIVN VAVPAIIADL GVSSAGAQWV QEAYTLVFAS 
LLLVVGRVAD RTGRRRMFVV GVLVFVAASL LAASAGSGAE LIGARVLQGL GGAMMLPTSL
SLLNATFAGR ERAIAFAVWG STIGGSAALG PLLGGWLTTE LSWRWAFGIN VPIGVLVIAA
TLVLVPESRD DRVERGIDLV GALLSVLGMT GIVFALIEGR TYGWWESATP ITVFGQDWPA
TISPVPVAGI GGLAALALLL AHQVRRNRRG RPALLDLSLF RIGSFRTGVT AAGIVSLGEF
GLLFALPLWY QNVLGYSAFH TGLALLPLAV GSFLSSALGA LLVKRWGATR VVQLGVAAEI
AGIAGLGLVV APDTRWWAPL GLLFVYGVGV GLATAQLTGV SLREVPLRRS GQGSGVQSTA
RQVGSALGIA VLGTVLFAGL SGLLTARLAE RSDIESTQRE QIVTTVRESG GAAIAGLAAD
PRTAPIAEDA KAAFADATRY AAFAAAGFLL IGLIACARLP RDRPEPVDPA PMPQPAASPA
S