Gene Sare_0984 details


Gene Information

Locus tag: Sare_0984
Symbol:
ID: 5707524
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 1109153
End bp: 1110367
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 74%
IMG OID: 641270499
Product: major facilitator transporter
Protein accession: YP_001535886
Protein GI: 159036633
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
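
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function name cds_lengths is hypothetical and is not part of the source database.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumptions (not stated by the record itself):
    - coordinates are 1-based and inclusive
    - the final codon is a stop codon and is not counted as a residue
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codon_count = gene_length // 3
    protein_length = codon_count - 1  # drop the terminal stop codon
    return gene_length, protein_length


# Values taken from the record above.
gene_len, protein_len = cds_lengths(1109153, 1110367)
print(gene_len, protein_len)
```

With the values from this record the result agrees with the listed 1215 bp and 404 aa.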


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.973774
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00222704
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGAACCTGA AGCCTTACCG GGCGGCGCTC GCCCTGCCCG GTCTCCGAAC TCTGCTGATC 
GTGGCGGTCC TCGCCCGTAT CCCGCTCACC GCGACCGGGC TGACCCTCAC GTTCTACGTC
GTCCAGGACC TCGGCCGAGG GTACGGAGCC GCCGGGCTGG TCGGCGCCGC GATCACCGTC
GGCGCGGCCG TCGGCGGCCC GGTGCTGGGC CGCCTGATCG ACCGGCGCGG CCTTCGCCCG
GTGCTGGTGT TGACCGCCGT GGCCGAGGCG ATCTTCTGGT CGACCGCGCC GATGCTGCCG
TACCCACTGC TGCTGCCGGC CGCGTTCCTC GCCGGTTCGC TGGCGCTGCC GATCTTCTCG
GTGATCCGCA GTTCCATCGC GGCAATCGTG CCGGCGGACC GCCGCCGAGC CGCGTACGCG
CTGGACTCGG TGTCGGTGGA ACTGGCCTTC ATGATGGGTC CGGCCCTGGC CACCGTCGCG
GTCACCACCA TCTCCGCGCG CACCACGCTC TACCTGGTGG GGGCCGGCAT CGTCGCCGCC
GGCGTCGGGC TCTTCCTGCT CGACCCGCCA CTTCGGGGTG CCAGCGACCC GGTAGGCCCG
CAGCGTAAGG TGCCGCGGCG GGAGTGGCTC ACCCCCCGGA TGGTCGCCGT ACTGGCCGTC
AGCACCGCCG CCACCGTGGT GCTGGGCGGC ACCGACGTGG CGGTGATCGC GGTGCTGCGC
GACAACGGCG ACATCGGGTT CACCGGCGTG GTGCTGGCCA TCTGGGCCGT CGCCTCGCTG
GTCGGTGGCT TCGCCTACGG GGCGGCCACC CGGGCCCCGT CCCCGTTGGC GTTGCTGGCG
GTCCTGAGCA TCGCCACGAT CCCGGTCGGA CTGGCCGGCG CGAACTGGTG GCTGCTCGGC
CTGGTACTGA TCCCAGCCGG CCTGCTCTGC GCCCCGACTC TCGCCGCCAC CTCGGACGCG
ATCAGCCGGT TGGCACCCGT GGACGCGCGC GGCGAGGCGA TGGGCCTGCA CGGCTCCGCC
ATCACCGTCG GCATCGCGGT CGGCGCCCCA CTGGCCGGTG CCGTCATCGA CGCGTCGGCA
CCGGCCTGGG GCTTCGCCGT GACCGGCGCG GTGGGTGGCC TGGTCGCCCT GGTGGTGCTT
CCGATAGAGC TGCGCCGCCG CAGGGCTGGG GCACCGGCGC CCGTTCCCGA GCCCGAGCTG
ACCCACGCCG CCTAG
 
Protein sequence
MNLKPYRAAL ALPGLRTLLI VAVLARIPLT ATGLTLTFYV VQDLGRGYGA AGLVGAAITV 
GAAVGGPVLG RLIDRRGLRP VLVLTAVAEA IFWSTAPMLP YPLLLPAAFL AGSLALPIFS
VIRSSIAAIV PADRRRAAYA LDSVSVELAF MMGPALATVA VTTISARTTL YLVGAGIVAA
GVGLFLLDPP LRGASDPVGP QRKVPRREWL TPRMVAVLAV STAATVVLGG TDVAVIAVLR
DNGDIGFTGV VLAIWAVASL VGGFAYGAAT RAPSPLALLA VLSIATIPVG LAGANWWLLG
LVLIPAGLLC APTLAATSDA ISRLAPVDAR GEAMGLHGSA ITVGIAVGAP LAGAVIDASA
PAWGFAVTGA VGGLVALVVL PIELRRRRAG APAPVPEPEL THAA
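
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the following shows one way to strip that formatting, compute a GC fraction, and translate the gene with the codon assignments of translation table 11. The helper names (clean, gc_fraction, translate_cds) are hypothetical. The sketch assumes that table 11 shares the standard codon-to-amino-acid assignments and that an alternative start codon such as GTG is read as methionine at the first position only.

```python
from itertools import product

# Standard codon assignments in NCBI base order (T, C, A, G).
# Assumption: translation table 11 uses these same assignments and
# differs only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by table 11; each is read as M when initiating.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq):
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in a DNA string."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate_cds(dna):
    """Translate a CDS, dropping any trailing partial codon and final stop."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    protein = [CODON_TABLE[c] for c in codons]
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"  # initiator codon is read as methionine
    if protein and protein[-1] == "*":
        protein.pop()  # the stop codon is not part of the protein
    return "".join(protein)


# First two blocks of the gene sequence above.
print(translate_cds("GTGAACCTGA AGCCTTACCG"))
```

Applied to the full gene sequence, the output of translate_cds can be compared with the listed protein sequence, and gc_fraction with the listed GC content.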