Gene Sare_2348 details


Gene Information

Locus tag: Sare_2348
Symbol:
ID: 5706932
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2701238
End bp: 2702704
Gene Length: 1467 bp
Protein Length: 488 aa
Translation table: 11
GC content: 68%
IMG OID: 641271826
Product: major facilitator transporter
Protein accession: YP_001537197
Protein GI: 159037944
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
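
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes a single terminal stop codon; the function names are hypothetical and not part of the source record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2701238
END_BP = 2702704
GENE_LENGTH_BP = 1467
PROTEIN_LENGTH_AA = 488


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


print(span_length(START_BP, END_BP))   # 1467
print(coding_codons(GENE_LENGTH_BP))   # 488
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.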


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.720535
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.469317
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGCCG AGAACCACAA CGGGGAGTTA CCTCGGACGG GGTCGGGTCG TCCGGTGCCG 
CGGGGACCGG TGGTGGTCGT AGGTGTGCTG GTGGCCTTCA TCATCGGGCT CAACATCGCC
CTGTTGGTGA TCGTCTTGCC GACGATCCGG AGTACGCTCG GACTGGATTC GAGTAGTCAG
CAATGGCTGA TCTCCGCGTT CTACCTCGCG TTCGGCCTGG TGTTGGTGCC GGCCGGTCGA
TTCGGTGATG TACGCGGCCG GCGCGCCATC TTCGTGACCG GCGTGACGGC GTTCGTGGTG
GCGAGCGGGG TCGCGGCCTT CGCCTCACAC GGGGCGTGGC TGATCGGCGC CCGGCTGGTG
CAGGGTGTTG GCGTCGGGCT GGCGTTCGCG CAGGTCTTCG GAACGATACA GCGGCTGTAT
TCCGCCCGGG AGCGGGGACT TCCGTTCGGA GCGGTCATCG CGGGCGTCAG CGTCGCCCGC
GTGTCCGGTC CGGTGCTCGG CGGTGGACTG GTCGCCCTCG GCGGTGCCGA GTGGGGCTGG
CGGTGGTCCT TTCTGGTCAA TGTGCCGGTG GGAATCGTCG TCGCGTTCCT CGGCTGGCGG
TTGTTCCCGG TCGCGGAGCG AGTCGCGCGG CCGAGGATGG ACGTGACGGG GGCCGTCCTG
TTGATGGTCG GGCTGGGCTT GGTGTGGCTG ACGCAGGGCG AGCAGTGGCC GGGGTGGGTT
CGCTGGACAC TCCTGCCCGC GGGGTTGGTG CTGCTGGTCG GATTCGTGTT CTGGGAATAC
CGGTACACCC GTCGGGGTGA GCCGATGTTC ACCATAAGAT TGTTCCGGTT TCGGTCGTTC
GCGGCGGGGA TGGTCATCGC CACGTTCTAC ACCGCCGGCT ACGACGGCAT TTACTACCTG
ATGTCGGAAT ACCTCCAACA TGGGCTCGGG CATAACGAAC TGGTGACCGG CATCGCGCTC
ACCCCGCTGG CGCTGGGAGT AGCCGTCAGT TCGGTGATCG GTGGCGCCAA GGCGGGGCGG
ATGGGCAGCC GGCTGGTTGT CTCGGGGCTG GTCCTCGTGG CGGTCGGGCT GACCGCGCTG
CTGGTCGCCG ACCTCTTCCT TCCCGGCCCG GACTCCCCAC ACGCGGCCAC GCTGCCGCTA
CTGCTGGCCG GACTGGGTGG CGGACTGGTC ACCTCAGGGG TGGGCGGTGG CCTGGTGAAC
GCGCCGAACC TGACAGTGGC CCTGTCCCCC GTGCCACAGA CCGAGGGCGG AAGTGCTGGC
GGGATGCTCG AGACCGGGCA GGCGTTCGGT GGCGGTCTGG GAGTTGGTGT CGTCGGCACG
GTCATCTTCG CGAGTCTCGA CCAGACGGAC AACTGGTTGA CCGCCTTCCG GCTACCCGTT
CTGGTCATCG TCGGACTCTT CGTCGTCGCG CTGGCAGCCG CCCTGATCAG CCTGTTCTTC
CCGGACCGGG CCAGGCCACG GTCATGA
 
Protein sequence
MTAENHNGEL PRTGSGRPVP RGPVVVVGVL VAFIIGLNIA LLVIVLPTIR STLGLDSSSQ 
QWLISAFYLA FGLVLVPAGR FGDVRGRRAI FVTGVTAFVV ASGVAAFASH GAWLIGARLV
QGVGVGLAFA QVFGTIQRLY SARERGLPFG AVIAGVSVAR VSGPVLGGGL VALGGAEWGW
RWSFLVNVPV GIVVAFLGWR LFPVAERVAR PRMDVTGAVL LMVGLGLVWL TQGEQWPGWV
RWTLLPAGLV LLVGFVFWEY RYTRRGEPMF TIRLFRFRSF AAGMVIATFY TAGYDGIYYL
MSEYLQHGLG HNELVTGIAL TPLALGVAVS SVIGGAKAGR MGSRLVVSGL VLVAVGLTAL
LVADLFLPGP DSPHAATLPL LLAGLGGGLV TSGVGGGLVN APNLTVALSP VPQTEGGSAG
GMLETGQAFG GGLGVGVVGT VIFASLDQTD NWLTAFRLPV LVIVGLFVVA LAAALISLFF
PDRARPRS
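
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the database's own method: it assumes the amino-acid assignments of translation table 11 match the standard genetic code, that an initiator codon is read as methionine, and it recognises only ATG, GTG and TTG as initiators (a simplification). All names in the block are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Simplified set of bacterial initiator codons (assumption).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS; whitespace between 10-base blocks is ignored."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    protein = [CODON_TABLE[codon] for codon in codons]
    # An initiator codon such as GTG is read as Met at position 1.
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"
    # Drop a terminal stop codon if present.
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First 60 bases of the gene sequence listed in this record.
first_line = ("GTGACCGCCG AGAACCACAA CGGGGAGTTA "
              "CCTCGGACGG GGTCGGGTCG TCCGGTGCCG")
print(translate_cds(first_line))  # MTAENHNGELPRTGSGRPVP
```

The output matches the first 20 residues of the protein sequence above, including the methionine read from the GTG start codon.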