Gene Sare_0236 details

Gene Information

Locus tag: Sare_0236
Symbol:
ID: 5705964
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 265560
End bp: 266768
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 69%
IMG OID: 641269766
Product: major facilitator transporter
Protein accession: YP_001535162
Protein GI: 159035909
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
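
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. The function names here are hypothetical, and the sketch assumes inclusive 1-based coordinates and that the stop codon is counted in the gene length but not in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop: bool = True) -> int:
    """Number of residues encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, when has_stop is True,
    that the final codon is a stop codon that contributes no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop else codons


# Values taken from the record: Start bp 265560, End bp 266768.
length_bp = gene_length(265560, 266768)
print(length_bp)                  # 1209
print(protein_length(length_bp))  # 402
```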


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00072705
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGATGCCGT TCATGGTCAC GAATACCATG ACATTGGGAC GGCCGTTCTG GACGTTTTGG 
AGCGCCACGG CCCTCGCCAA CGTGGGTGAT GGGATTCGGC TGGCGGCGTT TCCTCTGCTC
GCCGCGTCGT TGACGGCCAA CCCGGTTGGC GTGGCCGCGG TGACCGCGGC CCAGGCCCTG
CCCTGGTTGG TGACCGGTCT ACTCGCCGGG TCGCTGGCCG ACCGCCGCGG CGCCCGTACT
CTGCTCGCCC AGGCCGACAT CGCCCGGGTA GTCGTCCTGG GCGTTCTGGT CGTCGCCGTG
GCGATGGGCT GGGCGTCCCT GCCGCTTGTC CTACTGGCCA GCTTCCTGCT CGGTGTCGGC
GAGACCGTGC GCGACACTGC CGCACAGACA GCACTTCCCG GCCTGGTGCC AGAGCGACTG
CTCGAGCGCG CCAACGGAAG GCTGGTCGCC GGCGAAATCG TCGGTAACGA GTTTGTCGGC
CCGCCGGTCG GCGCCGCGCT GTTCGTGGCG GGCGCGGCGT TGCCGTTCGC GACGAATGGC
GCGTCCCTCG CCCTGGCCGT CATGCTCGTG CTGACCCTGC CGCTGAGCGT GGCCGCCCGT
CCACCGCAGG ACGCGCCGAC GCACGTCAGG CAGGGTGTGG TGGCGGGCCT GCGATGGCTG
GCACGCCATC GCGTGCTCCG AACACTCGCG CTGGTCACCG CTGCGGTCGC CGCCGCTGAC
AGCGCATGGT TCGCGGTCCT GGTGCTCTAC GCGACAGACC GGCTCGGCAC CGGCGCGGCT
GGCTTCGGAG TCCTGCTCGC CGCCGGAGCC CTCGGCGGCC TTCTTGGCTC GTTCCTCGTT
GACCGGCTCG TCGCGGGCCG CCGGCACCGT GCGATCATCA CTTGGTCGCT GGCCATCACC
GCCGGTATCC CCGCGGTGCT CGCCGTGACC TCTCAATTGT GGGCGGCGAT ACTCGTCATC
GTGGTCACGA GTGGCTCGTT CGCTGTACTC AACGTCACTG TCGTGTCACT GCGTCAACGC
CTGGTGCCCC GCGAGTTGCT CGGGCGTGTG GTAGCAGCCG GCCGCACACT GAGCTTCAGC
GCCGCCGCCG CGGGTGCATT GCTTGGCGGT GTGCTCACGG CGACGATCAC AATCGAGGCG
ACGTTCATTT TCAGCGGACT GGTCGCAGTT TCGGCGACCA TCGCATGGTG GGTTGCGTCC
CGGCCCTGA
 
Protein sequence
MMPFMVTNTM TLGRPFWTFW SATALANVGD GIRLAAFPLL AASLTANPVG VAAVTAAQAL 
PWLVTGLLAG SLADRRGART LLAQADIARV VVLGVLVVAV AMGWASLPLV LLASFLLGVG
ETVRDTAAQT ALPGLVPERL LERANGRLVA GEIVGNEFVG PPVGAALFVA GAALPFATNG
ASLALAVMLV LTLPLSVAAR PPQDAPTHVR QGVVAGLRWL ARHRVLRTLA LVTAAVAAAD
SAWFAVLVLY ATDRLGTGAA GFGVLLAAGA LGGLLGSFLV DRLVAGRRHR AIITWSLAIT
AGIPAVLAVT SQLWAAILVI VVTSGSFAVL NVTVVSLRQR LVPRELLGRV VAAGRTLSFS
AAAAGALLGG VLTATITIEA TFIFSGLVAV SATIAWWVAS RP
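
The relationship between the gene sequence and the protein sequence listed above can be illustrated with a minimal sketch. It builds the codon table from the amino-acid assignments of NCBI translation table 11 (identical to the standard code for elongation codons), strips the spaces used to group the sequence into blocks of ten, and translates up to the first stop codon. The helper names `clean`, `gc_content`, and `translate` are hypothetical, and the sketch does not handle ambiguous bases or alternative start codons.

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout of the NCBI tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (block separators, line breaks) and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final line of the gene sequence in this record.
print(translate("ATGATGCCGT TCATGGTCAC GAATACCATG"))  # MMPFMVTNTM
print(translate("CGGCCCTGA"))                         # RP
```

Passing the full gene sequence to `translate` and `gc_content` in the same way allows the Protein sequence and GC content fields to be compared against values derived from the nucleotide sequence.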