Gene Sare_2558 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2558 
Symbol 
ID5708424 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2913245 
End bp2914489 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content75% 
IMG OID641272021 
Productmajor facilitator transporter 
Protein accessionYP_001537391 
Protein GI159038138 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.653406 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000197888 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATCTCCG ACACGACCCC ACTGCGAGGT GGCCCGCACC CGGGCCCGAT CGCCTCGCCC 
GACCGGCGGG CCCGGTCACT CGCGGCAGCC CTCTACGGGT ACGCGTTCCT GCGCGACCTC
GTCCTGCTCT ACCCCGTCTA CCCGCTGCTG TTCACCGACA CCGGTCTGAC GGTGTGGCAG
ATCTCGACCT TGTTCGTCAT CTGGTCGGCC AGTTCGATCG TGCTGGAGGT CCCCTCCGGG
GCGTTGGCCG ACGCCGTCTC CCGGCGGCTG CTGCTCTGCC TCGCGCCGCT GGTGACCGCC
GCCGGCTTCG CGCTCTGGAC ACTCGTGCCC TCGTACCCGG CCTTCGCGGT GGGCTTCCTG
CTCTGGGGAG TCGGCGGCGC GCTCGCCTCC GGTGCGTTGG AGGCCCTGGT CTACACCGGC
CTGGAGCGGC TCGGCGCGGC CAGCCGGTAC GCCCGTGTCA TCGGCCGGGC CCGCACCGCG
GAAACCCTCG GCGTGTTGGC CTCCCTCGTG TTGGCGGCGC CGGTACTCGC CCTCGGCGGC
TACCCCGCTC TCGGCGTGGC GAGCGTCCTG GCCTGCCTGG TGGCCGCCGC CGTCGCCACC
CGCCTACCGG AGCACCGCGA GCCGGCCGCC GGGCCGGGCG CCGACCCCGC CGACGGTGAA
CACGGCTGGT GGGTCTCCCT GCGGGGCGGG CTGGCCGAGG CGCACGCCGA CCGGACGGTG
CGCGCGGCGG TGCTGCTGGT CGCCGCCGTC GCCGCCGGAT GGGGGGCGCT CGACGAGTAC
CTTCCGCTGT TGGCCCGAGA CGTCGGGGCG AGCGGGCCGG CCGTGCCGCT GCTGCTCGTC
CTCACCTGGG TCGGTGTCGC CGTCGGCGGC CTGCTCGCCC CGGCGGGGGA GCGGCTGGGC
CGCCGCGGGT ACGCCGGCCT GATCGGTGGG TCCGCCGGGG CGCTCGCCGC CGGCGCGCTG
ATCGGTCACC CAGTCGGGTT CCTGCTGGTG GGCGTCGCCT TCTGCGGTTT CCAACTGGCC
ACTGTGCTCG CCGACGTCCG GCTCCAGGCG CGGATCGTCG GCCCGGCCCG GGCCACCGTC
ACCTCGCTCG CCGGGATGGC GACCGACACG ACGATCATCG CGTGCTACGT CGGGTACGGC
CTGCTCGCCA CCGTCGCCGG CAACCGGGTC GCGTTCGCGG TGGCGGTGGC GCCCTACCTC
GTCGTGGCGC TGCTGGTGGC CGTCGTACGA CCGGTCCGCC GATGA
 
Protein sequence
MISDTTPLRG GPHPGPIASP DRRARSLAAA LYGYAFLRDL VLLYPVYPLL FTDTGLTVWQ 
ISTLFVIWSA SSIVLEVPSG ALADAVSRRL LLCLAPLVTA AGFALWTLVP SYPAFAVGFL
LWGVGGALAS GALEALVYTG LERLGAASRY ARVIGRARTA ETLGVLASLV LAAPVLALGG
YPALGVASVL ACLVAAAVAT RLPEHREPAA GPGADPADGE HGWWVSLRGG LAEAHADRTV
RAAVLLVAAV AAGWGALDEY LPLLARDVGA SGPAVPLLLV LTWVGVAVGG LLAPAGERLG
RRGYAGLIGG SAGALAAGAL IGHPVGFLLV GVAFCGFQLA TVLADVRLQA RIVGPARATV
TSLAGMATDT TIIACYVGYG LLATVAGNRV AFAVAVAPYL VVALLVAVVR PVRR