Gene Sare_0365 details


Gene Information

Locus tag: Sare_0365
Symbol:
ID: 5703859
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 424082
End bp: 425365
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 68%
IMG OID: 641269890
Product: major facilitator transporter
Protein accession: YP_001535285
Protein GI: 159036032
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
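
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the stated gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 424082
end_bp = 425365

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases; the last codon is assumed to be a stop
# codon, which does not contribute a residue to the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1284
print(protein_length_aa)  # 427
```

Both results match the Gene Length and Protein Length fields of the record.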


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0592694
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCACGA CCGCGCAACC CCGGCCTGGT ACGCCGGTTC CCCCGCTGCG TCACAACCGT 
GACTTCCTCC TGCTCTGGAG TGGGACCGCC GTCTCCCTCG TCGGTCTGAC CGTATCGACC
ATCGCCTACC CCTTGTTGAT CCTGGCCGTC ACTGGATCGA AGGCCGCAGC CGGTGTCGTC
GGCTTCTTCT CGCTCCTGCC GTCACTGCTG TTCCAACTGC CGGCAGGGGT GCTGGTCGAC
AGGTGGAACC GCCGTCGGCT GATGATCTGG TGTGACGTGA TCCGCGCCCT CGGCGCCGCG
AGTGTGGTCC TCGCGCTGGC TCTCGACGAG CTGACGTTGG CGCATGTGGT GGTGGTGGGG
TTCGTCGAGG GCACGATGTC GGTGTTCTTC AACCTCGCGG CACATGCCGC GGTGCCCAAC
GTCGTTCATC CGGACCACCT GTCGACCGCA CTGTCGCGTA ACGAGGCACG TTCCCGGGCG
GCGACCATGC TCGGCACCAC CCTCGGTGGC GTCCTGTTCG GCTTGAGCCG TGCCTTGCCG
TTCGCGCTGC ACGCGGTCAC CCATGTGATC TCGCTGGTCA CGTTGTTGTT CATTCGGTCT
GACTTCCAGA GCGACCGGCA GGTGCGTACC CGCACCACCG GGATGCTCGC CGAGGTCGGG
GAAGGAACGC GCTGGCTGTG GCGCCAGCCG TTCCTGCGCA CGGCCGCGCT GCTCGTCGCC
GGCAGTAACT TGCTGTTCCG CGCGCTGTTC CTTGTCGTCG TGGTGATGGC AACCGACGTC
GGCGCGTCGC CGGCGGCGGT CGGTGTGCTG CTCAGTGTCG CCGGCGCTGG CGGGGTGCTC
GGTTCCCTCG TCGCGGGCTG GTGTCAACGC CGGCTGCCGC TTTCGGCCCT GGTGGTCGGC
GCGAACTGGG TGTGGGCCGT ACTCATGGGC GTGATCGTGC TCACTGACAG CCTGTATCTG
CTCGCCGCCG CCTACGCCGC GATGTGGTTC GTCGGACCGG TGTGGAATGT CGCCGTCGCC
ACCCATCAGC TCCGTGTCAC CCCGGACCGG CTGCGGGGTC GGGTCCTTGG CGCGATGGGC
CTGCTCGCCA GCGGCGCGTT GCCGGTTGGC GCCCTGATCG GCGGTCTGCT CCTGGAATGG
TCCGATGCGC GCACCGCCGC GCTGGTGCTC GCCGGCTGGA TGGTGCTGCT GGCCCTTGTC
GCGACGATCG CCCCCTCGCT GCGAAGGGCG ATCGCACCTG TCGAGACCAC GACCACCCCC
GACGCCGAAC CAGCCGTTCG ATGA
 
Protein sequence
MTTTAQPRPG TPVPPLRHNR DFLLLWSGTA VSLVGLTVST IAYPLLILAV TGSKAAAGVV 
GFFSLLPSLL FQLPAGVLVD RWNRRRLMIW CDVIRALGAA SVVLALALDE LTLAHVVVVG
FVEGTMSVFF NLAAHAAVPN VVHPDHLSTA LSRNEARSRA ATMLGTTLGG VLFGLSRALP
FALHAVTHVI SLVTLLFIRS DFQSDRQVRT RTTGMLAEVG EGTRWLWRQP FLRTAALLVA
GSNLLFRALF LVVVVMATDV GASPAAVGVL LSVAGAGGVL GSLVAGWCQR RLPLSALVVG
ANWVWAVLMG VIVLTDSLYL LAAAYAAMWF VGPVWNVAVA THQLRVTPDR LRGRVLGAMG
LLASGALPVG ALIGGLLLEW SDARTAALVL AGWMVLLALV ATIAPSLRRA IAPVETTTTP
DAEPAVR
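
As a consistency check between the two sequences above, the sketch below translates the first and last lines of the gene sequence and compares them with the start and end of the protein sequence. It is an illustration and not the tool that produced this record. The helper names `clean` and `translate` are hypothetical. The codon table is written in the usual NCBI base order (T, C, A, G); translation table 11 assigns the same amino acids to codons as the standard code, and its alternative start codons are not needed here because the gene begins with ATG.

```python
# Amino acids for all 64 codons in NCBI order (first base varies slowest,
# bases ordered T, C, A, G). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring any trailing partial codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence, copied from this record.
# The last line starts at base 1261, so it is still in reading frame.
first_line = "ATGACCACGA CCGCGCAACC CCGGCCTGGT ACGCCGGTTC CCCCGCTGCG TCACAACCGT"
last_line = "GACGCCGAAC CAGCCGTTCG ATGA"

print(translate(first_line))  # MTTTAQPRPGTPVPPLRHNR
print(translate(last_line))   # DAEPAVR*
```

The first result matches the opening 20 residues of the protein sequence, and the second matches its final 7 residues followed by the TGA stop codon.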