Gene Sare_1506 details


Gene Information

Locus tag: Sare_1506
Symbol:
ID: 5705484
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 1735027
End bp: 1736313
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 68%
IMG OID: 641271012
Product: extracellular solute-binding protein
Protein accession: YP_001536393
Protein GI: 159037140
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
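
The coordinates and lengths in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1735027, 1736313)
print(length_bp)                  # 1287
print(protein_length(length_bp))  # 428
```

Both values agree with the Gene Length and Protein Length fields of the record.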


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.629333
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000020483
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAGCCTCA CCACGCGGCG TTCCCGCCTG GCGGCCAGCG CCCTTGCCGC GACAACCGCC 
ATCGGCGGCC TCGCGGCCTG CGGCAACGAC GACGAACCGG TCGCCGGCGA GAAGCCCGGC
AAGCTGGTTC TCGAAACGTT CGGCGAGTTC GGCTACGAAG AGCTCATCAA GCAGTACGAG
AAGGACACCG GCATCAAGAT CGAGCTGCGC AAGACCGCGC AGCTGGGCGA GTACCGACCC
AAGCTGGTGC GCTACCTGGC CACCGGCAAG GGCGCGGGCG ACGTCGTCGC CCTGGAGGAG
GGCATCCTCA ACGAGTTCAA GTCCAACCCG CGCAACTGGG TGGACCTCGC CCCGCTGGTC
GACGACCACT CCACGGACTA CCTGCCCTGG AAGTGGGAGC TGGGCAAGGC GCCGGACGGC
CGGCTGATGG GCTTGCCGAC CGACGTCGGC AGCCTGGCCG TCTGCTACCG CAAGGATCTG
TTCGAGGCGG CCGGCCTACC CACCGACCGG GAGCAGGTGT CGGCGCTCTG GCCGGACTGG
GACAGCTTCC TGCAGACCGG CCGCACGTAC AAGCAAGGTA GCGGTGGCAA GGCCTTCATC
GACTCGGTCA CCGCCGTCGC CGACGCGGCG CTGTTCCAGC AGGGCGCCGA CCTCTTCTAC
GACAAGGAGA ACAACATCAT CGCGGGGACC AGCCCCGCGG TGAAGACGGC CTGGGACACC
GCGGTCTCGA TGGCCGACAT CTCCGCCAAG GTCGCCACCT GGTCACCGGA GTGGTCCGCC
GGCTTCAAAC AGGGCAGCTT CGCGGCCACC TTCTGCCCAT CCTGGATGCT CGGCATCGTC
GCGGACAACT CCGGGGAGGA GAACAAGGGC AAGTGGGACG TGGCGGCCGT GCCCGGCAGC
GGCGGCAACT GGGGCGGCTC CTGGCTGGCC GTGCCGGAGC AGAGTGCCCA CCACGAGGAG
GCGGCGAAGC TCGCCGAGTT CCTGACCAGC GCCGCCAGCC AGGTGGAGGC CTTCAAGGCC
AAGGGCCCGC TGCCCACCCA CCTGGAGGCG TTGCAGGACG AGGCGTTCCT CAGCTACACC
AACGAGTACT TCAGCGACGC GCCGACCGGC AAGATCTTCG GTGAGAGTGT CAGCAAGATC
GAGCCGATTC ACCTGGGCCC GAAGCACCAG GCGGTGAAGG AAAACGCCTT CGGACCGGCC
CTGCGGTCCT TCGAGAACGG ACAGGCCGGC GAGGACGAGG CCTGGCAGCA GTTCACCAAG
GACGCCGAGA TCCAGGGCGC CTTCTGA
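
A GC content figure such as the one in the record can be recomputed from a block-formatted sequence like the one above. This is a minimal Python sketch, assuming GC content means the fraction of G and C bases over the full ungapped sequence; how the database itself computes or rounds the value is not stated in the record. The helper names are hypothetical, and only the first line of the sequence is used as a short sample.

```python
def clean_sequence(raw: str) -> str:
    """Strip whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an ungapped nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used here only as a short sample.
sample = clean_sequence(
    "ATGAGCCTCA CCACGCGGCG TTCCCGCCTG GCGGCCAGCG CCCTTGCCGC GACAACCGCC"
)
print(len(sample))
print(f"{gc_fraction(sample):.0%}")
```

Passing the complete gene sequence to `clean_sequence` gives a value that can be compared with the GC content field.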
 
Protein sequence
MSLTTRRSRL AASALAATTA IGGLAACGND DEPVAGEKPG KLVLETFGEF GYEELIKQYE 
KDTGIKIELR KTAQLGEYRP KLVRYLATGK GAGDVVALEE GILNEFKSNP RNWVDLAPLV
DDHSTDYLPW KWELGKAPDG RLMGLPTDVG SLAVCYRKDL FEAAGLPTDR EQVSALWPDW
DSFLQTGRTY KQGSGGKAFI DSVTAVADAA LFQQGADLFY DKENNIIAGT SPAVKTAWDT
AVSMADISAK VATWSPEWSA GFKQGSFAAT FCPSWMLGIV ADNSGEENKG KWDVAAVPGS
GGNWGGSWLA VPEQSAHHEE AAKLAEFLTS AASQVEAFKA KGPLPTHLEA LQDEAFLSYT
NEYFSDAPTG KIFGESVSKI EPIHLGPKHQ AVKENAFGPA LRSFENGQAG EDEAWQQFTK
DAEIQGAF
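
The protein sequence corresponds to the gene sequence translated in frame 1 with translation table 11. The sketch below is a minimal pure-Python illustration, not the database's own pipeline. It assumes an unambiguous A/C/G/T sequence and an ATG start. Table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in the permitted start codons, which this sketch does not handle. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; NCBI table 11 shares these
# assignments with the standard code (the tables differ in start codons).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(raw: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 27 nt of the gene sequence in this record.
print(translate("ATGAGCCTCA CCACGCGGCG TTCCCGCCTG"))  # MSLTTRRSRL
print(translate("GACGCCGAGA TCCAGGGCGC CTTCTGA"))     # DAEIQGAF
```

The two fragments reproduce the first ten and last eight residues of the protein sequence. A sequence containing ambiguity codes such as N would raise a `KeyError` in this sketch.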