Gene Information |
Locus tag | Sare_1506 |
Symbol | |
ID | 5705484 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 1735027 |
End bp | 1736313 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641271012 |
Product | extracellular solute-binding protein |
Protein accession | YP_001536393 |
Protein GI | 159037140 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
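The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS length includes the stop codon; neither convention is stated in the record, but both agree with the listed values. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1735027
end_bp = 1736313
gene_length_bp = 1287
protein_length_aa = 428


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming the length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks compare a derived value with the value listed in the record.
coordinates_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coordinates_consistent, protein_consistent)
```

Under these assumptions, 1736313 - 1735027 + 1 gives 1287 bp, and 1287 / 3 - 1 gives 428 aa, matching the Gene Length and Protein Length fields.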
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.629333 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000020483 |
Fosmid hitchhiking | No |
Fosmid clonability | decreased coverage |
Sequence |
Gene sequence | ATGAGCCTCA CCACGCGGCG TTCCCGCCTG GCGGCCAGCG CCCTTGCCGC GACAACCGCC ATCGGCGGCC TCGCGGCCTG CGGCAACGAC GACGAACCGG TCGCCGGCGA GAAGCCCGGC AAGCTGGTTC TCGAAACGTT CGGCGAGTTC GGCTACGAAG AGCTCATCAA GCAGTACGAG AAGGACACCG GCATCAAGAT CGAGCTGCGC AAGACCGCGC AGCTGGGCGA GTACCGACCC AAGCTGGTGC GCTACCTGGC CACCGGCAAG GGCGCGGGCG ACGTCGTCGC CCTGGAGGAG GGCATCCTCA ACGAGTTCAA GTCCAACCCG CGCAACTGGG TGGACCTCGC CCCGCTGGTC GACGACCACT CCACGGACTA CCTGCCCTGG AAGTGGGAGC TGGGCAAGGC GCCGGACGGC CGGCTGATGG GCTTGCCGAC CGACGTCGGC AGCCTGGCCG TCTGCTACCG CAAGGATCTG TTCGAGGCGG CCGGCCTACC CACCGACCGG GAGCAGGTGT CGGCGCTCTG GCCGGACTGG GACAGCTTCC TGCAGACCGG CCGCACGTAC AAGCAAGGTA GCGGTGGCAA GGCCTTCATC GACTCGGTCA CCGCCGTCGC CGACGCGGCG CTGTTCCAGC AGGGCGCCGA CCTCTTCTAC GACAAGGAGA ACAACATCAT CGCGGGGACC AGCCCCGCGG TGAAGACGGC CTGGGACACC GCGGTCTCGA TGGCCGACAT CTCCGCCAAG GTCGCCACCT GGTCACCGGA GTGGTCCGCC GGCTTCAAAC AGGGCAGCTT CGCGGCCACC TTCTGCCCAT CCTGGATGCT CGGCATCGTC GCGGACAACT CCGGGGAGGA GAACAAGGGC AAGTGGGACG TGGCGGCCGT GCCCGGCAGC GGCGGCAACT GGGGCGGCTC CTGGCTGGCC GTGCCGGAGC AGAGTGCCCA CCACGAGGAG GCGGCGAAGC TCGCCGAGTT CCTGACCAGC GCCGCCAGCC AGGTGGAGGC CTTCAAGGCC AAGGGCCCGC TGCCCACCCA CCTGGAGGCG TTGCAGGACG AGGCGTTCCT CAGCTACACC AACGAGTACT TCAGCGACGC GCCGACCGGC AAGATCTTCG GTGAGAGTGT CAGCAAGATC GAGCCGATTC ACCTGGGCCC GAAGCACCAG GCGGTGAAGG AAAACGCCTT CGGACCGGCC CTGCGGTCCT TCGAGAACGG ACAGGCCGGC GAGGACGAGG CCTGGCAGCA GTTCACCAAG GACGCCGAGA TCCAGGGCGC CTTCTGA
Protein sequence | MSLTTRRSRL AASALAATTA IGGLAACGND DEPVAGEKPG KLVLETFGEF GYEELIKQYE KDTGIKIELR KTAQLGEYRP KLVRYLATGK GAGDVVALEE GILNEFKSNP RNWVDLAPLV DDHSTDYLPW KWELGKAPDG RLMGLPTDVG SLAVCYRKDL FEAAGLPTDR EQVSALWPDW DSFLQTGRTY KQGSGGKAFI DSVTAVADAA LFQQGADLFY DKENNIIAGT SPAVKTAWDT AVSMADISAK VATWSPEWSA GFKQGSFAAT FCPSWMLGIV ADNSGEENKG KWDVAAVPGS GGNWGGSWLA VPEQSAHHEE AAKLAEFLTS AASQVEAFKA KGPLPTHLEA LQDEAFLSYT NEYFSDAPTG KIFGESVSKI EPIHLGPKHQ AVKENAFGPA LRSFENGQAG EDEAWQQFTK DAEIQGAF
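The relationship between the gene sequence, the GC content field, and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the method the database used: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares. Table 11 also permits alternative start codons, which this sketch does not handle; the gene here starts with ATG, so that does not matter for this record.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in TCAG order.
# Translation table 11 uses the same assignments ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and normalise case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
first_30 = "ATGAGCCTCA CCACGCGGCG TTCCCGCCTG"
print(translate(first_30))  # MSLTTRRSRL, the first ten residues of the protein
```

Passing the complete Gene sequence field to `gc_fraction` and `translate` would allow the GC content and Protein sequence fields to be compared against the nucleotide data.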