Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3967 |
Symbol | |
ID | 5705244 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 4505009 |
End bp | 4506355 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641273392 |
Product | extracellular solute-binding protein |
Protein accession | YP_001538748 |
Protein GI | 159039495 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0182057 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.154584 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGTCT TCGCCAGACC ACGCCAAGCC CTCGTGATCG CTGGCGCGCT CGGCCTGGCC ATCAGTGCCA CCGCCTGCGG TACCGGCGAC AACGACGGCA GCGGTAAGGC CGATTCCCCG GAATGCGCGG CATACCAGAA GTATCAGGGC CACGGCGGCG CCGAGGTCTC CATCTACGCG TCCATTCGTG ACGCGGAGGC AGACCTGCTC GAACAGTCGT GGGAGCAGTT CGCAGAATGC ACCGGCATCG AGATCGACTA CGAGGGCAGC GGCGAGTTCG AGGCGCAGCT CCAGGTGCGG GTCGACGGCG GCAACGCACC GGACATCGCC TTCGTTCCGC AGCCGGGCCT GCTGAGCCGA TTCGCGCAGG CCGGCAAGCT CAAGCCGGCA TCGGCCGAGA CCAAGGCGAT GGCCGAGGAA AACTACGCCG CCGACTGGCT GAAATACAGC ACCGTCGCGG GACAGTTCTA CGGCGCTCCG CTGGGGTCGA ACGTCAAGTC GTTCGTCTGG TACTCACCAA AGATGTTCCA GGAGCAGGGT TGGACCGTCC CGACCACCTG GGACGACCTG ATCAAACTCA GCGACTCGGC CGCGGCTGGC GGCATCAAGC CATGGTGTGT CGGCATCGAG TCCGGTGACG CCACCGGCTG GCCGGCCACC GACTGGATCG AGGACGTGCT GCTGCGGACG CAGACCCCCG AGGTCTACGA CCAGTGGACC ACGCACGGCA TACCGTTCAA CGACCAGCGT GTGGTGGACG CGGTCGACCG TGCCGGCACC ATCCTGCGAA ACGAGAAGTA CGTCAACGGC GGCTACGGCG GCGTGAAGAG CATCGCCACC ACGTCGTTCC AGGAGGGCGG TCTGCCGATC CTCCAGGGTG AGTGCGCCCT GCACCGGCAG GCGTCCTTCT ACGCCAACCA GTGGCCCGCG GACAGCCGGG TGGCCGAGGA CGGCGACGTC TTCGCGTTCT ACTTCCCGGC CATCGACCCG TCGAAGGGCA AGCCGGTGTT GGGAGGCGGC GAGTTCACCG TCGCTTTCGA CGACCGCCCC GAGGTCCAGG CGGTACAGAC GTACCTCGCC TCCGGCGAGT ACGCCAACAG TCGGGCCAAG CTGGGCAACT GGGTGTCGGC GAACAGGAAG CTCGACGTGG CCAACGTCGC GAACCCGATC GACAAGCTGT CGGTCGAGAT CCTTCAGGAC GAGAGCACGG TCTTCCGCTT CGATGGTTCC GACCTGATGC CCGCCGCCGT CGGCGCCGGG ACATTCTGGA AGGAGATGGT GTCCTGGATC AGCGGCAAGG ACACCAAGGC GGCCCTGGAC GCCATCGAGA GTTCCTGGCC CCGCTGA
|
Protein sequence | MAVFARPRQA LVIAGALGLA ISATACGTGD NDGSGKADSP ECAAYQKYQG HGGAEVSIYA SIRDAEADLL EQSWEQFAEC TGIEIDYEGS GEFEAQLQVR VDGGNAPDIA FVPQPGLLSR FAQAGKLKPA SAETKAMAEE NYAADWLKYS TVAGQFYGAP LGSNVKSFVW YSPKMFQEQG WTVPTTWDDL IKLSDSAAAG GIKPWCVGIE SGDATGWPAT DWIEDVLLRT QTPEVYDQWT THGIPFNDQR VVDAVDRAGT ILRNEKYVNG GYGGVKSIAT TSFQEGGLPI LQGECALHRQ ASFYANQWPA DSRVAEDGDV FAFYFPAIDP SKGKPVLGGG EFTVAFDDRP EVQAVQTYLA SGEYANSRAK LGNWVSANRK LDVANVANPI DKLSVEILQD ESTVFRFDGS DLMPAAVGAG TFWKEMVSWI SGKDTKAALD AIESSWPR
|
| |