Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_0747 |
Symbol | |
ID | 5707779 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 831413 |
End bp | 832714 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641270266 |
Product | extracellular solute-binding protein |
Protein accession | YP_001535657 |
Protein GI | 159036404 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00190936 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTCGCA CAGCCAAGGG GGTCGCCGTA CTTGCCTCCA CCACCCTTGC TCTGGCCCTC GCCGCCTGTG GCGGGGACAG TCAGGGCGAG GAGGAGCGCC CAGCGGCGGA TCCCGCAGCT ATGAAGGCAG AACTGACCTG GTGGGACACG TCAGACCCGA AGAACGAGGG TCCGGTGTTC CAGGAGCTGA TCGCACGGTT CAACGAGACC TACCCGAGTA TAAAGATCAA CTATCAGTCG GTCCCGTTCG GTGAGGCCCA GAACAAGTTC AAGACCGCCG CGCAGGCCAA GACCGGCGCA CCGGACATCC TGCGGGCGGA GGTGGCCTGG GTGCCGGAGT TTGCCTCGCT GGGCTACCTC TACGCGCTGG ATGGCTCCGA GCTGCTTGCC GACGAGGCGG ACTTCCTGGC TACCCCGCTC GCGTCGAACA AGTACGACGG CAAGACCTAC GGCGTCCCGC AGGTGACCGA CACGCTGTCG CTCATGTACA ACAAGGAACT GTTGGCCGAG GCCGGCGTCG CCGCAGCGCC GACGACCTGG GCCGAGCTGA AGACCGCGGC CCAGGCCGTC ACGCAGAAGA CCGGTGCCGA GGGCCTCTAC GTCAATCCGG CCGGCTACTT CCTGCTGCCC TTCATGTACG GCGAGGGCGG CGACCTGGTC GACGTCGAGG CCAAGAAGAT CACCGTTGGC TCGGACCGTA ACGTCGCCGG GCTGAAGATC GCCAAGGACC TGATCGACAG CGGTGCCGCC GTCAAGCCCT CCGCGAACGA TTCCTACGGG ACGATGATGA CGCTCTTCAA GGAGCAGCAG GTCGCCATGA TCATTAACGG TCCGTGGGAG GTCAACAACG TTACGCAGGC GCCGAGCTTC GGTGGCGCGG AGAACCTCGG CATCGCTCCG GTCCCGGGCG GCTCGGCCAG GGCCGGCGGC CCGGTCGGGG GGCACAACTA CACCATCTGG TCCGGGATGC CACAGGAGAA GGTCGACGCC GCGGTCGCGT TCGTGGCCTT CATGAGTTCC ACCGAGTCGC AGGCATTCCT CTCCGAAAAA CTCGGCCTGC TGCCGACCCG CAAGTCGGCC TACGACCTCG ACGCGGTGCG GAACAACCCG ATCGTCACCG CCTACCAGCC CGCCGTGGAG GCCGCCGTGG GCCGTCCCTG GATTCCCGAG GCCGGCCAGT TCTTCGAACC GCTGGACCAG ATGGCCACCG AGGTTCTGAT CCAGAACCGG GATCCGAAGG CCGCGCTCGA CGCTGTCGCC AAGAGGTACC AGGCGGAGGT CGTCACCTCG TTCGGGCTCT GA
|
Protein sequence | MSRTAKGVAV LASTTLALAL AACGGDSQGE EERPAADPAA MKAELTWWDT SDPKNEGPVF QELIARFNET YPSIKINYQS VPFGEAQNKF KTAAQAKTGA PDILRAEVAW VPEFASLGYL YALDGSELLA DEADFLATPL ASNKYDGKTY GVPQVTDTLS LMYNKELLAE AGVAAAPTTW AELKTAAQAV TQKTGAEGLY VNPAGYFLLP FMYGEGGDLV DVEAKKITVG SDRNVAGLKI AKDLIDSGAA VKPSANDSYG TMMTLFKEQQ VAMIINGPWE VNNVTQAPSF GGAENLGIAP VPGGSARAGG PVGGHNYTIW SGMPQEKVDA AVAFVAFMSS TESQAFLSEK LGLLPTRKSA YDLDAVRNNP IVTAYQPAVE AAVGRPWIPE AGQFFEPLDQ MATEVLIQNR DPKAALDAVA KRYQAEVVTS FGL
|
| |