Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1080 |
Symbol | |
ID | 5704071 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 1212359 |
End bp | 1213639 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641270595 |
Product | extracellular solute-binding protein |
Protein accession | YP_001535979 |
Protein GI | 159036726 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00103993 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGAACAC GCTTGGCCGG GCTTGCCCTG TCCTCGGCCG TGTTGACGGT GCTCGCGGGA TGCGGGCTGT CCGACACGGA CTCCAGTGAT GAACAGGTCA CCACCACCGG CGAGATCAAA GGCACCGTGA CCCTGCAGAC CTGGGCCCTG AAGCCCAAGT TCACCGACTA CATGCAGGGT GTGATCGACG GTTTCGAGGA GAAGTACCCG GGCACGACCG TCGAGTGGCT GGACCAGCCC GGCGAGGGGT ACTCGGAGAA GGTACTCAGC CAGGCGGCGA GTGGTCAGCT GCCCGACGTG ACGAACCTCC CGCCGGACTT CGCGCTGCCC CTCGCTGAGC AGGGGCTGCT GCTCGATGTC GACCAGGCCG ACGACAAGCT CCGCGACGAG TACGTCGACG GCGCCGTCGC CTCCTACGAG TTCGCCGGTC AGACCGGGGT CTACGGCTAC CCGTGGTACC TCAACACCGA CGTGAACTAC TGGAACTCGG AGTTGCTGTC CCAGTACGGG CTGGACGTAG CCAAGCTCCC GACCACGGTC GAGGAACTGG TCGCCCAGGC CCGGATCGTC AAGGAGAAGT CCGACGGCAA GATCCACCTG ATGAGCCGGA AGCCCGGCGT CGAGGACCTG ACCCAGGCCG GAGTCGACAT CCTCTCCTCG GACGGCAAGA AGTTCGTCTT CAACACCCCC GAGGCCGTGG CGGTTCTCGA CACCTACCGG GACGCGTTCG CCGAAGGGCT CCTGCCCCGC AACGTGCTGA CCGACGCCTA CCTCGGCAAC ATGGAGCTGT TCAAGAAGCA GCAGGTCGCC TGGACCACCG GCGGCGGTAA CTCGATCAAC GACATCAAGG TCGACAACCC GACGCTGGCC GAGAAGGTTG TCGCGTCACC GACGATCGGC ACCCCGCCGC TGTACACCCA GGGTCTGTCG GTATCGAAGC GGAGCGAGAA CCTGCCCACG GCCATCGCTC TGGCCCGCTG GGTGACCAGC CCGGAGAATC AGGCGGCGTT CGCCGAGGTC GTGCCGGGCA TCTTCCCGAG CACGATCGCC TCGGCGGAGG ACCCGCAGTT CAGCGTGAGC GACGGCAGCA ACGTCGGGGA CGCGAAGAAG ATCGCCTTCA CCTCGTTGGC CGAGGCGCAG CTGTTGAAGC CGGTCGTCGT CGACCAGGCC ATGGACGACT TCATCAAGCA GCAGTTCTCC CTGGCGATCA GTGGGGAGAT CACCTCGAAG GAGGCCCTGG ACAAGGCGGT CGACAAGTGC AACGAGCTGC TCAACGACTG A
|
Protein sequence | MRTRLAGLAL SSAVLTVLAG CGLSDTDSSD EQVTTTGEIK GTVTLQTWAL KPKFTDYMQG VIDGFEEKYP GTTVEWLDQP GEGYSEKVLS QAASGQLPDV TNLPPDFALP LAEQGLLLDV DQADDKLRDE YVDGAVASYE FAGQTGVYGY PWYLNTDVNY WNSELLSQYG LDVAKLPTTV EELVAQARIV KEKSDGKIHL MSRKPGVEDL TQAGVDILSS DGKKFVFNTP EAVAVLDTYR DAFAEGLLPR NVLTDAYLGN MELFKKQQVA WTTGGGNSIN DIKVDNPTLA EKVVASPTIG TPPLYTQGLS VSKRSENLPT AIALARWVTS PENQAAFAEV VPGIFPSTIA SAEDPQFSVS DGSNVGDAKK IAFTSLAEAQ LLKPVVVDQA MDDFIKQQFS LAISGEITSK EALDKAVDKC NELLND
|
| |