Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_0393 |
Symbol | |
ID | 5705652 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 453427 |
End bp | 454731 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641269918 |
Product | extracellular solute-binding protein |
Protein accession | YP_001535313 |
Protein GI | 159036060 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0128577 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTGCCA CCAACCGCCG CCGCCTCGCC GCGGTCGCCC TCGCAGCCGC CACCACCCTG CTCGTCACCA CCGCCTGCGG AAACGGCGAC GACGCGGGCG ACGGCACGAT CACACTCACC GTCGACGTCT TCGGTCAATT CGGCTACGAC GAGCTGTACC AGCAGTACAT GGACGACAAC CCCGGTGTGA CGATCGTCGA GCGGGGCACC GGTGGCAACC TCGACGAGTA CTCCCCCAAG CTCACCCAGT GGCTGGCCGC CGGAAAAGGT GCCGGCGACA TCGTCGCCAT CGAGGAGGGC CTGCTGGTCG AATACAAGGC CAACCCGGAC AACTTCGTCA ACCTGCTCGA CCACGACGCG GGCGAGTTGC AGGGCAACTT CCTGGACTGG AAGTGGAACG CCGGGCTCAC CGCGGACGGT GCGCACCTGA TCGGGCTCGG CACCGATGTC GGCGGCATGG CGATGTGCTA CCGCACGGAC CTGTTCGCCG AGGCCGGGCT GCCAACCGAA CGGGACGCCG TCTCCGCACT CTGGCCGACC TGGGCGGACT ACATCCGCGT CGGCGAGAGG TTCACCGCCG CGAAGACCGG GGCGGCGTTC CTGGACGCCG CCACCAACAC GTTCAGCACG ATCGTGTTGC AGACGGCTGG CAACACGAAC GGCTACCACT ACTACGACAC CAACGACGAG CTCGTCGTGG ACACCAACCC GGCCGTGCGG CAGGCGTGGG ACACCACCAT GGACATCATC GACTCCGGCC TGTCCGGAAG GTACAGCGCG TGGTCGGAGG AGTGGGTCTC CGCCTTCAAA CAGGCCACGT TCGCCACCAT CGCCTGCCCC GCCTGGATGA CCGGCGTCAT CGAGGGCAAC ACCGGAACCG CGGGCGCGGG CAAGTGGGAC ATCGCCCGGG TGCCCGGCGA CGGTGGCAAC TGGGGCGGCT CGTACCTCGC CGTGCCGAAA CAAAGCCGGC ACCAGGCGGA AGCCGTCAAA CTGGCCATGT TCCTGACCAG CGCCGAAGGG CAGATCGGGG CGTTCAAGGC CAAGGGCCCG CTGCCCTCGT CGCCGCAGGC ACTCGGCGAC CCCGCGGTCG CCGAAGCCAC CAACGCGTAC TTCTCCGACG CCCCCGTCGG GCAGATCTTC GCCGCCGGCG CCAAGGGGTT GAAGCCGGTC TACATGGGCC CGAAGAACCA GGCCGTCCGC ACCGAGGTGG AAAACGCGGT CCGCACCGTC GAGCTCGGTC AGCGGACCCC CGAGCAGGGC TGGAGCGACG CGGTGACGAA CGCGAAGAAG GCCGCCGCCA AGTAG
|
Protein sequence | MGATNRRRLA AVALAAATTL LVTTACGNGD DAGDGTITLT VDVFGQFGYD ELYQQYMDDN PGVTIVERGT GGNLDEYSPK LTQWLAAGKG AGDIVAIEEG LLVEYKANPD NFVNLLDHDA GELQGNFLDW KWNAGLTADG AHLIGLGTDV GGMAMCYRTD LFAEAGLPTE RDAVSALWPT WADYIRVGER FTAAKTGAAF LDAATNTFST IVLQTAGNTN GYHYYDTNDE LVVDTNPAVR QAWDTTMDII DSGLSGRYSA WSEEWVSAFK QATFATIACP AWMTGVIEGN TGTAGAGKWD IARVPGDGGN WGGSYLAVPK QSRHQAEAVK LAMFLTSAEG QIGAFKAKGP LPSSPQALGD PAVAEATNAY FSDAPVGQIF AAGAKGLKPV YMGPKNQAVR TEVENAVRTV ELGQRTPEQG WSDAVTNAKK AAAK
|
| |