Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_2999 |
Symbol | |
ID | 5707609 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 3406849 |
End bp | 3408108 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641272446 |
Product | extracellular solute-binding protein |
Protein accession | YP_001537814 |
Protein GI | 159038561 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.662901 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000395876 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGATGC GTCATTCGGT TGCCGCAGTG GCAGCCGCAT CGGCGATGCT CCTAGCCGCG TGCGGGGGTA GCTCCGACGA CACCGCCGGC GGGTCGGTCG ACTCGCTGAA GTTCTACAAC GACAAGGCGG CCTGGAAGCC GCAGTTCGAG GAGGTCAGCA AGGTCTCCCA GGACGAGATC GGCCTCGCGC TCGAGCCGGT TGGCTACTCC GAAGCCAACC AGTACGCCGC GTTCATCCGG TCCTCGTTCC GGACCAAGGA AAAGCCCGAC CTGTTCACCT GGCATACAGG CAAGGAACTT GAGGACCTCG TCAGACAGGG ACTGGTCGCC GAGACCACGT CTCTGTGGGA CAAGGCCATC GCCGATGGGG ACGTCCCCGA AGATCTCCGT GAGTACTTCA CGGTCGACGG CAAACAGTAC TGCGTCCCCC TGCAGGCCGG CTACTGGGTG ATGTTCTACA ACAAGCGCAT TTTCGACCAG GAGGGCATCA CGCCCCCGAG CACGTGGGCG CAGCTCGAGG CCGCTGCCGA GAGGCTCAAG GGCGCGGGCG TCACGCCGTT TCACCAGACG AACGTCCTGT TCACGTTCTC GTGGTTCCAG ACCCTCCTGA CAGGCACCGA CCCGGAGCTC TACGAGGCAC TGTCCACGGG TGAGGCGAAG TACACCGACC CTGGCGTGGT GAGCGTCATG GACAAGTGGC GGGCCATGCT CGATAAGGGG TACTTCAGCG ATCCGGGCTC CAAGACCGAC CCGCAGGTGA TGCTCAAGAA CGGCGACGTC GCCATGATCA ACATGGGTAC CTGGTTCAAC GGCAACCTCA AGTCAGTCGG CATGGAGATC GACAAGGACT ACGGGATGTT CGTCATCCCC AACGTCGACC CGTCGCTCGC CACCAGGCCG ATGGTCGTCG AGGCCGGCCC GATGTGCACT GCTGCCGACG CGACGCACCG CGAGGAGGCC GAGAGGTACT CGGCGTGGTG GTTCACCCCA CCGGCGCAGA CCGCCTGGGC GAACGCTCGC GGTGAACTCT CGTTCAACCC GAGGGCCGAG GTCAGCGACG AGACCCTCGC CAGCCTCAGC GACAAGATCA ACAAGGGTGA CTACAGGCTG ATGAACCGCT ACTTCGAGGC CGCACCCGTG CCGGTGCTGA CCGCCGCGCT CGACGGATTC GGCGCCTTCG TCACCAAGCC CGGCGACCCG ATGCCGGTGC TCAAGGAGGT GCAGGCGGCC GCCGACGCCT ACTGGGCCGA GCAGGGGTAG
|
Protein sequence | MKMRHSVAAV AAASAMLLAA CGGSSDDTAG GSVDSLKFYN DKAAWKPQFE EVSKVSQDEI GLALEPVGYS EANQYAAFIR SSFRTKEKPD LFTWHTGKEL EDLVRQGLVA ETTSLWDKAI ADGDVPEDLR EYFTVDGKQY CVPLQAGYWV MFYNKRIFDQ EGITPPSTWA QLEAAAERLK GAGVTPFHQT NVLFTFSWFQ TLLTGTDPEL YEALSTGEAK YTDPGVVSVM DKWRAMLDKG YFSDPGSKTD PQVMLKNGDV AMINMGTWFN GNLKSVGMEI DKDYGMFVIP NVDPSLATRP MVVEAGPMCT AADATHREEA ERYSAWWFTP PAQTAWANAR GELSFNPRAE VSDETLASLS DKINKGDYRL MNRYFEAAPV PVLTAALDGF GAFVTKPGDP MPVLKEVQAA ADAYWAEQG
|
| |