Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Synpcc7942_1722 |
Symbol | |
ID | 3775422 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus elongatus PCC 7942 |
Kingdom | Bacteria |
Replicon accession | NC_007604 |
Strand | - |
Start bp | 1794143 |
End bp | 1795228 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637800161 |
Product | thiosulphate-binding protein |
Protein accession | YP_400739 |
Protein GI | 81300531 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1613] ABC-type sulfate transport system, periplasmic component |
TIGRFAM ID | [TIGR00971] sulfate/thiosulfate-binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCTTCC GTTCCTTCCT CCCGCTTCGG CCCGGTATCA AGCGATCGCT TGCAGGTGGT TTGGCCGCCG CAATTGCAGC AGGTAGCTTT GCGACTACGG CTGGGGCGCG CAGTGCCCAA ACCAGTGCCG CAGGGGCTAC TCCTCAGCAA GTTGCTCAAG CCAAGCAGGA GCTGGTTCTT GTGTCCTATG CAGTCACCAA GGCTGCCTAC GATCGCATCA TTCCCAAGTT CACGGCCAAG TGGAAAAAAG AAAAAGGGCA AGACGTCACC ATTCGCGGTA GTTACGGTGG GTCTGGCTCC CAAACTCGGG CCATCCTCGA TGGCTTGGAA GCTGACATCG CGGCTTTGGC CCTGGAATCA GACACGGCTC GCCTAGAACG AGCGGGGCTG ATTGGCAAAA ACTGGCAAAG ACGACTGCCT AATAACTCCA ACCTCTCCAG TTCGGTGGTA GTTATTTGTA CTCGTAAAGG GAATCCCAAA AAGATCAAAG GCTGGGCTGA CTTAGCGAAA CCTGGGGTTC GGATCGTGAC AGCCAACCCC AAAACCTCTG GTGGGGCTCG CTGGAACTTC CTCGCGCTCT ACGGTTCTGT CGCTAAGAAT GGGGGCACCG ATAAACAAGC CTTTGACTTT GTGAAGAAAG TCTATGACAA CGTCCCTGTC TTGGCTAAGG ACGCTCGGGA ATCCACTGAT ATCTTCTACA AGAAGAATCA GGGCGATGTG CTCCTGAACT ACGAGAATGA AGTGATTCTG GCGCGCCTCA ATGGCGAAGA TGTGGGAACC TGCATCACGC CGCAGGTCAA CATTGCGATC GAAACGCCGA TCGCGGTGGT CGATAAGGTC GCCAACAAAC GAAAGACTAA GGCGATCGCT GATGCCTTTA CTCGCTTTGT CTTTACCCCT GAGGCGCAAG AGGAGCTCGC TAAGGTTGGT TTCCGGCCGG CTAATGCCGC TGTCGCTCAA AAATATCGCA AGAATTTCCC CCCTCTGACC AAGCTCTACA ACATCAAGAG CTTTGGTGGC TGGGCTGCTG CTGACAAGAA GTTCTTTGCT GATGGCGGCG TCTTCGACCA AATCCAAGGT CGTTAA
|
Protein sequence | MAFRSFLPLR PGIKRSLAGG LAAAIAAGSF ATTAGARSAQ TSAAGATPQQ VAQAKQELVL VSYAVTKAAY DRIIPKFTAK WKKEKGQDVT IRGSYGGSGS QTRAILDGLE ADIAALALES DTARLERAGL IGKNWQRRLP NNSNLSSSVV VICTRKGNPK KIKGWADLAK PGVRIVTANP KTSGGARWNF LALYGSVAKN GGTDKQAFDF VKKVYDNVPV LAKDARESTD IFYKKNQGDV LLNYENEVIL ARLNGEDVGT CITPQVNIAI ETPIAVVDKV ANKRKTKAIA DAFTRFVFTP EAQEELAKVG FRPANAAVAQ KYRKNFPPLT KLYNIKSFGG WAAADKKFFA DGGVFDQIQG R
|
| |