Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Synpcc7942_1686 |
Symbol | |
ID | 3775385 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus elongatus PCC 7942 |
Kingdom | Bacteria |
Replicon accession | NC_007604 |
Strand | + |
Start bp | 1753022 |
End bp | 1754047 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637800124 |
Product | thiosulphate-binding protein |
Protein accession | YP_400703 |
Protein GI | 81300495 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1613] ABC-type sulfate transport system, periplasmic component |
TIGRFAM ID | [TIGR00971] sulfate/thiosulfate-binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.863569 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.139145 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGTTA ATCGCCGTGC AGTGCTGACC CTAGCAATTT TCGGGAGTCT GACTGCTTTG CCATCGCTGC TGCAATCTCA AGCAAATGCT CAGTCTCGGC CGGTTGAACT CACGTTAGTG AGCTATGCCG TTGCTAAGCC TGTCTTTGAG AAACTGATTC CTGAATTCCA AAAAGAATGG AAAGCCAAGA CGGGGCAGAC CGTCACTTTT AAGGAATCCT ACGGTGCCTC TGGAGCCCAA ACCCGCGCTG TCCTCGGCGG ATTAGAGGCC GACATCTTGG CTCAGAACAT CACGAGTAAT GTCACACCAC TGGTCGAGAA GGGTTTAGTT CGCTCCAATT GGAATACCCG CCTCCCCAAC AGTGCTTCGC CAGCAACGAC AGTCATGGCA ATTATCGTGC GGCCGGGCAA TCCTAAAAAA ATTCAAACCT GGACCGATTT AGCAAAGCCT GACGTCAGCC TTGTTTCGCT CAACCCCAAA ACTTCCGGCA ATGCGCGCTG GGGAATTTTG GCGGGTTATG GCTCTGTTTT GAAAGCGCAA GGTGCCACCC CAGCTCGCAA TTTTCTATTT AGCTTTGCCA AAAATATCAA GACCCAAGTT AATTCTGGGC GCGAAGCAAC GGATGCCTTT GTTAAAAATC GCGTTGGAGA TGCCCTGATT AATTTTGAGA ACGAAATCAT TGTGACCAAT GAAGCCGTTC CGAGAGATTT TCCCTACGTT GTTCCCTCAG CGAACGTCCG CGTCGACTTC CCGGTCACGG TGATTGATAC TGTTGTCGAT AAGCGCGGTA CGCGGCGCGT GGCTGAGGCT TTTACCCAAT TTCTCTTTAC CCCCAAAGCC CAAGCCATCT ATGCCGAAGC TGGCTATCGC CCTTTTGATC GCCAAGTTTT CCAACGCTAC GCCAAGCAAT TCAAGCCCGT GCAGCAGTTG CGCACGATCG CAGATTTTGG GGGCTGGCCG ACCATCGACA AAACCCTCTA CGCCGATGGT GCGCTGTTTG ATCAGGCCCA AAAAGCCGCT CGCTAA
|
Protein sequence | MKVNRRAVLT LAIFGSLTAL PSLLQSQANA QSRPVELTLV SYAVAKPVFE KLIPEFQKEW KAKTGQTVTF KESYGASGAQ TRAVLGGLEA DILAQNITSN VTPLVEKGLV RSNWNTRLPN SASPATTVMA IIVRPGNPKK IQTWTDLAKP DVSLVSLNPK TSGNARWGIL AGYGSVLKAQ GATPARNFLF SFAKNIKTQV NSGREATDAF VKNRVGDALI NFENEIIVTN EAVPRDFPYV VPSANVRVDF PVTVIDTVVD KRGTRRVAEA FTQFLFTPKA QAIYAEAGYR PFDRQVFQRY AKQFKPVQQL RTIADFGGWP TIDKTLYADG ALFDQAQKAA R
|
| |