Gene Synpcc7942_1686 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_1686 
Symbol 
ID3775385 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp1753022 
End bp1754047 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content52% 
IMG OID637800124 
Productthiosulphate-binding protein 
Protein accessionYP_400703 
Protein GI81300495 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1613] ABC-type sulfate transport system, periplasmic component 
TIGRFAM ID[TIGR00971] sulfate/thiosulfate-binding protein 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.863569 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.139145 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGTTA ATCGCCGTGC AGTGCTGACC CTAGCAATTT TCGGGAGTCT GACTGCTTTG 
CCATCGCTGC TGCAATCTCA AGCAAATGCT CAGTCTCGGC CGGTTGAACT CACGTTAGTG
AGCTATGCCG TTGCTAAGCC TGTCTTTGAG AAACTGATTC CTGAATTCCA AAAAGAATGG
AAAGCCAAGA CGGGGCAGAC CGTCACTTTT AAGGAATCCT ACGGTGCCTC TGGAGCCCAA
ACCCGCGCTG TCCTCGGCGG ATTAGAGGCC GACATCTTGG CTCAGAACAT CACGAGTAAT
GTCACACCAC TGGTCGAGAA GGGTTTAGTT CGCTCCAATT GGAATACCCG CCTCCCCAAC
AGTGCTTCGC CAGCAACGAC AGTCATGGCA ATTATCGTGC GGCCGGGCAA TCCTAAAAAA
ATTCAAACCT GGACCGATTT AGCAAAGCCT GACGTCAGCC TTGTTTCGCT CAACCCCAAA
ACTTCCGGCA ATGCGCGCTG GGGAATTTTG GCGGGTTATG GCTCTGTTTT GAAAGCGCAA
GGTGCCACCC CAGCTCGCAA TTTTCTATTT AGCTTTGCCA AAAATATCAA GACCCAAGTT
AATTCTGGGC GCGAAGCAAC GGATGCCTTT GTTAAAAATC GCGTTGGAGA TGCCCTGATT
AATTTTGAGA ACGAAATCAT TGTGACCAAT GAAGCCGTTC CGAGAGATTT TCCCTACGTT
GTTCCCTCAG CGAACGTCCG CGTCGACTTC CCGGTCACGG TGATTGATAC TGTTGTCGAT
AAGCGCGGTA CGCGGCGCGT GGCTGAGGCT TTTACCCAAT TTCTCTTTAC CCCCAAAGCC
CAAGCCATCT ATGCCGAAGC TGGCTATCGC CCTTTTGATC GCCAAGTTTT CCAACGCTAC
GCCAAGCAAT TCAAGCCCGT GCAGCAGTTG CGCACGATCG CAGATTTTGG GGGCTGGCCG
ACCATCGACA AAACCCTCTA CGCCGATGGT GCGCTGTTTG ATCAGGCCCA AAAAGCCGCT
CGCTAA
 
Protein sequence
MKVNRRAVLT LAIFGSLTAL PSLLQSQANA QSRPVELTLV SYAVAKPVFE KLIPEFQKEW 
KAKTGQTVTF KESYGASGAQ TRAVLGGLEA DILAQNITSN VTPLVEKGLV RSNWNTRLPN
SASPATTVMA IIVRPGNPKK IQTWTDLAKP DVSLVSLNPK TSGNARWGIL AGYGSVLKAQ
GATPARNFLF SFAKNIKTQV NSGREATDAF VKNRVGDALI NFENEIIVTN EAVPRDFPYV
VPSANVRVDF PVTVIDTVVD KRGTRRVAEA FTQFLFTPKA QAIYAEAGYR PFDRQVFQRY
AKQFKPVQQL RTIADFGGWP TIDKTLYADG ALFDQAQKAA R