Gene Synpcc7942_1722 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_1722 
Symbol 
ID3775422 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp1794143 
End bp1795228 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content54% 
IMG OID637800161 
Productthiosulphate-binding protein 
Protein accessionYP_400739 
Protein GI81300531 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1613] ABC-type sulfate transport system, periplasmic component 
TIGRFAM ID[TIGR00971] sulfate/thiosulfate-binding protein 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCTTCC GTTCCTTCCT CCCGCTTCGG CCCGGTATCA AGCGATCGCT TGCAGGTGGT 
TTGGCCGCCG CAATTGCAGC AGGTAGCTTT GCGACTACGG CTGGGGCGCG CAGTGCCCAA
ACCAGTGCCG CAGGGGCTAC TCCTCAGCAA GTTGCTCAAG CCAAGCAGGA GCTGGTTCTT
GTGTCCTATG CAGTCACCAA GGCTGCCTAC GATCGCATCA TTCCCAAGTT CACGGCCAAG
TGGAAAAAAG AAAAAGGGCA AGACGTCACC ATTCGCGGTA GTTACGGTGG GTCTGGCTCC
CAAACTCGGG CCATCCTCGA TGGCTTGGAA GCTGACATCG CGGCTTTGGC CCTGGAATCA
GACACGGCTC GCCTAGAACG AGCGGGGCTG ATTGGCAAAA ACTGGCAAAG ACGACTGCCT
AATAACTCCA ACCTCTCCAG TTCGGTGGTA GTTATTTGTA CTCGTAAAGG GAATCCCAAA
AAGATCAAAG GCTGGGCTGA CTTAGCGAAA CCTGGGGTTC GGATCGTGAC AGCCAACCCC
AAAACCTCTG GTGGGGCTCG CTGGAACTTC CTCGCGCTCT ACGGTTCTGT CGCTAAGAAT
GGGGGCACCG ATAAACAAGC CTTTGACTTT GTGAAGAAAG TCTATGACAA CGTCCCTGTC
TTGGCTAAGG ACGCTCGGGA ATCCACTGAT ATCTTCTACA AGAAGAATCA GGGCGATGTG
CTCCTGAACT ACGAGAATGA AGTGATTCTG GCGCGCCTCA ATGGCGAAGA TGTGGGAACC
TGCATCACGC CGCAGGTCAA CATTGCGATC GAAACGCCGA TCGCGGTGGT CGATAAGGTC
GCCAACAAAC GAAAGACTAA GGCGATCGCT GATGCCTTTA CTCGCTTTGT CTTTACCCCT
GAGGCGCAAG AGGAGCTCGC TAAGGTTGGT TTCCGGCCGG CTAATGCCGC TGTCGCTCAA
AAATATCGCA AGAATTTCCC CCCTCTGACC AAGCTCTACA ACATCAAGAG CTTTGGTGGC
TGGGCTGCTG CTGACAAGAA GTTCTTTGCT GATGGCGGCG TCTTCGACCA AATCCAAGGT
CGTTAA
 
Protein sequence
MAFRSFLPLR PGIKRSLAGG LAAAIAAGSF ATTAGARSAQ TSAAGATPQQ VAQAKQELVL 
VSYAVTKAAY DRIIPKFTAK WKKEKGQDVT IRGSYGGSGS QTRAILDGLE ADIAALALES
DTARLERAGL IGKNWQRRLP NNSNLSSSVV VICTRKGNPK KIKGWADLAK PGVRIVTANP
KTSGGARWNF LALYGSVAKN GGTDKQAFDF VKKVYDNVPV LAKDARESTD IFYKKNQGDV
LLNYENEVIL ARLNGEDVGT CITPQVNIAI ETPIAVVDKV ANKRKTKAIA DAFTRFVFTP
EAQEELAKVG FRPANAAVAQ KYRKNFPPLT KLYNIKSFGG WAAADKKFFA DGGVFDQIQG
R