Gene Synpcc7942_1542 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_1542 
Symbol 
ID3774966 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp1599353 
End bp1600381 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content53% 
IMG OID637799975 
Productiron-stress chlorophyll-binding protein 
Protein accessionYP_400559 
Protein GI81300351 
COG category 
COG ID 
TIGRFAM ID[TIGR03041] chlorophyll a/b binding light-harvesting protein 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.398653 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0103257 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGACTT ACAACAACCC AGAAGTCACT TACGACTGGT GGGCTGGCAA TGCCCGCTTT 
GCCAATCTCT CGGGTCTCTT CATTGCGGCT CACGTGGCCC AAGCAGCACT GATCATGTTT
TGGGCCGGTG CTTTCACGTT GTACGAAATC TCTTGGCTCA CTGCAGACCA GTCCATGGGT
GAGCAAGGCC TCATTCTGCT GCCGCATCTA GCCACCCTTG GATTAGGTGT GGGCGATGGC
GGACAGGTGA CAGACACTTA TCCACTCTTT GTCGTGGGTG CCGTTCATCT GATCGCCTCC
GCAGTCTTGG GCGCGGGTGC CCTATTCCAC ACATTCCGAG CACCCAGTGA TTTGGCAGCT
GCATCGGGAG CTGCTAAGCG GTTCCACTTC GACTGGAATG ATCCCAAACA ACTAGGCCTC
ATTCTGGGAC ACCACTTGCT GTTCCTCGGG GTTGGAGCAT TGCTGCTGGT GGCAAAGGCA
ACAACTTGGG GTGGCCTATA CGACGCAGCC AGTCAGACAG TCCGTTTGGT AACAGAACCG
ACGCTTAATC CAGCGGTGAT TTATGGTTAT CAGACTCATT TCGCCAGCAT TGATAACCTT
GAAGACTTAG TCGGTGGCCA TGTTTATGTT GGCGTCATGC TAATTGCCGG AGGCATTTGG
CACATTTTGG TTCCGCCATT TCAATGGACT AAAAAAGTCT TGATCTACTC TGGCGAAGCA
ATTCTGTCGT ACTCCTTGGG TGGCATCGCT CTCGCCGGTT TTGTCGCTGC TTACTTCTGC
GCCGTCAACA CCCTAGCGTA CCCCGTGGAA TTCTACGGTG CGCCGCTGGA AATCAAATTA
GGTGTCACTC CCTACTTTGC AGATACGGTT CAACTGCCCT TTGGTGCCCA TACGCCTCGT
GCTTGGCTAT CCAATGCCCA CTTCTTCTTG GCTTTCTTCT GCCTACAAGG CCATCTCTGG
CATGCTTTAC GGGCAATGGG CTTCGACTTT CGTCGAGTTG AAAAAGCACT CAGCTCTGTA
GAAGCCTAA
 
Protein sequence
MQTYNNPEVT YDWWAGNARF ANLSGLFIAA HVAQAALIMF WAGAFTLYEI SWLTADQSMG 
EQGLILLPHL ATLGLGVGDG GQVTDTYPLF VVGAVHLIAS AVLGAGALFH TFRAPSDLAA
ASGAAKRFHF DWNDPKQLGL ILGHHLLFLG VGALLLVAKA TTWGGLYDAA SQTVRLVTEP
TLNPAVIYGY QTHFASIDNL EDLVGGHVYV GVMLIAGGIW HILVPPFQWT KKVLIYSGEA
ILSYSLGGIA LAGFVAAYFC AVNTLAYPVE FYGAPLEIKL GVTPYFADTV QLPFGAHTPR
AWLSNAHFFL AFFCLQGHLW HALRAMGFDF RRVEKALSSV EA