Gene Synpcc7942_0697 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_0697 
Symbol 
ID3775867 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp689222 
End bp690748 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content58% 
IMG OID637799109 
Productphotosystem II core light harvesting protein 
Protein accessionYP_399716 
Protein GI81299508 
COG category 
COG ID 
TIGRFAM ID[TIGR03039] photosystem II chlorophyll-binding protein CP47 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000016119 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.0946335 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGACTAC CCTGGTACCG TGTCCACACG GTCGTCCTCA ATGATCCGGG ACGACTGATT 
GCAGTGCACT TGATGCACAC TGCCCTCGTG GCAGGTTGGG CAGGTTCGAT GGCTTTATAC
GAACTTGCCA TTTTTGATCC TTCTGATGCC GTGCTGAACC CCATGTGGCG GCAAGGCATG
TTCGTGTTGC CGTTCATGGC GCGTTTGGGC GTCACCCAAT CTTGGGGTGG CTGGAGCATC
ACCGGCGAAA CCGCCGTGGA TCCTGGCTAT TGGAGCTTTG AAGGCGTCGC GATCGCCCAC
ATCGTACTGT CGGGTCTGCT GTTCCTCGCA GCAGTCTGGC ACTGGGTCTA CTGGGACCTC
GAACTCTTTA CCGATCCCCG CACCGGCGAA CCGGCCTTGG ACCTGCCAAA AATGTTTGGC
ATCCACCTGT TCCTCTCCGG TCTTCTTTGC TTCGGCTTCG GTGCTTTCCA CCTGTCTGGC
CTCTGGGGCC CGGGGATGTG GGTCTCCGAT CCCTACGGCT TGACCGGCCA TGTCCAACCA
GTTGCCCCGG CCTGGGGTCC TGAAGGCTTC AACCCCTTCA ATCCGGGTGG CATTGTGGCT
CACCACATTG CAGCCGGTGT CGTTGGCATC GTCGCAGGCC TCTTCCACCT GACGGTTCGT
CCCCCCGAGC GCCTCTACAA AGCGCTGCGG ATGGGCAACA TCGAAACCGT CTTGTCGAGC
TCCTTGGCAG CAGTCTTCTT CGCTGCTTTT GTGGTCGCTG GCACGATGTG GTACGGCAAC
GCTGCCACGC CAGTCGAACT GTTTGGCCCA ACTCGCTACC AGTGGGACCA AGGCTACTTC
CGTCAGGAAA TTGCCCGCCG GGTTGATACG GCTGTCGCCA GTGGCGCTTC TCTAGAGGAA
GCTTGGAGCT CCATTCCTGA AAAACTGGCC TTCTATGACT ACGTCGGCAA CAGCCCTGCT
AAAGGTGGCT TGTTCCGTAC CGGTCAGATG AACAAAGGTG ACGGGATTGC CCAAGGCTGG
CTCGGCCACG CTGTCTTCAA GGACAAAAAT GGCGATGTGC TCGACGTCCG TCGCTTGCCG
AACTTCTTCG AGAACTTCCC GATCGTCTTG ACTGACAGCA AAGGTGCTGT GCGGGCAGAC
ATTCCTTTCC GTCGTGCTGA AGCGAAATTC AGCTTCGAGG AAACCGGAAT TACGGCTAGC
TTCTACGGCG GTTCTCTGAA TGGCCAAACC ATCACTGATC CGGCGCAGGT GAAGAAATAC
GCCCGTAAGG CTCAGTTGGG TGAAGCGTTC GAATTCGACA CCGAAACCCT TAACTCGGAC
GGTGTGTTCC GGACTTCGCC GCGTGGCTGG TTCACCTTTG GTCACGCCAG CTTTGCTCTG
CTCTTCTTCT TTGGCCATAT CTGGCACGGC TCTCGGACGC TGTTCCGCGA TGTCTTTGCT
GGGATTGAAG CCGACTTGGG CGAGCAGATT GAATTCGGGG CCTTCCAGAA ATTGGGTGAC
CCGACCACTC GGAAAACAGC CGCTTAA
 
Protein sequence
MGLPWYRVHT VVLNDPGRLI AVHLMHTALV AGWAGSMALY ELAIFDPSDA VLNPMWRQGM 
FVLPFMARLG VTQSWGGWSI TGETAVDPGY WSFEGVAIAH IVLSGLLFLA AVWHWVYWDL
ELFTDPRTGE PALDLPKMFG IHLFLSGLLC FGFGAFHLSG LWGPGMWVSD PYGLTGHVQP
VAPAWGPEGF NPFNPGGIVA HHIAAGVVGI VAGLFHLTVR PPERLYKALR MGNIETVLSS
SLAAVFFAAF VVAGTMWYGN AATPVELFGP TRYQWDQGYF RQEIARRVDT AVASGASLEE
AWSSIPEKLA FYDYVGNSPA KGGLFRTGQM NKGDGIAQGW LGHAVFKDKN GDVLDVRRLP
NFFENFPIVL TDSKGAVRAD IPFRRAEAKF SFEETGITAS FYGGSLNGQT ITDPAQVKKY
ARKAQLGEAF EFDTETLNSD GVFRTSPRGW FTFGHASFAL LFFFGHIWHG SRTLFRDVFA
GIEADLGEQI EFGAFQKLGD PTTRKTAA