Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Synpcc7942_0697 |
Symbol | |
ID | 3775867 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus elongatus PCC 7942 |
Kingdom | Bacteria |
Replicon accession | NC_007604 |
Strand | - |
Start bp | 689222 |
End bp | 690748 |
Gene Length | 1527 bp |
Protein Length | 508 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637799109 |
Product | photosystem II core light harvesting protein |
Protein accession | YP_399716 |
Protein GI | 81299508 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03039] photosystem II chlorophyll-binding protein CP47 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00000016119 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.0946335 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGACTAC CCTGGTACCG TGTCCACACG GTCGTCCTCA ATGATCCGGG ACGACTGATT GCAGTGCACT TGATGCACAC TGCCCTCGTG GCAGGTTGGG CAGGTTCGAT GGCTTTATAC GAACTTGCCA TTTTTGATCC TTCTGATGCC GTGCTGAACC CCATGTGGCG GCAAGGCATG TTCGTGTTGC CGTTCATGGC GCGTTTGGGC GTCACCCAAT CTTGGGGTGG CTGGAGCATC ACCGGCGAAA CCGCCGTGGA TCCTGGCTAT TGGAGCTTTG AAGGCGTCGC GATCGCCCAC ATCGTACTGT CGGGTCTGCT GTTCCTCGCA GCAGTCTGGC ACTGGGTCTA CTGGGACCTC GAACTCTTTA CCGATCCCCG CACCGGCGAA CCGGCCTTGG ACCTGCCAAA AATGTTTGGC ATCCACCTGT TCCTCTCCGG TCTTCTTTGC TTCGGCTTCG GTGCTTTCCA CCTGTCTGGC CTCTGGGGCC CGGGGATGTG GGTCTCCGAT CCCTACGGCT TGACCGGCCA TGTCCAACCA GTTGCCCCGG CCTGGGGTCC TGAAGGCTTC AACCCCTTCA ATCCGGGTGG CATTGTGGCT CACCACATTG CAGCCGGTGT CGTTGGCATC GTCGCAGGCC TCTTCCACCT GACGGTTCGT CCCCCCGAGC GCCTCTACAA AGCGCTGCGG ATGGGCAACA TCGAAACCGT CTTGTCGAGC TCCTTGGCAG CAGTCTTCTT CGCTGCTTTT GTGGTCGCTG GCACGATGTG GTACGGCAAC GCTGCCACGC CAGTCGAACT GTTTGGCCCA ACTCGCTACC AGTGGGACCA AGGCTACTTC CGTCAGGAAA TTGCCCGCCG GGTTGATACG GCTGTCGCCA GTGGCGCTTC TCTAGAGGAA GCTTGGAGCT CCATTCCTGA AAAACTGGCC TTCTATGACT ACGTCGGCAA CAGCCCTGCT AAAGGTGGCT TGTTCCGTAC CGGTCAGATG AACAAAGGTG ACGGGATTGC CCAAGGCTGG CTCGGCCACG CTGTCTTCAA GGACAAAAAT GGCGATGTGC TCGACGTCCG TCGCTTGCCG AACTTCTTCG AGAACTTCCC GATCGTCTTG ACTGACAGCA AAGGTGCTGT GCGGGCAGAC ATTCCTTTCC GTCGTGCTGA AGCGAAATTC AGCTTCGAGG AAACCGGAAT TACGGCTAGC TTCTACGGCG GTTCTCTGAA TGGCCAAACC ATCACTGATC CGGCGCAGGT GAAGAAATAC GCCCGTAAGG CTCAGTTGGG TGAAGCGTTC GAATTCGACA CCGAAACCCT TAACTCGGAC GGTGTGTTCC GGACTTCGCC GCGTGGCTGG TTCACCTTTG GTCACGCCAG CTTTGCTCTG CTCTTCTTCT TTGGCCATAT CTGGCACGGC TCTCGGACGC TGTTCCGCGA TGTCTTTGCT GGGATTGAAG CCGACTTGGG CGAGCAGATT GAATTCGGGG CCTTCCAGAA ATTGGGTGAC CCGACCACTC GGAAAACAGC CGCTTAA
|
Protein sequence | MGLPWYRVHT VVLNDPGRLI AVHLMHTALV AGWAGSMALY ELAIFDPSDA VLNPMWRQGM FVLPFMARLG VTQSWGGWSI TGETAVDPGY WSFEGVAIAH IVLSGLLFLA AVWHWVYWDL ELFTDPRTGE PALDLPKMFG IHLFLSGLLC FGFGAFHLSG LWGPGMWVSD PYGLTGHVQP VAPAWGPEGF NPFNPGGIVA HHIAAGVVGI VAGLFHLTVR PPERLYKALR MGNIETVLSS SLAAVFFAAF VVAGTMWYGN AATPVELFGP TRYQWDQGYF RQEIARRVDT AVASGASLEE AWSSIPEKLA FYDYVGNSPA KGGLFRTGQM NKGDGIAQGW LGHAVFKDKN GDVLDVRRLP NFFENFPIVL TDSKGAVRAD IPFRRAEAKF SFEETGITAS FYGGSLNGQT ITDPAQVKKY ARKAQLGEAF EFDTETLNSD GVFRTSPRGW FTFGHASFAL LFFFGHIWHG SRTLFRDVFA GIEADLGEQI EFGAFQKLGD PTTRKTAA
|
| |