Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Syncc9605_0470 |
Symbol | |
ID | 3737673 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus sp. CC9605 |
Kingdom | Bacteria |
Replicon accession | NC_007516 |
Strand | - |
Start bp | 459125 |
End bp | 460684 |
Gene Length | 1560 bp |
Protein Length | 519 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637775060 |
Product | photosystem II chlorophyll-binding protein CP47 |
Protein accession | YP_380799 |
Protein GI | 78212020 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03039] photosystem II chlorophyll-binding protein CP47 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGATTGC CCTGGTATCG GGTGCACACC GTTGTCATTA ATGACCCGGG CCGCCTTCTG GCTGTGCACC TCATGCATAC AGCCCTCGTA GCCGGCTGGG CCGGCTCCAT GGCTCTCTAC GAGCTGGCCA TTTTCGATCC GTCTGACCCT GTCCTGAACC CCATGTGGCG TCAGGGCATG TTCGTGATGC CTTTCATGTC CCGCCTTGGC GTGACCGGCA GCTGGGGTGG ATGGAGCATC ACCGGCGAAA CGGGTGTTGA TCCCGGTTTC TGGAGTTTTG AGGGCGTTGC TGCCGCTCAC ATAGTTTTCT CCGGCCTGAT GATGCTGGCC GCCATCTGGC ACTGGACTTA TTGGGATCTT GAGATCTGGC AGGACCCCCG CACTGGCGAA CCCGCCCTCG ACCTGCCGAA GATCTTCGGC ATCCACCTGC TTCTCGCAGG ACTCGGCTGC TTCGGTTTCG GTGCTTTCCA CCTCACTGGC GTCTTCGGGC CAGGCATGTG GATTTCTGAC CCCTATGCAT TAACTGGTCA TCTCGAGGCG GTTCAACCGT CTTGGGGGCC TGAAGGTTTC AACCCCTTCA ACCCCGGTGG CATCGTTGCC CACCACATTG CCGCCGGCAT CGTCGGCATC ATTGCTGGCA TTTTCCACAT CACCACGCGA CCGCCCGAGC GCCTCTACAA AGCGCTTCGG ATGGGCAACA TCGAAACGGT TCTGGCCAGC GCCATCGCAG CCGTGTTCTT CGCAGCCTTC ATCGTGGCTG GAACCATGTG GTACGGCTCT GCCGCGACCC CCGTCGAGCT GTTTGGCCCC ACCCGTTATC AGTGGGATCA GAACTACTTC AAAACTGAGA TCAATCGTCG GGTTCAAACC GCGATGGATG ATGGTGCCAC CCAGGAAGAA GCCTTCGAGG CCATCCCTGA GAAGCTCGCT TTTTATGACT ATGTTGGCAA CAGCCCCGCC AAAGGTGGTC TGTTCCGCGT GGGTCCGATG GTGAACGGCG ATGGTTTGGC AACCGCCTGG GTTGGTCACA TCGCATTCAG TGACAATGAA GGTCGCAACC TCGAAGTCCG TCGCCTGCCG AACTTCTTCG AGAACTTCCC CGTCGTTCTG GAAGACGAGC AGGGCATCGT TCGTGCAGAC ATTCCCTACC GTCGCGCAGA AGCCAAGTTC TCCTTCGAAC AACAAGGCGT GACCGCCAAG GTGTTCGGTG GCGCACTTGA CGGCCAGACC TTCACTGACC CTGCCGACGT AAAGCGCCTT GCCCGTAAGG CACAGCTGGG TGAAGCCTTC GACTTCGACC GTGAGACCTA CAACTCTGAC GGCACGTTCC GCAGCTCGCC ACGCGGCTGG TTCACCTTTG GCCACGCCAC CTTCGCGCTG CTGTTCTTCT TCGGTCACAT CTGGCACGGT GCCCGCACCC TGTACCGCGA CGTTTTCGCT GGTATTGATC CCGACCTCGG AGATCAGGTG GAGTTTGGCC TCTTCGCCAA GCTCGGTGAC AAAACCACCC GTCGCCTGCC CGAGGGCTAC GTTCCCCCCG CAGGAACTCC TCTCAACTGA
|
Protein sequence | MGLPWYRVHT VVINDPGRLL AVHLMHTALV AGWAGSMALY ELAIFDPSDP VLNPMWRQGM FVMPFMSRLG VTGSWGGWSI TGETGVDPGF WSFEGVAAAH IVFSGLMMLA AIWHWTYWDL EIWQDPRTGE PALDLPKIFG IHLLLAGLGC FGFGAFHLTG VFGPGMWISD PYALTGHLEA VQPSWGPEGF NPFNPGGIVA HHIAAGIVGI IAGIFHITTR PPERLYKALR MGNIETVLAS AIAAVFFAAF IVAGTMWYGS AATPVELFGP TRYQWDQNYF KTEINRRVQT AMDDGATQEE AFEAIPEKLA FYDYVGNSPA KGGLFRVGPM VNGDGLATAW VGHIAFSDNE GRNLEVRRLP NFFENFPVVL EDEQGIVRAD IPYRRAEAKF SFEQQGVTAK VFGGALDGQT FTDPADVKRL ARKAQLGEAF DFDRETYNSD GTFRSSPRGW FTFGHATFAL LFFFGHIWHG ARTLYRDVFA GIDPDLGDQV EFGLFAKLGD KTTRRLPEGY VPPAGTPLN
|
| |