Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC8801_3069 |
Symbol | |
ID | 7104545 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | + |
Start bp | 3211838 |
End bp | 3213364 |
Gene Length | 1527 bp |
Protein Length | 508 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643476093 |
Product | photosystem II chlorophyll-binding protein CP47 |
Protein accession | YP_002373206 |
Protein GI | 218247835 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03039] photosystem II chlorophyll-binding protein CP47 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGACTAC CTTGGTATCG AGTTCACACA GTTGTCCTGA ATGATCCAGG CCGACTTATT TCCGTTCACC TCATGCACAC CGCCCTTGTT GCGGGTTGGG CAGGTTCCAT GGCTCTGTAC GAGCTAGCTA TTTTTGATCC GAGTGATCCC GTTCTCAACC CCATGTGGCG ACAAGGGATG TTCGTCCTTC CCTTCATGGC CCGCTTAGGA GTCACTGGCT CCTGGGGTGG CTGGAGTGTC ACCGGAGAAA CAGGTGTAAA CCCTGGTTTC TGGTCCTTTG AAGGCGTTGC TGCCGCCCAC ATCGTTCTCT CTGGGTTACT CTTCCTAGCT GCCGTTTGGC ACTGGGTTTT CTGGGATCTC GAACTCTTTG TTGATGCCCG TACTGGCGAA CCCGCCCTCG ACTTACCTAA GATGTTCGGG ATTCACCTGT TCTTATCTGG GTTACTCTGC TTCGGTTTCG GAGCCTTCCA CCTCACCGGA CTCTGGGGAC CGGGGATGTG GGTATCTGAC CCCTACGGCT TAACCGGCCA TGTCCAACCC GTTGCCCCAG AATGGGGTCC GGCCGGGTTT AACCCCTTCA ACCCAGGGGG AGTTGTGGCT CACCACATTG CAGCCGGAAT TGTGGGCATT ATTGCGGGTC TATTCCACCT AACGGTACGA CCCCCCGAAC GGCTCTATAA AGCCCTCAGA ATGGGGAATA TTGAAACCGT TCTCTCTAGC AGTATTGCCG CCGTCTTCTT TGCGGCCTTT GTCGTTGCTG GAACGATGTG GTACGGTAAC GCAACCACCC CCATTGAACT GTTCGGACCG ACCCGTTATC AATGGGATAA TGGCTACTTC AAACAAGAAA TTGAACGTCG TGTTGAAGCC AATGTAGCGG CGGGCGATAC TTTAGGGGAA GCTTGGTCTA AAATTCCCGA AAAACTTGCC TTTTACGACT ATGTTGGCAA CAGCCCCGCA AAAGGCGGTT TATTCCGTAC CGGAGCCATG GATAGTGGCG ATGGTATCGC CCAAGCTTGG TTAGGTCATC CTGTCTTTAC GGACAAAGAC GGTCGGGAGT TAACCGTACG TCGGATGCCT AACTTCTTTG AAACTTTCCC CATCGTTCTA ACCGATGCTG ATGGAGTCGT CCGTGCTGAC ATTCCCTTCC GTCGGGCAGA ATCTAAACTG AGTATTGAGC AAAGCGGTGT TACCGTTAGC TTCTATGGTG GTGCGCTTGA TGGCCAAAGC TTCAGCAACC CCGCTCAGGT TAAACAGTTT GCTCGTCAAG CCCAATTAGG CGAACCCTTC GAGTTTGACC GCGAAACCCT CGGTTCTGAT GGGGTATTCC GTACCAGTCC TCGCGGTTGG TTTACCTTCG GACACGCCGT CTTCGCCCTA CTGTTCTTCT TTGGTCATAT TTGGCATGGT TCTCGTACCC TGTACCGAGA TGTCTTCGCT GGAATTGACC CCGACCTAGA GGAACAAGTG GAATTTGGCT TGTTTGCTAA GGTGGGTGAC TTGAGTACCC GTCGTACCGA GTCTTAA
|
Protein sequence | MGLPWYRVHT VVLNDPGRLI SVHLMHTALV AGWAGSMALY ELAIFDPSDP VLNPMWRQGM FVLPFMARLG VTGSWGGWSV TGETGVNPGF WSFEGVAAAH IVLSGLLFLA AVWHWVFWDL ELFVDARTGE PALDLPKMFG IHLFLSGLLC FGFGAFHLTG LWGPGMWVSD PYGLTGHVQP VAPEWGPAGF NPFNPGGVVA HHIAAGIVGI IAGLFHLTVR PPERLYKALR MGNIETVLSS SIAAVFFAAF VVAGTMWYGN ATTPIELFGP TRYQWDNGYF KQEIERRVEA NVAAGDTLGE AWSKIPEKLA FYDYVGNSPA KGGLFRTGAM DSGDGIAQAW LGHPVFTDKD GRELTVRRMP NFFETFPIVL TDADGVVRAD IPFRRAESKL SIEQSGVTVS FYGGALDGQS FSNPAQVKQF ARQAQLGEPF EFDRETLGSD GVFRTSPRGW FTFGHAVFAL LFFFGHIWHG SRTLYRDVFA GIDPDLEEQV EFGLFAKVGD LSTRRTES
|
| |