Gene Information |
Locus tag | PCC8801_2071 |
Symbol | |
ID | 7104312 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | + |
Start bp | 2141327 |
End bp | 2144038 |
Gene Length | 2712 bp |
Protein Length | 903 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 643475128 |
Product | Phycobilisome linker polypeptide |
Protein accession | YP_002372259 |
Protein GI | 218246888 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
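The coordinate and length fields above are consistent with one another under two assumptions that the record itself does not state: that Start bp and End bp are 1-based inclusive positions, and that the CDS includes its terminal stop codon. A minimal sketch of that arithmetic follows; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record for PCC8801_2071
length_bp = gene_length(2141327, 2144038)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values from this record, the result matches the listed Gene Length (2712 bp) and Protein Length (903 aa).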
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGTTA AGGCAAGTGG TGGAAGCTCG TTAGCACGGC CCCAACTGTA TCAGACGGTT CCCGTTTCTG CGATCACACA AGCAGAACAA CAGGATCGCT TTTTGGCGAA CCCCGAACTC AATGAGTTGG TCGCCTATTT TCAATCAGGA AGCAAGCGAC TAGCGATCGC CCAGATATTA ACCGACAATT CCGACCTAAT TGTCTCACGG GCAGCTAATC GTATCTTCAC TGGGGGGTCC CCCATGGCAT ATCTCGAAAA ACCCCCAGTG GAGGAAGTCA GAGAAATGGC CATGGCTGGA GGAGGACAAG CTCCCAGTCT ACAAAGAAGT ATGGCTTTAG GAACCGTGAC CTATGCCGAA GGTGGCGGTG GTGGCGGCGG CGGCTTTTTC GGAGGCTTAC GCTCAATCTT AAGTTCCACA GGACCGATTC CTGCCGGGTT CCGTCCCATC AATATCTCCC GCTATGGTCC GAGCAATATG CAAAAGTCCT TGCGGGATCT ATCGTGGTTC TTGCGCTATG TCACCTATGC CATTGTCGCT GGAAACCCCA GTATTATCGT CGTGAACACC CGTGGACTAC GGGAAGTAAT TGAAAGAGCC TGTTCAACAG ATGCGACCAT TGTAGCCCTG CAAGAAATGC GAGCCGCAGC CAGAGATTAC TTCCGTCAGG ATGCCGAAGC TCAGGCGATC GTCACCGAAT ACTTCGATGT CCTGATCACC GAATTTAAAG CACCGACCCC CTCTAATAAA TTGCGTCAGC GTCCTTCATC GGATCAGCAA GGATTAGCTC TGCCCCAAAG TTACTACAAT GCGGCACAAA CACGGCAAAA ATTCGTCATG AAGCCGGGCT TATCGGAGTC AGAAAAATCG GCAGTCGTCA AAGCAGCCTA TCGTCAACTG TTTGAACGGG ATATTACCCG CGCCTACGGC CAGTCGATTT CCTATCTCGA ATCTCAGGTA AGAAACGGCG ACATCTCCAT GAAAGAGTTC GTCCGTCGCC TGTGTAAGTC TCCCTTGTAT CGCAAGCAAT TCTTCGAGCC CTTCATCAAC AGTCGTGCCC TAGAATTGGC CTTCCGTCAT ATTTTGGGTC GTGGTCCGAG TTCCCGGGAA GAAGTACAAA CCTACTTTTC GATCGTCTCT AGTGGTGGAT TAGCTGGTCT GGTAGATGCT TTAGTCGATT CTCAGGAGTA TTCCGACTAC TTTGGGGAAG AAACTGTTCC CTATCTTCGC GGATTAGGAC AAGAAGCTCA AGAATGCCGT AACTGGGGAA TGCAGCAAGA TCTGTTTAAC TACAGCGCAC CTTTCCGCAA AGTACCTCAA TTTATCACGA CCTTCGCTAA ATACGATCGC CCCTTACCCG ACCAGCACGT TTACGGCTCA GGGAATGATC CCCTAGAAAT TCAATTTGGG GCAATTTTCC CGAAAGAAAC CCGTAATCCG AGCAATCGTC CTGCGCCCTT TAGCAAAGAT ACCAAACGGA TTCTCATTCA CCGTGGACCA GGGATTAATA ACCAAAATAG CAATCCTACC GCACGGGGTG AATTCCCTGG ATCATTGGGA GCCAAAGTTT TCCGCTTAAA TAACGAACTC CCTGGCAGCA GCAACGGAGT GAGTATTAAA TACGGGGAAA GTTCCACGCA AGCAGTGATT CGCGCTGCCT ATCGCCAAGT CTTTGGACGG GATGTCTATG AAGGGCAACG GCTAAGTGTC GCAGAAGTTA AGCTAGAAAA CGGCGAAATT ACCCTACGGG AGTTTATCAA AACCTTAGCG AAATCGGACA CCTTCCTCAA GACCTACTGG 
ACTCCTTTCT ATGTGGTCAA GGCGATCGAA TATATCCACC GTCGTCTTTT GGGTCGTCCT ACCTACGGCC GTCAGGAGAT GAACAAATAT TTCGATCTAG CCTCGAAAAA AGGTTTCTAT GCCCTTGTCG ATGAGATGAT CGATAGTAAA GAGTATAGCG AAGCCTTTGG CGAAGATACT GTCCCCTACG AACGCTATTT AACCCCTGCC GGAATGCAGC TACGCATGGC GCGTCCCGGG TCAATTCGTG AGGATATTGG TCAACGGGTA GACAAGGAAA CGACTCCCCG CTTTATCGAG TTGGGACAAG TTAGTGCGAT CCGTACCGAA CCCGAAATTG CTTTCCGCAT TAATCAAGGG GTTACGGTTG AGCGTCAGCA AACCAAGATC TTTAAGTTAC TCTCAACAGC CGATAAAGTG GCGGTTAAAA ACGTCATTCG CGCTGCCTAC CGTCAGATTT TTGAACGGGA TCTCGAACCT TACATTGTTC AAGCAGAATT TACCGCTCTT GAAAGTAAGC TGAGTAATGA GGAAATTTCT GTTAAAGAGT TCATTGAACA GTTAGGCTGT TCTGATCTTT ATATCAAGGA ATTTTATGCT CCCTATCCCA ATACTAAGGT TATTGAATTG GGAACTAAAC ATTTCCTCGG TCGGGCACCG TTAACCCAGA AAGAGATCCA AAAATACAAT CAAATTCTGG CAACTCAAGG CATTCGTGCC TTTATTGGAG CCATGGTTGA TAGCATGGAA TACTTACAAT TGTTCGGGGA AGATACGGTT CCCTATCGTC GTTTCCCGAC CCTTCCTGCG GCGAATTTCC CCAATACGGA ACGGCTTTAT AATAAGCTGA CTAAGCAGGA TAAAGAGTTG GTGGTTCCTA GTTTTGAACC CGTGGTTAAA GTGGGTGGTT AA
|
Protein sequence | MSVKASGGSS LARPQLYQTV PVSAITQAEQ QDRFLANPEL NELVAYFQSG SKRLAIAQIL TDNSDLIVSR AANRIFTGGS PMAYLEKPPV EEVREMAMAG GGQAPSLQRS MALGTVTYAE GGGGGGGGFF GGLRSILSST GPIPAGFRPI NISRYGPSNM QKSLRDLSWF LRYVTYAIVA GNPSIIVVNT RGLREVIERA CSTDATIVAL QEMRAAARDY FRQDAEAQAI VTEYFDVLIT EFKAPTPSNK LRQRPSSDQQ GLALPQSYYN AAQTRQKFVM KPGLSESEKS AVVKAAYRQL FERDITRAYG QSISYLESQV RNGDISMKEF VRRLCKSPLY RKQFFEPFIN SRALELAFRH ILGRGPSSRE EVQTYFSIVS SGGLAGLVDA LVDSQEYSDY FGEETVPYLR GLGQEAQECR NWGMQQDLFN YSAPFRKVPQ FITTFAKYDR PLPDQHVYGS GNDPLEIQFG AIFPKETRNP SNRPAPFSKD TKRILIHRGP GINNQNSNPT ARGEFPGSLG AKVFRLNNEL PGSSNGVSIK YGESSTQAVI RAAYRQVFGR DVYEGQRLSV AEVKLENGEI TLREFIKTLA KSDTFLKTYW TPFYVVKAIE YIHRRLLGRP TYGRQEMNKY FDLASKKGFY ALVDEMIDSK EYSEAFGEDT VPYERYLTPA GMQLRMARPG SIREDIGQRV DKETTPRFIE LGQVSAIRTE PEIAFRINQG VTVERQQTKI FKLLSTADKV AVKNVIRAAY RQIFERDLEP YIVQAEFTAL ESKLSNEEIS VKEFIEQLGC SDLYIKEFYA PYPNTKVIEL GTKHFLGRAP LTQKEIQKYN QILATQGIRA FIGAMVDSME YLQLFGEDTV PYRRFPTLPA ANFPNTERLY NKLTKQDKEL VVPSFEPVVK VGG
|
| |
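The sequences above are printed in space-separated blocks of 10, so they need cleaning before use. The sketch below strips that whitespace, computes a GC fraction, and translates a forward-strand CDS with the amino-acid assignments of translation table 11 (which, for codon-to-residue mapping, are the same as the standard code). The helper names are hypothetical, and the codon table is built here by hand rather than taken from any library.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (NCBI translation table 11)
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First and last blocks of the gene sequence shown in the record
print(translate("ATGAGTGTTA AGGCAAGT"))
print(translate("GTGGGTGGTT AA"))
```

The two fragments correspond to the start (MSVKAS) and end (VGG) of the listed protein sequence. This sketch does not handle alternative start codons, ambiguous bases, or reverse-strand genes.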