Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cyan8802_3601 |
Symbol | |
ID | 8392942 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | + |
Start bp | 3668363 |
End bp | 3669478 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 644981532 |
Product | pentapeptide repeat protein |
Protein accession | YP_003139255 |
Protein GI | 257061367 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGACT ATTCGCCTGT TCTTCATCTT GTCACCTTCC ATAACCCCTA TCCTAACTGT GTTTCCCTCA AATTAGACAC CATGGGGAGT CAATCCTCAT CGGGCCAGAT AATGCTTCAG TTACAGGGAC ACTTCAACGA ACAGGAAAAA GACCTCCTCA ATGGTCATCT GAAATTCGGT TTAAAGGGGG GTATACTATC ACTTGAACTG GAAAATGGAG AAATAATCTA TCCTGAACCC TTACTAGAAG ATTGGGCACA ACTTCAAACC CAGTCGTCTG TCAATCCAAG TTGGGAATTA ACCCCAAAAA CTGGGGCATC TATCCTAAAA ATCGATAATA TTACTGTTCC TTTCGCCATT ATTCAACCTC AAACTGAGCC ATTATATCTA ACAGTCACCT TAAAAGTCAC TCCTCAAAAC CTTTCTATTA CCAATGCAGA AGGGTTATGG CGGCACGATA TTCACCCCAA CCAACACGCG ATATTAGAAC GGGTATTAGC CCAATTTTTG TATAAAAATC GCTTATCTTC TCATTTGTGC CGTTTAGTTT TTAGCTCTAA TAAGGGGACT CATCAGGCAA CCCTAGAAGA CTATCCTAGC CAAGAACTTG ACTCCCATGA ATTAGCTCAA CTGCATCAAC GCATTGAACA GCTTTACGCT GCCAATACCC ATAATTTAGC TGAATTGATC AAATTAGCGC ATTTTAACCC TTTAACAGAC CTAGCAGGAG GCAATTTTTT AGCGGCTGAA TTAAGCGCAG TGGAGTTAAG TGGAGCGAAT CTGACTCAAA CCAATTTTCG AGGAGCGAAT TTGACCGATG CAGAGTTAAG CGAGGCTATC CTAAACTATT GTAAATTCAG TGGAGCCGAC TTAAGTGGGG CTTATTTAGG CAATGCTCAA TTAGTGAAAG CGGATTTTCA TCGCGCGAGT TTAGCCGTTG CTAACCTCAT TGGGGCGAAT CTAACGGAAG CTAACTTAAG GGAAGCTAAC TTAATTGACG CTAATTTAAG CGGAGCAACC GTTAAAGACG CAAAATTCGG CGAAAATCCA GGCATGACCC CAGAATTAGA GCAGAGTTTA CACGAACGCG GTGCAATTTT TGTCCATAAT CCTTAA
|
Protein sequence | MSDYSPVLHL VTFHNPYPNC VSLKLDTMGS QSSSGQIMLQ LQGHFNEQEK DLLNGHLKFG LKGGILSLEL ENGEIIYPEP LLEDWAQLQT QSSVNPSWEL TPKTGASILK IDNITVPFAI IQPQTEPLYL TVTLKVTPQN LSITNAEGLW RHDIHPNQHA ILERVLAQFL YKNRLSSHLC RLVFSSNKGT HQATLEDYPS QELDSHELAQ LHQRIEQLYA ANTHNLAELI KLAHFNPLTD LAGGNFLAAE LSAVELSGAN LTQTNFRGAN LTDAELSEAI LNYCKFSGAD LSGAYLGNAQ LVKADFHRAS LAVANLIGAN LTEANLREAN LIDANLSGAT VKDAKFGENP GMTPELEQSL HERGAIFVHN P
|
| |