Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC8801_2699 |
Symbol | |
ID | 7102089 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | + |
Start bp | 2786122 |
End bp | 2787327 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643475737 |
Product | pentapeptide repeat protein |
Protein accession | YP_002372856 |
Protein GI | 218247485 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACTA TTGCAGCAAT TCTTGTCGTA ACTCCTTTAT TCCTAAGTTC TGTCGTCAAA GCCGAAAATC CTAGTTCAGT ACAACGGTTA TTAACAACTA AAGAATGTAT CGGTTGTAAC TTACAAAATG CTAACCTAAA AGGCCTTAAT TTAGAAGGAG TCAATCTAGA AAAAGCCAAC TTAAAAAATG CTAATTTGCA GGGAGCTAAC CTGAACAATG CTCACCTCAA ACAGGCCATT TTACAGGATG CTCGATTGAT GGATGCTCAA CTCGAAGGAA CTGTACTCGA AGCAGCTAAT CTGATCAATA CAAAGCTTGA TGGTGCTAAT TTAAATAATG CTAACCTTAA AGGCGTTAAT CTGGTCAATT CACAGATGAA TGGCATTATT TTGACTAATG CAAACTTAGA AGGAGCCACA ATGAGGGGTG TTTCCCTCCA AGAAGCCAAT TTAGATGGAG CTATCTTAAT CCAGGCTGAT TTAACCGTTC ATGATGAAAA ACGAGTGAAT CTGACGGGTG CGAGTCTCAA AAATGCGGAT TTGTCAGGGG CACATCTTCG CGGTATCAGA CTCAAAGATG CTAACCTTGA AGGGGCTAAT CTGGAAAAAA CTGACTTTAC CCGCGATATT CCTAATAATA CCACCGCTAA AGGAGCTCTC AGTGTAGCTA CCTCACCCAT TCCCTTAGTT TTGCCTGGTG CTGTCTTGGG TGCTATTGGG AACTTTGCTA TTGGAGAAGC TTCTGCGTTG AATGCGGATG TTAGTAATAC CAATTTAGCA GGAGCCAATT TAGAAGAAGC TAATCTCCAA GACATTAATT TAGAGAACTC CAATCTCAAG AATGCTAACT TAGAAAAAGC TAATTTACAC AATGCTTATT TAGTCAATAC GAATTTGACT AATGCCAATT TAAGTTTAGC CAAATTAACT AATATTAATA TGGAGGGAGT TAACTTAAGT AGTGCTAACT TAGCCGGGGC TAATTTGGAT AAATCCTATC TAGCTAAGGC TAATCTGACC AATGCTAAGC TTGAATCAGC CAAATTAACG AATGTTAATT TAACGGACAC TCAGTTAACA AATGCTAACT TAATGAAAGC CCAATTAGCT AATGCTAACT TAAGCAATTC TAACTTGTGT GGGGCAACCA TGCCTGATGG TTTGATTTCT CAAATAGGAT GTACTGCGGC CAATATTCAG TCATAA
|
Protein sequence | MKTIAAILVV TPLFLSSVVK AENPSSVQRL LTTKECIGCN LQNANLKGLN LEGVNLEKAN LKNANLQGAN LNNAHLKQAI LQDARLMDAQ LEGTVLEAAN LINTKLDGAN LNNANLKGVN LVNSQMNGII LTNANLEGAT MRGVSLQEAN LDGAILIQAD LTVHDEKRVN LTGASLKNAD LSGAHLRGIR LKDANLEGAN LEKTDFTRDI PNNTTAKGAL SVATSPIPLV LPGAVLGAIG NFAIGEASAL NADVSNTNLA GANLEEANLQ DINLENSNLK NANLEKANLH NAYLVNTNLT NANLSLAKLT NINMEGVNLS SANLAGANLD KSYLAKANLT NAKLESAKLT NVNLTDTQLT NANLMKAQLA NANLSNSNLC GATMPDGLIS QIGCTAANIQ S
|
| |