Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC8801_2701 |
Symbol | |
ID | 7105038 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | + |
Start bp | 2788388 |
End bp | 2789593 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643475739 |
Product | pentapeptide repeat protein |
Protein accession | YP_002372858 |
Protein GI | 218247487 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA TTGCTTCAAT TCTTCTCCTA ACTCCTCTGT TACTAACCTC GGTTGTTAGA GCAGAAAATC CGAGTTCAGT AGAACGATTA TTAACCACAA GAGAATGTGT TGGTTGCGAC TTAGAAAATG CTAATCTCAA AGGCATGAAT TTAGAAGGAG CCAATCTCGA AAAAGCAAAT TTAAAAAATG CCAATTTGGA GAGAGCTAAC CTGAAGAATG CTAACTTAAA ACAAGCCATT TTGCAAGATG CGAAATTAAC CGAAGCTCAA CTCGAAGGCG TTATGCTTGA TGGGGTTAAT TTTATTAATG CCAAGCTAAA AGGGGTTAAT TTAAGCGGAG TGAACCTTAA AGGGGTTAAT TTCGTCAATG CGGAGATGGA TGGCATTATC TTAACGAATG CTAACCTAGA AGGAGCCCAG ATGAGGGGTG TCACCCTCGA AGGAGCCAAC CTAGACGGAG CTAACTTACA GGGAGTTGAT TTAACGGTTC ATGACGAAGA ACGAGCCAAT TTAACGGGTG CAAGTCTAAA AAATGCTGAC TTGTCAGGGG GTTTTCTGCG GGGTATCAGA CTAAAAAACG CTAACCTTGA AGGCGCAAAT CTTTCTAAAA CCGACTTTAG CCGCGATATT CCTAATAATA CCACCGCTAA AGGAGCTCTT AATGTCGCTA CTACCCCCAT TCCTTTAATT TTTCCTGGGG CAATTTTGGG GGTTATTGGA GACGTAGCTA TTAATGAAGC TTCGGCTCTC AATGCGGATG TGAGTTATGC CAATTTAGAA GGAGCTAACT TACAAGATTC TAACCTAGAA GACATTAATT TTGAGAGTTC TAATCTAAAG AATGCTAACT TACAAAATGC GAATTTAAAC AATGCTTATT TAGTCAATAC TAACTTGACT AATGCTAATT TAAGTTCAGC TAATTTAACT AACATTAATA TGCAAGGAGT GAACTTAAGT TATGCTAACT TAATGGGAGC TAATTTAGAC GGTTCTTATT TAGTTAATGC TAATTTGAGC CATGGTAACC TTGAATCAGC GCACTTAACA AGTATTAATA TGAGTGGTGC TCAGTTAAGT AATGCTAACT TAAGCGAAGC TAAATTAACC GATTCCAACT TGAGTAATTC TAACTTGTGT AGTGCCACGA TGCCTGATGG TTCGATTTCT CAAATTGGAT GTACTGCTGT TAATCTTGAC AAGTAA
|
Protein sequence | MKKIASILLL TPLLLTSVVR AENPSSVERL LTTRECVGCD LENANLKGMN LEGANLEKAN LKNANLERAN LKNANLKQAI LQDAKLTEAQ LEGVMLDGVN FINAKLKGVN LSGVNLKGVN FVNAEMDGII LTNANLEGAQ MRGVTLEGAN LDGANLQGVD LTVHDEERAN LTGASLKNAD LSGGFLRGIR LKNANLEGAN LSKTDFSRDI PNNTTAKGAL NVATTPIPLI FPGAILGVIG DVAINEASAL NADVSYANLE GANLQDSNLE DINFESSNLK NANLQNANLN NAYLVNTNLT NANLSSANLT NINMQGVNLS YANLMGANLD GSYLVNANLS HGNLESAHLT SINMSGAQLS NANLSEAKLT DSNLSNSNLC SATMPDGSIS QIGCTAVNLD K
|
| |