Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC8801_4140 |
Symbol | |
ID | 7104548 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | - |
Start bp | 4342898 |
End bp | 4344388 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 643477129 |
Product | protein of unknown function DUF344 |
Protein accession | YP_002374228 |
Protein GI | 218248857 |
COG category | [S] Function unknown |
COG ID | [COG2326] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTAGATA ACCTAGATTT AAAATTAACC CTTGATAAAG AAACTTATCA ATCCCAACTA GAACAGTTGA TGCGTCAACT GCGATCGCTG CAAAAAGCGT GTTGGGACAA TAAACTCCCC GTCATTATTG TACTCGAAGG TTGGGCAGCG GCCGGCAAAG GAACGTTATT ACAAAAAACT ATTGGCTATA TGGACCCCCG TGGGTTTACG GTTCATCCGA TTTTAGCAGC GACTCCAGAT GAGGAAAAAT ACCCGTTTTT GTGGCGATTT TGGCATAAAC TGCCAGCTAA GGGGAGTATC GGCATTTTTT ATCACAGTTG GTATACCCAT GTCCTAGAAG ATCGTTTGTT TCAAAAGGTG AATAACAGTG ATATTCCCCT CCTAATGCGC GATATTAACG CCTTTGAACA TCAATTAGTG GATGATGGGG TAGCTATGGC TAAATTTTGG ATTCATTTGA GTCGAAAGGA GATGAAAAAG CGACTCAAGA AGTATGAAGC GGATGAACTG GAGTCTTGGC GAGTTCGTCC AGAAGATTGG CAACAGGCTA ACCGCTATGA TGAATATGCA GCCTTAGCTG AAGAAATGTT AACCTTTACC AGTACGGGTC ATGCGCCTTG GACGTTAGTG GAAGGAGACT GTCAGCGATG GACTCGTATT AAAGTATTAT CGCAGATTGT AGCCACCATT ACTCAAGCAT TAGATCTGCG GAAACTCCCT CAAACCGCTA TTCCCTCCTT ACCTCCCCAA ACGGAATTAC AACCCACAGA GCCCGATTTT TTGGGTAAGG TGGATTTAAG TCTGCATTTG TCTAAAGACG AGTATCGGCA ACGGTTAGGG GAAGCGCAGG TTAAACTGCG TCAGTTACAA TTGCGGATTT TTCGGGAAAA TATCCCTGTT TTAGTGACTT TTGAGGGATG GGATGCAGCC GGAAAGGGAG GGGCAATTAA ACGCCTTACG GATACTTTAG ACCCGCGAAG TTACAAAGTC AATGCTTTTG CAGCCCCAAG CCAAGAAGAG AAGCAATACC ATTATTTATG GCGATTTTGG CGATATTTGC CAGGGGGAGG AACAATAGGC ATTTTTGACC GCAGTTGGTA TGGTCGGGTG TTAGTGGAAA GAATTGAAGG GTTTGCCAAT GAGTTAGAAT GGCGGCGATC TTATAAAGAA ATTAATGAAT TTGAAGCCCA ATTAACCCAT GGGGGCTATG TATTAGTTAA GTTTTGGTTA CATATTGGTT TGGATGAACA ATTAAGACGG TTTGAAGAAC GGCGAGATAA TCCTTTTAAA AATTATAAAT TAACCGACGA AGATTGGCGA AATCGAGATA AATTTCCGTT ATATTATGTC GCAGTTAATC AAATGATTGC CCGTACCAGC ACCCCCGCAG CTCCTTGGTA TATTGTCCCT GGCAATGATA AATATTATGC CCGTGTTTTT GTCATTGAAA CGTTGATTAG TGCTATTGAA ACTGAGTTAA AACGGCGATG A
|
Protein sequence | MLDNLDLKLT LDKETYQSQL EQLMRQLRSL QKACWDNKLP VIIVLEGWAA AGKGTLLQKT IGYMDPRGFT VHPILAATPD EEKYPFLWRF WHKLPAKGSI GIFYHSWYTH VLEDRLFQKV NNSDIPLLMR DINAFEHQLV DDGVAMAKFW IHLSRKEMKK RLKKYEADEL ESWRVRPEDW QQANRYDEYA ALAEEMLTFT STGHAPWTLV EGDCQRWTRI KVLSQIVATI TQALDLRKLP QTAIPSLPPQ TELQPTEPDF LGKVDLSLHL SKDEYRQRLG EAQVKLRQLQ LRIFRENIPV LVTFEGWDAA GKGGAIKRLT DTLDPRSYKV NAFAAPSQEE KQYHYLWRFW RYLPGGGTIG IFDRSWYGRV LVERIEGFAN ELEWRRSYKE INEFEAQLTH GGYVLVKFWL HIGLDEQLRR FEERRDNPFK NYKLTDEDWR NRDKFPLYYV AVNQMIARTS TPAAPWYIVP GNDKYYARVF VIETLISAIE TELKRR
|
| |