Gene Information |
Locus tag | PCC8801_4125 |
Symbol | |
ID | 7101911 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | - |
Start bp | 4325950 |
End bp | 4327320 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 643477114 |
Product | pentapeptide repeat protein |
Protein accession | YP_002374213 |
Protein GI | 218248842 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
|
|
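The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length(4325950, 4327320)
print(length, protein_length(length))  # 1371 456
```

Under these assumptions the result matches the listed Gene Length (1371 bp) and Protein Length (456 aa).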
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGCAT CACAACTACT GCAACAGTAT GAACAAGGCA GAAGGGACTT CCGAGGAGAA GACCTCAGAG GTCAATCGTT TAGAGGAAAA AACCTCGCCC ATACAGACTT CAGTCAAGCC GATATTCGAG GGACTAATTT TACCCAGGCT AACCTAACAG GAGCCAAATT TTGTCAGGCT AAAGGAGGAC TGACTAAACC AAGAGAAATT CTTTTAGGAT TGATTTCTTT TTTATTGGCT GGACTATCAG GGATTTTTTC AGGATTATCC GGCGGTTTAG TCCTATTAAT CTTTGATAGT TCTCATCTGA TCAATCAAAT CGTCGGTTGG ACTGCCTTAA TTATTTTGAT GATTTTTTGT ATCTTCAGTT ACTATCAGGG ATTAACTGGG GGAGTAGCAG CGGCGACTCT CTCCTTTATG GGGACTTTAG TGGTGGCTCT GGTAATCACA CAAGAGGTTT CTATCTCCGT GACTACTATC TTATACATTG GAGTGATTTT TATTGGTCAT GTTGCTGTTG CTCTAGCAGG AGCTCTTACA GGAGCCCTCG CTGGAGCTTT TGCTGGAGCA ACTGTGGGTG CTGTCATTGG GGTTGGTGCT TTCCCCCTAG CGATCATTTT TGCCTTAGCT TTAGCTGAAG CTAACGATAA ACTCTTTTCC ATTGCAGGAG CCGTGACTGC TGTTATAATT TTGAGCTTAC TGAGCATTTA TCTGGGTTTA CGCGCGATGA AAGGGGATAA TCGAGATAGT TGGCTGCGTC ATCTGGCTTT TGCTTTTTCA GCGATCGGAG GAACCAGTTT TTATCGAGCG AATTTAACTA ATGCGGACTT AACGGGAGCT TTGCTCAAAG GAACCGATTT AAGAGACGCT GATTTAACTC GTACTCGCTT TTATCAAGCT CAACAACTCA ACCGCGCTAG AGTCGATAAT ACTATCCTCG CTCAACCGGA AATTAGAGAC TTACTGATTG ATCCGACAAC GGGCTACAAA CAATACTATT ACAAAGCCAA TTTACGAGGA GCTAATCTCG ATTATGCTAA CTTACACGGG GCTAATCTCA AGCAAGCGGA TCTCACCGAT GCAACTTTAC GACACGCTAA TTTAGAGGGA GCTAATCTTA CTAAAATTTT AGCATCAGGA ACCGATTTTA GCGGAGCAAC TTTTCATGGT GTTTGTATCG AAGGGTGGAA AATTGATGTT TTTACAAAAT TAGGGCAAGT TGATTGTGAA TACGTTTTTT TAAGGGAAAA ACCGAATGAA TACGGTAGTC GAGAACGTTG TCCCTATGAT CCAGAGGGAA AATTTGAACC TGGGGACTTT GAACTGTTGT ATAAAGTGAG AAATTACGCT CAAGCACAGC TTGAAGATTG A
|
Protein sequence | MKASQLLQQY EQGRRDFRGE DLRGQSFRGK NLAHTDFSQA DIRGTNFTQA NLTGAKFCQA KGGLTKPREI LLGLISFLLA GLSGIFSGLS GGLVLLIFDS SHLINQIVGW TALIILMIFC IFSYYQGLTG GVAAATLSFM GTLVVALVIT QEVSISVTTI LYIGVIFIGH VAVALAGALT GALAGAFAGA TVGAVIGVGA FPLAIIFALA LAEANDKLFS IAGAVTAVII LSLLSIYLGL RAMKGDNRDS WLRHLAFAFS AIGGTSFYRA NLTNADLTGA LLKGTDLRDA DLTRTRFYQA QQLNRARVDN TILAQPEIRD LLIDPTTGYK QYYYKANLRG ANLDYANLHG ANLKQADLTD ATLRHANLEG ANLTKILASG TDFSGATFHG VCIEGWKIDV FTKLGQVDCE YVFLREKPNE YGSRERCPYD PEGKFEPGDF ELLYKVRNYA QAQLED
|
| |
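The relationship between the two sequence fields can be sketched in code. The gene sequence appears to be given in coding orientation (it starts with ATG and ends with TGA) even though the gene lies on the minus strand, so no reverse complement is needed before translating. The sketch below is a hedged illustration rather than the database's own method: it builds the standard codon table, whose amino acid assignments are assumed here to match those of translation table 11 (the two differ in their permitted start codons), and the helper name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
# Assumption: table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAAAGCAT CACAACTACT GCAACAGTAT"))  # MKASQLLQQY
```

The ten residues produced agree with the start of the listed protein sequence. Passing the full gene sequence to `translate` is one way to check the rest of the record.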