Gene Information |
Locus tag | PCC8801_3130 |
Symbol | |
ID | 7102434 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | + |
Start bp | 3273895 |
End bp | 3274965 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643476156 |
Product | pentapeptide repeat protein |
Protein accession | YP_002373267 |
Protein GI | 218247896 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTAGAGG AAGAACAACG ACAGATAATT CTCCAGGGAG TAAAAACTTG GAATCGATGG AGAAGTGAAA ATCCAGCCAT CAAGCCTAGT CTAGTCGGCG TTAACCTCAG TGAGATGAAT CTAAAGGAAG TTAATCTCAG TGACACCAAT CTAAGGGAAA CTAACTTAAG TCGGGCAAAT CTTACTGAAG CTAATTTGGT GGCAGCAGAT GCCAGAGAAG CTAATTTAGT GGGGGCTGAC TTAAGTGGTG CTAATCTGAT GAAATCTAAA CTCAGTTTAG CCAAGTTTGG CAGAGCTAAC CTAACGGGAG CAAGCCTAAA TCGAGCTAAT TTAAGTGGGG CTATCATGAG TTTAGCGGAT CTGAGTTGGT CAACGTTGAG TTGGGCTAAT TTGAGTTGGG CTAACCTCAG TGGGGCTAAT CTGAGTCATT GTAACTTGAG TCGAGCCAAT TTGAGCGGTA TCGATCTGAG TTGGGCTAAC TTGAATTGGG CGAATCTGAG TGAAGCGAAT CTTAAAGAGG CGATTCTTGT TACCACCCAA GCGTTGAATA CAAATTTTAG TCATGCTATT TTAACCGGGG CTTGTATTAA AGACTGGAAG ATTAATCTAG AAGCTAATCT TGATAATATT AACTGTCAAT ATATTTTCCT AGAATGGGAA CATCAAGAAC GCCGTCCTCT TGATACAAAT GACTATTTTA GACCAGGAGA TTTTGCAAGA TTTTTAACTA AAAAAACAGA AACCTTAGAG TTAGTTTTTA CTAATGGAAT TGACTGGGAA ATTTTTCTAG AATCTTGGCG GAAAATTGAA AACGAAGTTA ATCATTACGA TGTTGAGCTT CAAAGAATTG AAAAAAAATC GAATGAATCC TTGATTATTG GGTTAGCAGT CTCAAAAGAT TTGGATAAAA CTGTCCTTGA AAGTTCTTTT TGGGAGCATT ATCAAACTCT TTTAAAAATG CAAGATACCA ACAACGAATT ACGCAAATCA CAGATCCTTT TACAAAGACT AGAAAATACT AAAATATTAA ATATTATTCA AGCGATCGCT CAGAAGAATC GTAAAAAATA A
|
Protein sequence | MLEEEQRQII LQGVKTWNRW RSENPAIKPS LVGVNLSEMN LKEVNLSDTN LRETNLSRAN LTEANLVAAD AREANLVGAD LSGANLMKSK LSLAKFGRAN LTGASLNRAN LSGAIMSLAD LSWSTLSWAN LSWANLSGAN LSHCNLSRAN LSGIDLSWAN LNWANLSEAN LKEAILVTTQ ALNTNFSHAI LTGACIKDWK INLEANLDNI NCQYIFLEWE HQERRPLDTN DYFRPGDFAR FLTKKTETLE LVFTNGIDWE IFLESWRKIE NEVNHYDVEL QRIEKKSNES LIIGLAVSKD LDKTVLESSF WEHYQTLLKM QDTNNELRKS QILLQRLENT KILNIIQAIA QKNRKK
|
| |
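The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the terminal stop codon. The helper names `span_length` and `expected_protein_length` are hypothetical, chosen only for this illustration.

```python
# Values copied from the record above.
start_bp = 3273895
end_bp = 3274965
gene_length_bp = 1071
protein_length_aa = 356


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

# Compare the computed values with the ones listed in the record.
print("gene length matches:", computed_gene_length == gene_length_bp)
print("protein length matches:", computed_protein_length == protein_length_aa)
```

Under these assumptions, both checks agree with the listed values (1071 bp and 356 aa). The gene sequence begins with GTG and ends with TAA. In translation table 11, GTG is an accepted alternative start codon that is read as methionine at initiation, which is consistent with the leading M of the protein sequence.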