Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC8801_0752 |
Symbol | |
ID | 7102804 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | - |
Start bp | 778515 |
End bp | 780659 |
Gene Length | 2145 bp |
Protein Length | 714 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 643473849 |
Product | pentapeptide repeat protein |
Protein accession | YP_002370991 |
Protein GI | 218245620 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTTTTA ATTTTAATCA AGCGAATCTC AGTGGACGTT CTTTTAAAGG ACAAAACCTT ACAGGCGCAG ACTTCAGATG CGCTAACATT CGAGGTGTAA ACTTTGCTAA CGCCACTCTG ACAGGTGCTA ACTTTAGTCA TGCAAAAGCT GGCTTAAGTT TTTATGGTTT AGCTAGCTTA ATCTCTATTT TTGTGATTCT TGCGGCTCTT TCAGGATTGA TTGCCGGATA TGCAGAAACT TTTATAGGAT TGGTAGTCAG CTTATTATCA TCATCTCAAG AAAGAATTTT GGCAAGGCAA ATACTTGTCA TTATAGGTCT GCTAACTTCT ATTAGTTTCA TTGTTATAAT AATTCGCCAA GGAATAGGAA TAAGTTTAGC AATTCTTGCA ATAGTTTTTG CTGTTATAAC TGCCATAATT GCCTATCAAT ATGATTACAG GGTTGGCGCA GATGCTTTAA TTCAGTTTTT TGTAATTGCC GTCATAGTTG CGAGTATTTT ACTCGGAGCT TTAGTTCAAG CTATTTTTCT ATCTATTACA AAAAAAAATA GGGCACTTAT TCTATTTGCA ATCCCAGCCG TAGCTGGAGC AATCGCTGGA GCTATCAAGG GAGTTAAAGG AATTCGACAG GAATTTGTAT CTGTATCCCT AATAATGGCT TTTATTGTGA GCATTGGCTT GTTGGGTCTT AGTGTCTATA TTGGTATACG AGCAATGGCT GGAGATAAAA AATATGACTT AGTTCGAGCA ATTGCCAATG CTAGTTGTAC GGCAATGGGA ACCAGTTTCC GAGGAGCTAA CTTAACTGAT GCTGACTTCA CTGAAGCTAA ACTCCAGAAC ACAGATTTTA GAAAAGCAAT TCTCAAACGA ACTTGCTGGT TTCAAGCTAC AAATCTCGAT CTAGCTGGTG TTGAGAATAC TTATCTAGAA AACCCTTCCT TGCGACAGTT GGTTATTTCT AAAAACGGAG AGGGTGAATT CTATGACTAC CAAGATTTGA GAGGCTTGAA TTTTAAAAAT GCCAACCTAG TAGATGCCAG TTTTATCGGA GCAGATCTCA GTGAAGCAAC TTTACAAGAC GCTGATCTTT CCAGAGCAAA ACTTGTACAG ACACGACTAT ACGGAAGCGA TCTCACCCAT GCCATTTTAA CAGGTGTCTG CATCCAGGAT TGGGCAATCT CTACCGACAC CCAGCTACAA CAAGTAAGAT GCGAGTATGT CTATGTGCGC CTACCGACAA AGGAAGATCC TGACCCCTGG CGAAAACCAG ACAATAGACA AGAGACTTTT AGAGAGGGTG ATTTCTCTGA TTTTATTGCT CCTATCATCA AGACGCTGGA TCTCTATCGA CAACAAAATA TAGATCCTCG TAAGATGGCG AGTACATTTA AAACCCTTGA TTTATACCAC TACGAAGGAA TTGATCCCAG TGCAGCCGCG ATCACTCTCA AACAACTCTC TGAACAATAT CCCGATGCAG GGCTAGAAGT GGTTGCCCTT GAGGGACGAG GGGATGAAAA AGTGCGGCTT CAAGCCGTTG TTACAGAGAC AGTAAACCAA TCTCAACTGA GTGCAGAGTA TTTTGAGCGC TATCGAGAAA TTTCCGCGTT ACCCTATAAA GATATCCAAG CACTGTTAGC GGGAATGGCA GAAAAGGACG AACGAATCCG TAGTTTAGAA CGAATGGTAA ATTCTGCTAT TCAAGGCAAA AAATTTTACG TTGAAACTTA CTATCAAATG GGGGATACCG TGTCTGAAAA AAGTTCAATT AATTTTACAG CCCGCGATAT TAGCGGTGTA GTCAATTTAG GGAGCATTAG TGGCAATGTT ACTAACACTA TTAACCAACT GCCTGAATCA ACCGATCCGA ATCAACCGAG TCTCAAAGAG TTACTAACTA AATTACAGGC TGCCATTGAG TCCGAAACAG AGTTACCAGA TAAAAACAAA GCTCTAGCTT TAGATCAGGT AAAAACTTTG GCAGAATTAG GACAAAAGCC CGAAGACAGT GCATTGCAAA AAGCTGCTCA ACTAGCGATA ATGGCTCTTA AGGGGATAAC ATCAGGGTTA TCTGAAACTA CTAAACTGGT TGTAGAATGT ACTAAGTTAT TACCCGCAAT CTCTACTTTA CTAGCTCTCG TATAG
|
Protein sequence | MTFNFNQANL SGRSFKGQNL TGADFRCANI RGVNFANATL TGANFSHAKA GLSFYGLASL ISIFVILAAL SGLIAGYAET FIGLVVSLLS SSQERILARQ ILVIIGLLTS ISFIVIIIRQ GIGISLAILA IVFAVITAII AYQYDYRVGA DALIQFFVIA VIVASILLGA LVQAIFLSIT KKNRALILFA IPAVAGAIAG AIKGVKGIRQ EFVSVSLIMA FIVSIGLLGL SVYIGIRAMA GDKKYDLVRA IANASCTAMG TSFRGANLTD ADFTEAKLQN TDFRKAILKR TCWFQATNLD LAGVENTYLE NPSLRQLVIS KNGEGEFYDY QDLRGLNFKN ANLVDASFIG ADLSEATLQD ADLSRAKLVQ TRLYGSDLTH AILTGVCIQD WAISTDTQLQ QVRCEYVYVR LPTKEDPDPW RKPDNRQETF REGDFSDFIA PIIKTLDLYR QQNIDPRKMA STFKTLDLYH YEGIDPSAAA ITLKQLSEQY PDAGLEVVAL EGRGDEKVRL QAVVTETVNQ SQLSAEYFER YREISALPYK DIQALLAGMA EKDERIRSLE RMVNSAIQGK KFYVETYYQM GDTVSEKSSI NFTARDISGV VNLGSISGNV TNTINQLPES TDPNQPSLKE LLTKLQAAIE SETELPDKNK ALALDQVKTL AELGQKPEDS ALQKAAQLAI MALKGITSGL SETTKLVVEC TKLLPAISTL LALV
|
| |