Gene Information |
Locus tag | Cyan8802_0672 |
Symbol | |
ID | 8389978 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | - |
Start bp | 677094 |
End bp | 678449 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 644978693 |
Product | sun protein |
Protein accession | YP_003136449 |
Protein GI | 257058561 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0144] tRNA and rRNA cytosine-C5-methylases |
TIGRFAM ID | [TIGR00446] NOL1/NOP2/sun family putative RNA methylase [TIGR00563] ribosomal RNA small subunit methyltransferase RsmB |
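The coordinate and length fields above are related by simple arithmetic, which can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical and introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(677094, 678449)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

The strand field does not enter the calculation, because the length of a feature is the same on either strand.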
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0363723 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAAACT CAGATCAGAT AACCCATGCG CGTCCCTTGG CCTTGGTAAT TCTCCGTGAG ATTGAAAGAC GAGGAACTTT CACGGATATA GCGATTGATC GCGGTTTAAA ATCAACCGAT CTTAGTGGCA GCGATCGCGC TTTAGTCACA GAATTAGTCT ATGGTATCGT CCGACGCAAA CGAACCCTTG ATACCCTGAT TGATCAGTTG GGGAAAAAAG CCTCCCACCA ACAACCTCCG GATTTACGTC TGATCCTTTA TATTGGACTC TATCAACTGC GTTATTTAAG CCAAATTCCC CCTTCTGCGG CGGTGAATAC TGCCGTTAAC TTGGCCAAAG AAAATAGCTT ACAACGGCTT TCTGGGGTGG TTAATGGCAT TTTACGGCAA TATATTCGCC TTGCTCAAGA AAACAATGAC CCTCTAATCT TACCCGATGA TCCCATCTCA AGATTAGGGG TTCTCTATAG TTTTCCTGAT TTCATGATTA AACTGTGGTT AGAACAATGG GGACTAGAAA CCACCGAAGA ATTATGTAAT TGGTTTAATC AACCTCCTGT CTTAGATATC CGGATTAATC CTTTAAAAAC AACCTTAGAG GAAGTTAAAA CGACCTTAAG CCAAGGAAAT CTGACGCTAA TGCCGTTAGA GATCCCCCAA GGATTAAGGT TACAGGGTAA AACGGGAGCG ATTCAAGATT TACCCGGATT TAAAGAGGGA TGGTGGACGG TACAAGATAG CAGTGCTCAA TTGGTGAGCC ATTTACTTGA TCCTCAGCCA TCGGAGGTGA TTATCGATGC CTGTGCTGCA CCAGGGGGAA AAACCACCCA TATTGCTGAA TTAATGGGGG ATCAAGGAAC AATTTGGGCT TGCGATCGCT ATGCCTCCCG CTTGAAAAAA TTGTCAGCCA ATAAGGAACG ATTGCAGCTA AACTCAATTA AAATCGTTAC GGGAGATAGT CGTCAATTAG ACCAATTTCA GGGAATTGCT GATCGCGTCT TAGTGGATGC ACCCTGTTCA GGACTGGGAA CCCTACACCG ACACCCTGAT ATTCGTTGGC GACAAACCCC AGAAAAGATC GAAGAATTGG CTATTTTACA GAAAGAATTA TTAGAAACGA CAGCTAATTG GGTCAAACCC CAAGGGATTT TAGTCTATGC TACTTGTACT TTAACTTATC AAGAAAATGA AGGAGTTATT GAACACTTCC TTGCTTCCCA TCCCCATTGG AAGATTGATG TCCCTTCTCC TGATTCACCC GCAGCTAAGT GGATGACAGC ATCAGGAGCG ATAAAAATTT TACCTCATCA ACAGGACATG GATGGATTTT TCATGGTGAA GTTAAAGAAA GGTTGA
|
Protein sequence | MSNSDQITHA RPLALVILRE IERRGTFTDI AIDRGLKSTD LSGSDRALVT ELVYGIVRRK RTLDTLIDQL GKKASHQQPP DLRLILYIGL YQLRYLSQIP PSAAVNTAVN LAKENSLQRL SGVVNGILRQ YIRLAQENND PLILPDDPIS RLGVLYSFPD FMIKLWLEQW GLETTEELCN WFNQPPVLDI RINPLKTTLE EVKTTLSQGN LTLMPLEIPQ GLRLQGKTGA IQDLPGFKEG WWTVQDSSAQ LVSHLLDPQP SEVIIDACAA PGGKTTHIAE LMGDQGTIWA CDRYASRLKK LSANKERLQL NSIKIVTGDS RQLDQFQGIA DRVLVDAPCS GLGTLHRHPD IRWRQTPEKI EELAILQKEL LETTANWVKP QGILVYATCT LTYQENEGVI EHFLASHPHW KIDVPSPDSP AAKWMTASGA IKILPHQQDM DGFFMVKLKK G