Gene PCC8801_0652 details


Gene Information

Locus tag: PCC8801_0652
Symbol:
ID: 7105711
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 680668
End bp: 682023
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 43%
IMG OID: 643473752
Product: sun protein
Protein accession: YP_002370895
Protein GI: 218245524
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0144] tRNA and rRNA cytosine-C5-methylases
TIGRFAM ID: [TIGR00446] NOL1/NOP2/sun family putative RNA methylase
            [TIGR00563] ribosomal RNA small subunit methyltransferase RsmB
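
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes one terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length(680668, 682023)
print(length)                  # 1356
print(protein_length(length))  # 451
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of this record.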


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAAACT CAGATCAGAT AACCCATGCG CGTCCCTTGG CCTTGGTAAT TCTCCGTGAG 
ATTGAAAGAC GAGGAACTTT CACGGATATA GCGATTGATC GCGGTTTAAA ATCAACCGAT
CTTAGTGGCA GCGATCGCGC TTTAGTCACC GAATTAGTCT ATGGGATCGT CCGACGCAAA
CGAACCCTTG ATACCCTGAT TGATCAGTTG GGGAAAAAAG CCTCCCACCA ACAACCTCCG
GATTTACGTC TGATCCTTTA TATTGGACTC TATCAACTGC GTTATTTAAG CCAAATTCCC
CCTTCTGCGG CGGTGAATAC TGCCGTTAAC TTGGCCAAAG AAAATAGCTT ACAACGGCTT
TCGGGGGTGG TTAATGGCAT TTTACGGCAA TATATTCGCC TTGCTCAAGA AAACAATGAC
CCTCTAATCT TACCCGATGA TCCCATCTCA AGATTAGGGG TTCTCTATAG TTTTCCTGAT
TTCATGATTA AACTGTGGTT AGAACAATGG GGACTAGAAA CCACCGAAGA ATTATGTAAT
TGGTTTAATC AACCTCCTGT CTTAGATATC CGGATTAATC CTTTAAAAAC GACCTTAGAG
GAAGTTAAAA CGACCTTAAG CCAAGGAAAT CTGACGCTAA TGCCGTTAGA GATCCCCCAA
GGATTAAGAT TACAGGGTAA AACGGGAGCG ATTCAAGATT TACCCGGATT TAAAGAGGGA
TGGTGGACGG TACAAGATAG CAGTGCTCAA TTGGTGAGCC ATTTACTTGA TCCTCAGCCA
TCGGAGGTGA TTATCGATGC CTGTGCTGCA CCAGGGGGAA AAACCACCCA TATTGCTGAA
TTAATGGGGG ATCAAGGAAC AATTTGGGCT TGCGATCGCT ATGCCTCCCG CTTGAAAAAA
TTGTCAGCCA ATAAGGAACG ATTGCAGCTA AACTCAATTA AAATCGTTAC GGGAGATAGT
CGTCAATTAG ACCAATTTCA GGGAATTGCT GATCGCGTCT TAGTGGATGC ACCCTGTTCA
GGACTGGGAA CCCTACACCG ACACCCTGAT ATTCGTTGGC GACAAACCCC AGAAAAGATC
GAAGAATTGG CTATTTTACA GAAAGAATTA TTAGAAACGA CAGCTAATTG GGTCAAACCC
CAAGGGATTT TAGTCTATGC TACTTGTACT TTAACTTATC AAGAAAATGA AGGAGTTATT
GAACACTTCC TTGCTTCCCA TCCCCATTGG AAGATTGATG TCCCCTCTCC TAATTCACCC
GTAGCTAAGT GGATGACAGC ATCAGGATCG ATAAAAATTT TACCTCATCA ACAGGACATG
GATGGATTTT TCATGGTGAA GTTAAAGAAA GGTTGA
 
Protein sequence
MSNSDQITHA RPLALVILRE IERRGTFTDI AIDRGLKSTD LSGSDRALVT ELVYGIVRRK 
RTLDTLIDQL GKKASHQQPP DLRLILYIGL YQLRYLSQIP PSAAVNTAVN LAKENSLQRL
SGVVNGILRQ YIRLAQENND PLILPDDPIS RLGVLYSFPD FMIKLWLEQW GLETTEELCN
WFNQPPVLDI RINPLKTTLE EVKTTLSQGN LTLMPLEIPQ GLRLQGKTGA IQDLPGFKEG
WWTVQDSSAQ LVSHLLDPQP SEVIIDACAA PGGKTTHIAE LMGDQGTIWA CDRYASRLKK
LSANKERLQL NSIKIVTGDS RQLDQFQGIA DRVLVDAPCS GLGTLHRHPD IRWRQTPEKI
EELAILQKEL LETTANWVKP QGILVYATCT LTYQENEGVI EHFLASHPHW KIDVPSPNSP
VAKWMTASGS IKILPHQQDM DGFFMVKLKK G
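
As a rough illustration of how the protein sequence relates to the gene sequence, the sketch below translates DNA using the codon assignments shared by NCBI translation tables 1 and 11, and computes GC content as a percentage. It assumes the sequence is pasted as text in the space-separated 10-base blocks shown above. The helper names `clean`, `translate`, and `gc_content` are hypothetical. Table 11 also permits alternative start codons, which this sketch does not handle; that has no effect here because the gene begins with ATG.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); tables 1 and 11 share these.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (block separators, newlines) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above
print(translate("ATGTCAAACT CAGATCAGAT AACCCATGCG"))  # MSNSDQITHA
```

The ten residues printed match the start of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the Protein Length and GC content fields.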