Gene Information |
Locus tag | CPR_1712 |
Symbol | sun |
ID | 4206366 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | - |
Start bp | 1907519 |
End bp | 1908847 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 28% |
IMG OID | 642566262 |
Product | sun protein |
Protein accession | YP_699027 |
Protein GI | 110803437 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0144] tRNA and rRNA cytosine-C5-methylases |
TIGRFAM ID | [TIGR00446] NOL1/NOP2/sun family putative RNA methylase; [TIGR00563] ribosomal RNA small subunit methyltransferase RsmB |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.212482 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGCAA GAAAAATAAT AGTTGAAATA TTAGACAATG TCTTATTAAA TGGAGCATAT TCAAATATAG AAATAAATAA GCAATTTGCA TCTAATGATA TAGATCCAAA AGATAAGGGA TTAATAACAG AGGTTGTTTA TGGAACAATA AAATACAAAA AAATGATAGA TATAATTCTT TCAAATTTTG TTGCTGATAT TGGTAAGATA GATGAGAGTG TAGTAAACAT ATTAAGAAGT GCTATATATC AAATGAAATT CCTAGATAGG GTTCCTCCAT ACGCAATAGT TAATGAAGCG GTAAACTTAA CTAAAGAAAC TGAACCTAAT TTAGCTAAGT TTGTAAATGG AGTTTTAAGA AATTATTTAA GAAATGAAAA TAAAAACTTT AAAGTTGGAT TAAGAAATAA CGAAGCTTTA TGTTATGACT TTTCTTTTGA CAGATGGATG ATAGAAATGT TCATAAAACA ATACGGAAAA GATGATGCTT TAAGAATACT TAGAGGATTA AATACAATTC CGTATGTAAC AGTTAGAGTT AATACATGTA AAGCTGATTA TGATGAAGTT TATGAAAGAC TTGAAGAAGA AGGATATGAT ATAGAGGAAG GTGCATTTTC ACCAGAAGCT ATCATAATCA AAAAAGGTAG TGCAATAGAA AAAAATAAAC TTTATCAAGA AGGATTAATA ACAGTTCAAG ATGAAAGCGC TATGTTAGTA GCTCCTTTAT TTGATTTAAA GGATGATGAA CAAGTCATGG ATCTATGTAG TGCACCAGGA ACTAAAGCAA CTCATATAGG CGAATTAATG ATGAATAAAG GAAAAGTAGT AGCTTTTGAT ATTCATGACC ATAAGTTAAC TTTGATAAAG GAAAATATAG ATAGATTAGG ATTAACTAAT GTTGAAGTTG AATTAGGAGA TGCTACAAAG ATAAATTCTA AGTATATAAA TTGGGCTGAT AGAGTATTAT TAGATGTACC TTGCTCAGGT CTTGGAATTA TAAGAAAGAA ACCAGAAATA AAATGGAATA AAAAGAATAA TGATTTAACA GAAGTTGTTA AGGTTCAAAA AGAAATATTA AAAAATGCTT GGAATTATTT AAGAGAAGGT GGAGAATTAG TTTACTCTAC TTGTACTTTA AATAAAAAAG AAAATGAAGA AGTTATAGAT TGGTTCGTAG AAAAAAATTC AGACTGCGAA GTAGAAAAAG TATTTTTAGG TAAGGCTGAT AATGTTGTAT ATAATGATAA CGGAAGTGTT ACCATATTAC CTAATAAGTA CATGGATGGT TTCTTTATTG CTAAGCTTAA GAAAAAAGAA AGTAAATAG
|
Protein sequence | MNARKIIVEI LDNVLLNGAY SNIEINKQFA SNDIDPKDKG LITEVVYGTI KYKKMIDIIL SNFVADIGKI DESVVNILRS AIYQMKFLDR VPPYAIVNEA VNLTKETEPN LAKFVNGVLR NYLRNENKNF KVGLRNNEAL CYDFSFDRWM IEMFIKQYGK DDALRILRGL NTIPYVTVRV NTCKADYDEV YERLEEEGYD IEEGAFSPEA IIIKKGSAIE KNKLYQEGLI TVQDESAMLV APLFDLKDDE QVMDLCSAPG TKATHIGELM MNKGKVVAFD IHDHKLTLIK ENIDRLGLTN VEVELGDATK INSKYINWAD RVLLDVPCSG LGIIRKKPEI KWNKKNNDLT EVVKVQKEIL KNAWNYLREG GELVYSTCTL NKKENEEVID WFVEKNSDCE VEKVFLGKAD NVVYNDNGSV TILPNKYMDG FFIAKLKKKE SK
|
| |
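The length fields in this record can be cross-checked against its coordinates. The following is a minimal sketch, not part of the source database: the helper names `gene_length`, `protein_length`, and `gc_percent` are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS (TAG here) is a stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


# Start bp and End bp copied from the record above.
length = gene_length(1907519, 1908847)
residues = protein_length(length)
print(length, residues)  # prints: 1329 442
```

Both values agree with the Gene Length (1329 bp) and Protein Length (442 aa) fields. Passing the full Gene sequence string to `gc_percent` would give a figure to compare with the reported GC content; the 10-base blocks separated by spaces need no preprocessing because whitespace is stripped first.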