Gene CPR_1712 details

Gene Information

Locus tag: CPR_1712
Symbol: sun
ID: 4206366
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 1907519
End bp: 1908847
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 28%
IMG OID: 642566262
Product: sun protein
Protein accession: YP_699027
Protein GI: 110803437
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0144] tRNA and rRNA cytosine-C5-methylases
TIGRFAM ID: [TIGR00446] NOL1/NOP2/sun family putative RNA methylase
            [TIGR00563] ribosomal RNA small subunit methyltransferase RsmB
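
The coordinate and length fields above are mutually consistent, which a short check can confirm. The following is a minimal sketch, not part of the database record: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the reported protein length excludes the terminal stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(1907519, 1908847)
print(length_bp)                  # 1329
print(protein_length(length_bp))  # 442
```

Both results agree with the Gene Length and Protein Length fields.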


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.212482
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATGCAA GAAAAATAAT AGTTGAAATA TTAGACAATG TCTTATTAAA TGGAGCATAT 
TCAAATATAG AAATAAATAA GCAATTTGCA TCTAATGATA TAGATCCAAA AGATAAGGGA
TTAATAACAG AGGTTGTTTA TGGAACAATA AAATACAAAA AAATGATAGA TATAATTCTT
TCAAATTTTG TTGCTGATAT TGGTAAGATA GATGAGAGTG TAGTAAACAT ATTAAGAAGT
GCTATATATC AAATGAAATT CCTAGATAGG GTTCCTCCAT ACGCAATAGT TAATGAAGCG
GTAAACTTAA CTAAAGAAAC TGAACCTAAT TTAGCTAAGT TTGTAAATGG AGTTTTAAGA
AATTATTTAA GAAATGAAAA TAAAAACTTT AAAGTTGGAT TAAGAAATAA CGAAGCTTTA
TGTTATGACT TTTCTTTTGA CAGATGGATG ATAGAAATGT TCATAAAACA ATACGGAAAA
GATGATGCTT TAAGAATACT TAGAGGATTA AATACAATTC CGTATGTAAC AGTTAGAGTT
AATACATGTA AAGCTGATTA TGATGAAGTT TATGAAAGAC TTGAAGAAGA AGGATATGAT
ATAGAGGAAG GTGCATTTTC ACCAGAAGCT ATCATAATCA AAAAAGGTAG TGCAATAGAA
AAAAATAAAC TTTATCAAGA AGGATTAATA ACAGTTCAAG ATGAAAGCGC TATGTTAGTA
GCTCCTTTAT TTGATTTAAA GGATGATGAA CAAGTCATGG ATCTATGTAG TGCACCAGGA
ACTAAAGCAA CTCATATAGG CGAATTAATG ATGAATAAAG GAAAAGTAGT AGCTTTTGAT
ATTCATGACC ATAAGTTAAC TTTGATAAAG GAAAATATAG ATAGATTAGG ATTAACTAAT
GTTGAAGTTG AATTAGGAGA TGCTACAAAG ATAAATTCTA AGTATATAAA TTGGGCTGAT
AGAGTATTAT TAGATGTACC TTGCTCAGGT CTTGGAATTA TAAGAAAGAA ACCAGAAATA
AAATGGAATA AAAAGAATAA TGATTTAACA GAAGTTGTTA AGGTTCAAAA AGAAATATTA
AAAAATGCTT GGAATTATTT AAGAGAAGGT GGAGAATTAG TTTACTCTAC TTGTACTTTA
AATAAAAAAG AAAATGAAGA AGTTATAGAT TGGTTCGTAG AAAAAAATTC AGACTGCGAA
GTAGAAAAAG TATTTTTAGG TAAGGCTGAT AATGTTGTAT ATAATGATAA CGGAAGTGTT
ACCATATTAC CTAATAAGTA CATGGATGGT TTCTTTATTG CTAAGCTTAA GAAAAAAGAA
AGTAAATAG
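
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be normalized before its length or GC content can be computed. The sketch below shows one way to do that. The helper names are hypothetical, and the GC figure is a simple count of G and C over all bases, which may not match exactly how the database derived its rounded value.

```python
def normalize_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence; uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above; paste the full block to check the whole gene.
first_line = "ATGAATGCAA GAAAAATAAT AGTTGAAATA TTAGACAATG TCTTATTAAA TGGAGCATAT"
seq = normalize_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Applied to the full sequence, `len(seq)` should equal the Gene Length field (1329).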
 
Protein sequence
MNARKIIVEI LDNVLLNGAY SNIEINKQFA SNDIDPKDKG LITEVVYGTI KYKKMIDIIL 
SNFVADIGKI DESVVNILRS AIYQMKFLDR VPPYAIVNEA VNLTKETEPN LAKFVNGVLR
NYLRNENKNF KVGLRNNEAL CYDFSFDRWM IEMFIKQYGK DDALRILRGL NTIPYVTVRV
NTCKADYDEV YERLEEEGYD IEEGAFSPEA IIIKKGSAIE KNKLYQEGLI TVQDESAMLV
APLFDLKDDE QVMDLCSAPG TKATHIGELM MNKGKVVAFD IHDHKLTLIK ENIDRLGLTN
VEVELGDATK INSKYINWAD RVLLDVPCSG LGIIRKKPEI KWNKKNNDLT EVVKVQKEIL
KNAWNYLREG GELVYSTCTL NKKENEEVID WFVEKNSDCE VEKVFLGKAD NVVYNDNGSV
TILPNKYMDG FFIAKLKKKE SK
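
The protein sequence can be reproduced from the gene sequence with translation table 11 (bacterial, archaeal and plant plastid code). The sketch below is illustrative only and uses hypothetical names. It builds the codon table from the standard NCBI amino-acid string, in which table 11 assigns the same amino acids as the standard code and differs only in its permitted start codons. It does not handle alternative start codons, which is adequate here because the gene begins with ATG.

```python
from itertools import product

# NCBI codon order: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS codon by codon; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGAATGCAA GAAAAATAAT AGTTGAAATA"))  # MNARKIIVEI
print(translate("AGTAAATAG"))                         # SK*
```

The two fragments match the first ten and last two residues of the protein sequence, with the trailing `*` marking the stop codon that is excluded from the 442 aa count.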