Gene CPF_1994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1994 
Symbolsun 
ID4202361 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2233525 
End bp2234853 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content28% 
IMG OID638082863 
Productribosomal RNA small subunit methyltransferase B 
Protein accessionYP_696427 
Protein GI110800441 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0144] tRNA and rRNA cytosine-C5-methylases 
TIGRFAM ID[TIGR00446] NOL1/NOP2/sun family putative RNA methylase
[TIGR00563] ribosomal RNA small subunit methyltransferase RsmB 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGCAA GAAAAATAAT AGTTGAAATA TTAGACAATG TCTTATTAAA TGGAGCATAT 
TCAAATATAG AAATAAATAA GCAATTTGCA TCTAATGATA TAGATCCAAA AGATAAGGGA
TTAATAACAG AGGTTGTTTA TGGAACAATA AAATACAAAA AAATGATAGA TATAATTCTT
TCAAGTTTTG TTGCTGATAT TGGTAAGATA GATGAGAGTG TAGTAAACAT ATTAAGAAGT
GCTATATATC AAATGAAATT CTTAGATAGA GTTCCTCCAT ACGCAATAGT TAATGAAGCT
GTAAACTTAA CTAAAGAAAC TGAACCTAAT TTAGCTAAGT TTGTAAATGG AGTTTTAAGA
AATTATTTAA GAAATGAAAA TAAAAACTTT AAAGTTGGAT TAAGAAATAA CGAAGCTTTA
TGTTATGACT TTTCTTTTGA TAGATGGATG ATAGAAATGT TCATAAAACA ATATGGAAAA
GAGGATGCTT TAAGAATACT TAGAGGATTA AATACAGTTC CATATGTAAC AGTTAGAGTT
AATACATGTA AAGCTGATTA TGATGAAGTT TATGAAAGAC TTGAAGAAGA AGGATATGAT
ATAGAGGAAG GTGCATTTTC ACCAGAAGCT ATCATAATCA AAAAAGGTAG TGCAATAGAG
AAAAATAAAC TTTATCAAGA AGGATTAATA ACAGTTCAAG ATGAAAGCGC TATGTTAGTA
GCTCCTTTAT TTGATTTAAG GGGTGATGAA CAAGTTATGG ATCTATGTAG TGCACCAGGA
ACTAAAGCAA CTCATATAGG CGAATTAATG ATGAATAAAG GAAAAGTAGT GGCTTTTGAT
ATTCATGACC ATAAGTTAGC CTTAATAAAG GAAAATATAG ATAGATTAGG ATTAACTAAT
GTTGAAGTTG AATTAGGAGA TGCTACAAAG ATAAATTCTA AGTATATAAA TTGGGCTGAT
AGAGTATTAT TAGATGTACC TTGCTCAGGT CTTGGAATTA TAAGAAAGAA ACCAGAAATA
AAATGGAATA AAAAGAACAA TGATTTAACA GAAGTTGTTA AGGTTCAAAA AGAAATATTA
AAAAATGCTT GGAATTATTT AAGAGAAGGT GGAGAATTAG TTTACTCTAC TTGTACTTTA
AATAAAAAAG AAAATGAAGA AGTTATAGAT TGGTTCGTAG AAAGAAATTC AGACTGCGAA
GTAGAAAAAG TATTTTTAGG TAAGGCTGAT AATGTGGTAT ATAATGATAA CGGAAGTGTT
ACCATATTAC CTAATAAGTA CATGGATGGT TTCTTTATTG CTAAGCTTAA GAAAAAAGAA
AGTAAATAG
 
Protein sequence
MNARKIIVEI LDNVLLNGAY SNIEINKQFA SNDIDPKDKG LITEVVYGTI KYKKMIDIIL 
SSFVADIGKI DESVVNILRS AIYQMKFLDR VPPYAIVNEA VNLTKETEPN LAKFVNGVLR
NYLRNENKNF KVGLRNNEAL CYDFSFDRWM IEMFIKQYGK EDALRILRGL NTVPYVTVRV
NTCKADYDEV YERLEEEGYD IEEGAFSPEA IIIKKGSAIE KNKLYQEGLI TVQDESAMLV
APLFDLRGDE QVMDLCSAPG TKATHIGELM MNKGKVVAFD IHDHKLALIK ENIDRLGLTN
VEVELGDATK INSKYINWAD RVLLDVPCSG LGIIRKKPEI KWNKKNNDLT EVVKVQKEIL
KNAWNYLREG GELVYSTCTL NKKENEEVID WFVERNSDCE VEKVFLGKAD NVVYNDNGSV
TILPNKYMDG FFIAKLKKKE SK