Gene CPR_1333 details


Gene Information

Locus tag: CPR_1333
Symbol:
ID: 4204575
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 1502532
End bp: 1503710
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 30%
IMG OID: 642565887
Product: hypothetical protein
Protein accession: YP_698653
Protein GI: 110802423
COG category: [S] Function unknown
COG ID: [COG2718] Uncharacterized conserved protein
TIGRFAM ID: [TIGR02877] sporulation protein YhbH
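
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes a single stop codon; neither convention is stated on this page, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1502532, 1503710)
print(length)                           # 1179
print(expected_protein_length(length))  # 392
```

Under these assumptions, both results agree with the reported gene length (1179 bp) and protein length (392 aa).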


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.181704
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCATAT TTAGAGATCA AGCAGAAAAC CATGTAGAAC ATGATAGATC TATAGAGGAT 
AGAAGACGTC ACAGGCAGTT AGTAGAAAAA TCTATTAAAG AAAATTTAGG GGATATACTT
TCAGAGGAGA GCATAATTGG AGAGACTAAA AATAAAAAAT ATAAGATTCC TATTAGAGGA
ATAAAGGAAT ATCAATTTAT TTATGGTGCA AATAATAAAG GGGTAACAAC AGGTACTGGA
GAAGAAAGTC GAGGGGATAG AATTTCTAGT GATAAAAGAA AAGCTATTTC TAATAATAAA
GCAGGAAATC AGGAGGGAAA GGATATATAT GAAACTGAGA TAACCTTAGA GGAACTTATG
GATTATATAG TTGAGGATCT TGATTTACCT AACTTAGATA AGAAGAAGTA CTCAGAAATA
ATAGTTGAAA GTGCAGCTAA AAAAAGAGGA TATCAAAAAT ATGGTGTAAG ACCAAGACTT
GCAAAGAAAA AAACTGTTAT GTGTAAAATA GCTAGAAAAC AAGGAAAAAA AAGAGCATTG
CGTGAAATAG GAGAAGAAGC GAAAATAGGA AGATTTCCTT TTAGAGAAGA TGACTTAAGA
TATTATAAAG TGAAAAAACA TCCTAAAAAA GAAAGCAATG CTGTAATGAT TTTTATAATG
GACGTTTCAG GTTCTATGGA TAATACTAAA AAATATTTAG CTAGATCATT TTTCTTTGTT
TTATCTAGGT TTATAAGAAG AAAATATAAT AATGTAGCCT TTGAATTTAT ATCTCATACT
ACTACAGCTA AAAATGTTAA TGAATATGAG TTTTTCCACA AAGGGGAATC AGGAGGAACT
TATATATCTT CAGGAATAAA TGCTGCCATA GATTTAATAA AAGAAAAGTA TAACCCAGGG
GTTTGGAATA TATATCCTTT CTATGCTTCA GACGGTGATA ACTGGAGTGA GGATAATGAA
AAGGCTATGG AAGCTGTAAA TGAAATTTCA GATTTAAGTA ATATGTTTGG ATATATAGAG
CTTTTACCAT CAACTTATTC TACTACAATG TTCTACAGAT TTAAAAAGGA AATAAGTAAG
GAAAATTTTG TCTCTGTAAC TGTAAAGGAA AAGAAGGATC TGTGGAATGC TATAAAATAT
ATGCTATCTG AAGAACTACA GGAAAAGAAT AAGGAATGA
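
The gene sequence above is laid out in space-separated blocks of ten bases. As a hedged illustration of how a GC-content figure such as the 30% reported in the gene information could be derived, the sketch below strips the whitespace and counts G and C bases. The exact method the database uses is not stated here, and `gc_content` is a hypothetical helper name.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two blocks of the gene sequence, copied from the listing above.
first_blocks = "ATGGCCATAT TTAGAGATCA"
print(f"{gc_content(first_blocks):.0%}")  # 35%
```

Passing the full gene sequence to the same function gives the value for the whole gene rather than for this short prefix.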
 
Protein sequence
MAIFRDQAEN HVEHDRSIED RRRHRQLVEK SIKENLGDIL SEESIIGETK NKKYKIPIRG 
IKEYQFIYGA NNKGVTTGTG EESRGDRISS DKRKAISNNK AGNQEGKDIY ETEITLEELM
DYIVEDLDLP NLDKKKYSEI IVESAAKKRG YQKYGVRPRL AKKKTVMCKI ARKQGKKRAL
REIGEEAKIG RFPFREDDLR YYKVKKHPKK ESNAVMIFIM DVSGSMDNTK KYLARSFFFV
LSRFIRRKYN NVAFEFISHT TTAKNVNEYE FFHKGESGGT YISSGINAAI DLIKEKYNPG
VWNIYPFYAS DGDNWSEDNE KAMEAVNEIS DLSNMFGYIE LLPSTYSTTM FYRFKKEISK
ENFVSVTVKE KKDLWNAIKY MLSEELQEKN KE
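
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. Below is a minimal sketch of that translation, assuming the standard NCBI base ordering (T, C, A, G) for the codon table. Table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons, which does not matter here because the gene begins with ATG. The `translate` helper is illustrative and not part of any library.

```python
from itertools import product

# Amino acids for all 64 codons in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGGCCATAT TTAGAGATCA AGCAGAAAAC"))  # MAIFRDQAEN
```

The output matches the first ten residues of the protein sequence, and translating the final nine bases (AAGGAATGA) gives the closing residues KE followed by the stop codon.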