Gene CPR_C0015 details

Gene Information

Locus tag: CPR_C0015
Symbol:
ID: 4206689
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008265
Strand:
Start bp: 16105
End bp: 17850
Gene Length: 1746 bp
Protein Length: 581 aa
Translation table: 11
GC content: 30%
IMG OID:
Product: putative phage terminase, large subunit
Protein accession: YP_699944
Protein GI: 110804033
COG category:
COG ID:
TIGRFAM ID:
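
The coordinates and lengths listed above agree with each other, which a little arithmetic confirms. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database tooling.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(16105, 17850)
print(length, protein_length(length))  # 1746 581
```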


Plasmid Coverage information

Num covering plasmid clones: 68
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATGATA GAGTTACTAG GTACGCTATA GATGTACTTG AAGGGAAAGA AGTTGCTGGA 
AGATATGTTA AGTTAGCGTG TCAAAGGCAT ATTGATGACT TAGAAAGATC TAAATTAGCA
CCTTATAAAT ATGAATTTGA TTTAGAAAAA GCTTTAGAAT CTATTGATTT TTTTGAAGAT
TTAAGGTTTA CAGATGGTGA AATAGCTGGT CAACAAGTTA AGCTTTTTGG ATTTCAAGAT
TTTATTGTTG GTTCAATATT TGGATGGGTT TGTAAAGGAA CTGGTTATAG GAGATTTAAA
AAATCTTATG TTCAGTTAGC AAGAAAAAAT GCTAAATCGC TTTTAAATAG TGGAATAGGT
ATAAAATTAG CAGCATTTGA CAAATATCCG AATGCACAGG TTTACTGTAC AGCAACAAAA
ATGAAACAAG CTAGAATTGT ATGGAAACAA GCTAGAAAGT TTATAGAAAT TGAACCAGAT
TTAAGAGAAA TATTCAAAAT AAAAGATCAT GATGCAATTA TAGAATCGTT GATTAATGGT
GGAGAAATAA TGGCTTTGGG AAGAGATACT GGTACTATAG ATGGATTTGA CCCACACGGT
GGAATAATTG ATGAATATCA TAGCCATAAG ACTAATCAAA TGGTTAAGCT GCTTGAAGAT
GGTTCAGTAA ACCAAGCGGA AAGTTTAATT TCTATTATAA CGACAGCGGG ATTTAATTTA
AATGGTCCAT GTTATAAAGA ATGGGAATAT TGTAAAAATG TTTTAGAAGG AATAGTTAAT
AATGATGAGT ATTTTATTTA TATTGCTCAA ATGGATGTAG AAGATGACAT ATGGGACCCT
AATAATTGGC TTAAAGCTAA TCCTTTAGTT GCAAAACTTC CTAAAGGCCT TGAAAATTTA
AAAAGGTTTG CTGAAGAATC AAAACAAAAA GGTGGAGATG ATTTAAAAAA CTTTTTAACT
AAATCATTGA ATATATGGTA TGAGTTTTCT AATGACCAGT ATATTGGGCC GAGTGTTTGG
AAGGTTGGAG CTTCTAAATT AACATTAGAA AATTTTAAAG GTAAAACATG TTATGCTGGT
TTAGATTTAT CTAGTGGAGG GGATTTAACT TCATTAGCTC TTATATTTCC TTATGAATTT
GAAAAAGAAA ATGGGGAGAA AGTAAGAAAA TATTTTATAC ATTCCCATAG TTTTATTCCG
AAGAGAAGAG TTGCAGAACA TATTCAAAGT GATGATGTAC CATATGATGT TTGGATAGAA
AATGGGTTAT TAACAGTAAC GGAAACACTT GGAGGAATAA AAACAGATTA CAAATATATA
ATTAAGTATT TAAGAGATTT AATTGAAGAA TTTCAACTTA AAATAATTCA ATTAGGATAT
GATCCTCACA ATGCAGATAC TTTTCTTCAA GATTTAGAGG AGTTAGGATT TGATTGTGTA
GAAATATTTC AAAGTTGTAA GTGGCTCAAT GATCCAACAG AAGATTTTAA GCTTGAATGT
GAAGCTGGAA ATATAGAATA TAATGAGGAA AATGAGTTAC TAAGTTGGTC AGTTGTAAAT
GCTAAGTTAG TTTCAAATTC AAATGGAGAA ATAAAAATTG ATAAGAATTT ACAAGAAAAA
AGAATAGATC CAGTTGATGC TGTTATAGAC GCATATAAAT TAGCTTTTAA ATCTGAAAAA
TTAAATACTA AAGTGGATAT AAATAAATAT GCTGAAAAAG ACTTCTTGAA TAAGTTGTGG
AGTTGA
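
The gene sequence above is printed in blocks of 10 bases, so it has to be joined into one string before a value such as the GC content can be recomputed. The following is a minimal sketch of that step, not the method the database itself uses. The helper names are hypothetical, and the sample is only the first two blocks of the sequence, so its GC fraction is not expected to match the figure reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence, used as a small sample.
sample = clean_sequence("ATGGATGATA GAGTTACTAG")
print(len(sample), gc_fraction(sample))
```

Passing the full gene sequence to `clean_sequence` and then to `gc_fraction` gives a value that can be compared with the GC content field in the record.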
 
Protein sequence
MDDRVTRYAI DVLEGKEVAG RYVKLACQRH IDDLERSKLA PYKYEFDLEK ALESIDFFED 
LRFTDGEIAG QQVKLFGFQD FIVGSIFGWV CKGTGYRRFK KSYVQLARKN AKSLLNSGIG
IKLAAFDKYP NAQVYCTATK MKQARIVWKQ ARKFIEIEPD LREIFKIKDH DAIIESLING
GEIMALGRDT GTIDGFDPHG GIIDEYHSHK TNQMVKLLED GSVNQAESLI SIITTAGFNL
NGPCYKEWEY CKNVLEGIVN NDEYFIYIAQ MDVEDDIWDP NNWLKANPLV AKLPKGLENL
KRFAEESKQK GGDDLKNFLT KSLNIWYEFS NDQYIGPSVW KVGASKLTLE NFKGKTCYAG
LDLSSGGDLT SLALIFPYEF EKENGEKVRK YFIHSHSFIP KRRVAEHIQS DDVPYDVWIE
NGLLTVTETL GGIKTDYKYI IKYLRDLIEE FQLKIIQLGY DPHNADTFLQ DLEELGFDCV
EIFQSCKWLN DPTEDFKLEC EAGNIEYNEE NELLSWSVVN AKLVSNSNGE IKIDKNLQEK
RIDPVDAVID AYKLAFKSEK LNTKVDINKY AEKDFLNKLW S
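
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is a hedged illustration rather than the pipeline that produced this record. It builds the codon table from the amino acid string NCBI publishes for translation table 11, whose codon assignments match the standard code (table 11 differs in its permitted start codons, which this sketch ignores). The function name `translate` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace allowed) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, then its last 6 bases.
print(translate("ATGGATGATA GAGTTACTAG GTACGCTATA"))  # MDDRVTRYAI
print(translate("AGTTGA"))  # S
```

The first call reproduces the opening residues of the protein sequence, and the second shows that the final residue, S, is followed by the stop codon TGA.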