Gene CPR_C0012 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_C0012 
Symbol 
ID4206662 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008265 
Strand
Start bp12738 
End bp13952 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content28% 
IMG OID 
Productphage major capsid protein, HK97 family 
Protein accessionYP_699941 
Protein GI110804052 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones88 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAAAAG AATTAAGAGA ATTATTAAAT CAGTTAGATT CAAAGAATAA GGAGTTAAAT 
TCTTTATTAA ATAAAGATGG AGTAACAGCT GAAGAATTAA ATAAAACTTC AAATGAAATA
GATATTTTAC AAGCAAAAAT TGAAGCTCAA AAAAGAAAAG AAAATATTGA AAATAACTTC
AATGAAGATA ATGTTAAGTC TTTAAATACA GGAAAAGAAG AAAATGTTAT TTATAATGGT
GCTTTATTTG TTAGAGCAAT AGCAGACAAT TTACTTAAAC AAAAAAATCA AAGAGGATTA
AATCTTTCTG AAAAAGAAAT AAATGCTATA TCAGAAAATA TAGATGAAGA TGGTGGCTAT
GCTGTTCCAG AGGATATTCA GACAAAGATT AATACAAGAT TAAAAGACAC AACAGATTTA
TATAACATGG TAGATTATGA GCCTGTATTT ACTAGAAGTG GTAGTAGAAC ATATGAAAAG
AGAAGTAAGC AAAAACCTAT GAAACCATTA AGTGAAAACC AACAGATTCC TACTAATGGC
GATAATGGTA AACTTGAGAG ATTTAATTTT AAATTAAAAG ATTTAGCAGA TTTTATGTCA
ATACCAAATG ATTTATTAAA ATTTGCTGAT AAAAGTTTAG AAGATTGGAT AATAAATTGG
TTTGTAGATA AAGTTAGAAT AACTAGAAAT GCAGAAATTT TATATGGAGC AGGTGGAGAT
GAACATGCTA CTGGTATTAT GACAGCAAAT AAATTTAAAA AGATTACATT ACCAAAATCA
CCAGCATTAA AGGATTTTAA GAAATGTAAA AATGTTGAGT TATTAAATGT ATTTAAAGCA
ACTTCTAGTT GGATTGTTAA TCAAGATGGA TTTAACTACT TAGATAGTTT AGAAGATAAG
ACAGGTAGAC CATATCTTCA ACCAGATCCA AAAGACCCAA CACAATATAG ATTCTTAGGA
TTACCAGTTA TTGAATTACC TAACGACCTT TTATTATCAA CTGAAAGTGC TATTCCAGTT
TTATTAGGTG ATACAAAAGA AGCTTATAAA TATGTTTCAG ATGGAGCATA TGAACTCGCT
ACAACAAATA TAGGAGCTGG AGCATTTGAA ACTAACACAA CAAAGGCAAG AATAATAATG
AGAATAGATG GAAATGTTAA AGATTCAGAA GCATTATTAA TTGCAGAAAT TCCAGTTGAA
TCAGTACAAG CTTAA
 
Protein sequence
MSKELRELLN QLDSKNKELN SLLNKDGVTA EELNKTSNEI DILQAKIEAQ KRKENIENNF 
NEDNVKSLNT GKEENVIYNG ALFVRAIADN LLKQKNQRGL NLSEKEINAI SENIDEDGGY
AVPEDIQTKI NTRLKDTTDL YNMVDYEPVF TRSGSRTYEK RSKQKPMKPL SENQQIPTNG
DNGKLERFNF KLKDLADFMS IPNDLLKFAD KSLEDWIINW FVDKVRITRN AEILYGAGGD
EHATGIMTAN KFKKITLPKS PALKDFKKCK NVELLNVFKA TSSWIVNQDG FNYLDSLEDK
TGRPYLQPDP KDPTQYRFLG LPVIELPNDL LLSTESAIPV LLGDTKEAYK YVSDGAYELA
TTNIGAGAFE TNTTKARIIM RIDGNVKDSE ALLIAEIPVE SVQA