Gene Information |
Locus tag | CPR_C0012 |
Symbol | |
ID | 4206662 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008265 |
Strand | - |
Start bp | 12738 |
End bp | 13952 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 28% |
IMG OID | |
Product | phage major capsid protein, HK97 family |
Protein accession | YP_699941 |
Protein GI | 110804052 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 88 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAAAAG AATTAAGAGA ATTATTAAAT CAGTTAGATT CAAAGAATAA GGAGTTAAAT TCTTTATTAA ATAAAGATGG AGTAACAGCT GAAGAATTAA ATAAAACTTC AAATGAAATA GATATTTTAC AAGCAAAAAT TGAAGCTCAA AAAAGAAAAG AAAATATTGA AAATAACTTC AATGAAGATA ATGTTAAGTC TTTAAATACA GGAAAAGAAG AAAATGTTAT TTATAATGGT GCTTTATTTG TTAGAGCAAT AGCAGACAAT TTACTTAAAC AAAAAAATCA AAGAGGATTA AATCTTTCTG AAAAAGAAAT AAATGCTATA TCAGAAAATA TAGATGAAGA TGGTGGCTAT GCTGTTCCAG AGGATATTCA GACAAAGATT AATACAAGAT TAAAAGACAC AACAGATTTA TATAACATGG TAGATTATGA GCCTGTATTT ACTAGAAGTG GTAGTAGAAC ATATGAAAAG AGAAGTAAGC AAAAACCTAT GAAACCATTA AGTGAAAACC AACAGATTCC TACTAATGGC GATAATGGTA AACTTGAGAG ATTTAATTTT AAATTAAAAG ATTTAGCAGA TTTTATGTCA ATACCAAATG ATTTATTAAA ATTTGCTGAT AAAAGTTTAG AAGATTGGAT AATAAATTGG TTTGTAGATA AAGTTAGAAT AACTAGAAAT GCAGAAATTT TATATGGAGC AGGTGGAGAT GAACATGCTA CTGGTATTAT GACAGCAAAT AAATTTAAAA AGATTACATT ACCAAAATCA CCAGCATTAA AGGATTTTAA GAAATGTAAA AATGTTGAGT TATTAAATGT ATTTAAAGCA ACTTCTAGTT GGATTGTTAA TCAAGATGGA TTTAACTACT TAGATAGTTT AGAAGATAAG ACAGGTAGAC CATATCTTCA ACCAGATCCA AAAGACCCAA CACAATATAG ATTCTTAGGA TTACCAGTTA TTGAATTACC TAACGACCTT TTATTATCAA CTGAAAGTGC TATTCCAGTT TTATTAGGTG ATACAAAAGA AGCTTATAAA TATGTTTCAG ATGGAGCATA TGAACTCGCT ACAACAAATA TAGGAGCTGG AGCATTTGAA ACTAACACAA CAAAGGCAAG AATAATAATG AGAATAGATG GAAATGTTAA AGATTCAGAA GCATTATTAA TTGCAGAAAT TCCAGTTGAA TCAGTACAAG CTTAA
|
Protein sequence | MSKELRELLN QLDSKNKELN SLLNKDGVTA EELNKTSNEI DILQAKIEAQ KRKENIENNF NEDNVKSLNT GKEENVIYNG ALFVRAIADN LLKQKNQRGL NLSEKEINAI SENIDEDGGY AVPEDIQTKI NTRLKDTTDL YNMVDYEPVF TRSGSRTYEK RSKQKPMKPL SENQQIPTNG DNGKLERFNF KLKDLADFMS IPNDLLKFAD KSLEDWIINW FVDKVRITRN AEILYGAGGD EHATGIMTAN KFKKITLPKS PALKDFKKCK NVELLNVFKA TSSWIVNQDG FNYLDSLEDK TGRPYLQPDP KDPTQYRFLG LPVIELPNDL LLSTESAIPV LLGDTKEAYK YVSDGAYELA TTNIGAGAFE TNTTKARIIM RIDGNVKDSE ALLIAEIPVE SVQA
|
| |
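The length fields in this record can be cross-checked against the coordinates with a little arithmetic. The sketch below is illustrative only and is not part of the original record. It assumes the start and end positions are 1-based and inclusive (which is consistent with the reported 1215 bp) and that the final codon of the CDS is a stop codon (the gene sequence ends in TAA). The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers introduced here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


# Values taken from the record: Start bp 12738, End bp 13952.
length = gene_length(12738, 13952)
print(length, protein_length(length))  # prints "1215 404"
```

Passing the full gene sequence string from the record to `gc_percent` should give a value close to the reported GC content, since the function strips the spaces between the 10-base blocks before counting.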