Gene CPR_0041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_0041 
Symbolplc 
ID4205371 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp48157 
End bp49353 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content33% 
IMG OID642564584 
Productphospholipase C 
Protein accessionYP_697384 
Protein GI110802823 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGAA AGATTTGTAA GGCGCTTGTT TGTGCCACGC TAGTAACTAG CCTATGGGCT 
GGGGTATCAA CTAAAGTCTA CGCTTGGGAT GGAAAAATTG ATGGAACAGG AACTCATGCT
ATGATTGTAA CTCAAGGTGT TTCAATCTTA GAAAATGATA TGTCCAAAAA TGAACCAGAA
AGTGTAAGAA AAAACTTAGA GATTTTAAAA GATAACATGC ATGAGCTTCA ATTAGGTTCT
ACTTATCCAG ATTATGATAA GAATGCATAT GATCTATATC AAGATCATTT CTGGGATCCT
GATACAAATA ATAATTTCTC AAAGGATAAT AGTTGGTATT TAGCTTATTC TATACCTGAC
ACAGGGGAAT CACAAATAAG AAAATTTTCA GCATTAGCTA GATATGAATG GCAAAGAGGA
AATTATAAAC AAGCTACATT CTATCTTGGA GAAGCTATGC ACTATTTTGG AGATATAGAT
ACTCCATATC ATCCTGCTAA TGTTACTGCC GTTGATAGCG CAGGACATGT TAAGTTTGAG
ACTTTTGCAG AAGAAAGGAA AGAACAGTAT AAAATAAACA CAGTAGGTTG CAAAACTAAT
GAGGATTTTT ATGCTGATAT CTTAAAAAAC AAAGATTTTA ATGCATGGTC AAAAGAATAT
GCAAGAGGTT TTGCTAAAAC AGGGAAATCA ATATACTATA GTCATGCTAG CATGAGTCAT
AGTTGGGATG ATTGGGATTA TGCAGCAAAG GTAACTTTAG CTAACTCTCA AAAAGGAACA
GCAGGATATA TTTATAGATT CTTACACGAT GTATCAGAGG GTAATGATCC ATCAGTTGGA
AATAATGTAA AAGAACTAGT AGCTTACATA TCAACTAGTG GTGAAAAAGA TGCTGGAACA
GATGACTACA TGTATTTTGG AATCAAAACA AAGGATGGAA AAACTCAAGA ATGGGAAATG
GACAACCCAG GAAATGATTT TATGGCTGGA AGCAAAGACA CTTATACTTT CAAATTAAAA
GATGAAAATC TAAAAATTGA TGATATACAA AATATGTGGA TTAGAAAAAG AAAATATACA
GCATTCCCAG ATGCTTATAA GCCAGAAAAC ATAAAGGTAA TAGCAAATGG AAAAGTTGTA
GTTGACAAGG ATATAAATGA GTGGATTTCA GGAAATTCAA CTTATAATAT AAAATAA
 
Protein sequence
MKRKICKALV CATLVTSLWA GVSTKVYAWD GKIDGTGTHA MIVTQGVSIL ENDMSKNEPE 
SVRKNLEILK DNMHELQLGS TYPDYDKNAY DLYQDHFWDP DTNNNFSKDN SWYLAYSIPD
TGESQIRKFS ALARYEWQRG NYKQATFYLG EAMHYFGDID TPYHPANVTA VDSAGHVKFE
TFAEERKEQY KINTVGCKTN EDFYADILKN KDFNAWSKEY ARGFAKTGKS IYYSHASMSH
SWDDWDYAAK VTLANSQKGT AGYIYRFLHD VSEGNDPSVG NNVKELVAYI STSGEKDAGT
DDYMYFGIKT KDGKTQEWEM DNPGNDFMAG SKDTYTFKLK DENLKIDDIQ NMWIRKRKYT
AFPDAYKPEN IKVIANGKVV VDKDINEWIS GNSTYNIK