Gene CPR_2012 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_2012 
Symbol 
ID4204336 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2218767 
End bp2219852 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content29% 
IMG OID642566562 
Productstage II sporulation protein P 
Protein accessionYP_699321 
Protein GI110801662 
COG category 
COG ID 
TIGRFAM ID[TIGR02867] stage II sporulation protein P 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGAGTTA TTAATAAAAG AGAGATACCC ATAGGAGTGA TTGTTCTTTT ATCTTTACTT 
ATAATATTTA TGTTTAGATT TATGAAGATT GCAGCAAGTA AAGACATGAG AGAAAATTTA
TCATATATAC AACTTTTAAA TGCAGGAATG CCTGTTGCAA AGGGAACATA TTACGATGAG
AATGCTTATT TAGAAAGTAA TATTACATTA AAAAGTTTAG CCTTAGAAAC TTTAAATATA
AAGCCTCTAG ATCCGATAGA GTTAGTAATG AATGAAGTGC CTTACTTTGG TGCAGTAAAT
AAAATAGCTT CAATTGATAA AGTTAATTAT GTTTCTGCTG AAAAAGTATC TTCATTTGAT
TTAAACAAAG ATAGTATAGA TATAGTTTCT GAGGAAGAAT CTAAGAAAAG TGCAGAATTA
GAAGCTAGTA AAAATAGTGA AGTTTATGAT CCAAGTTTAA AAAAGGAACT AGATCAGTCT
AAGCCAGAGG TGCTTATTTA TCATACTCAT AATTCAGAGG GGTATACTGA GGAGAGAACT
TCAAACAATG AAGAACATAA CGTAGTTGGA GTAGGAACTT TAGTTGCAAA AGAACTTGAA
GAAAACTATG GTATATCAGT TATTCATGAT AAAACAAATC ACTCAGCTTC ATATGAGCAA
TCTTACAATA AATCTAGAGA GACAGTTAAA AAATATATTA ATGAATATGA TGATTTTAAG
ATGGTAATAG ATATTCATAG AGATTCTGTT GGAGAGCATA ACAAAAAGAA TTTAACTGCT
AATATAAATG GAGAAAGTTT AGCTAAGATT ATGTTTGTTA CAACTAAGAA TAGCCAATAT
TTTAATGATG CTGAATCCTT GGCCTATAGA TTTATTAATA AAGCCAATGA GCTTTTTCCT
GATATTTTAA GAAGACAGGA AACCTTTAAG TATGATAGGG GAAAAAATGC GTTTAACCAA
CAATATAATA AGAATTCAAT GCTTATTGAA GTTGGTGCAG AAGTAAATAC TTCTAAAGAG
GCACAAGCTA CAGCAAAGTA TATAGCTAGA TTAATAGCAG AAGAATTAAA CAGAAAAAGT
GAATAA
 
Protein sequence
MRVINKREIP IGVIVLLSLL IIFMFRFMKI AASKDMRENL SYIQLLNAGM PVAKGTYYDE 
NAYLESNITL KSLALETLNI KPLDPIELVM NEVPYFGAVN KIASIDKVNY VSAEKVSSFD
LNKDSIDIVS EEESKKSAEL EASKNSEVYD PSLKKELDQS KPEVLIYHTH NSEGYTEERT
SNNEEHNVVG VGTLVAKELE ENYGISVIHD KTNHSASYEQ SYNKSRETVK KYINEYDDFK
MVIDIHRDSV GEHNKKNLTA NINGESLAKI MFVTTKNSQY FNDAESLAYR FINKANELFP
DILRRQETFK YDRGKNAFNQ QYNKNSMLIE VGAEVNTSKE AQATAKYIAR LIAEELNRKS
E