Gene CPR_0597 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_0597 
Symbol 
ID4204927 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp714858 
End bp716162 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content28% 
IMG OID642565157 
Productsurface protein pspA precursor 
Protein accessionYP_697924 
Protein GI110802887 
COG category[R] General function prediction only 
COG ID[COG5263] FOG: Glucan-binding domain (YG repeat) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAAA AAAAATTATT ATTAATATTA ACTACATTTT TAATGATTAA TTCTAGTCAA 
ATTACTTTAG CTAAGAACTT AGATTATTCT GATGCAAAAA TGGAAATTAA AAAATCACTA
AATAATACTA ATGGATGGGT AAATAAAGAT TCTAAATGGT ATTATTTAGA TTCAAATGGT
AATATCCATA AAGGTTGGTT ATATTTAAAC GATTTATGGT ATTATCTTGA TAATATTAAT
GGTGAAATGA AAAAGGGTAT TCAAGAAATA AATGGTCATA AGTATTATTT AGATCCTTCT
AGTGGAGTTA TGAAAACTGG TTGGATAAAA ATTAATGATG ATTACTACTT CTTTGGTTCT
GATGGAGCAA TGAGAACTGG TTGGATAAAT GATGGTTGGA CAGATTATTA TTTAAAACCA
GATGGAACAA TCTTTAAAGG TTGGCTAGAT GATGGACTAA ATAAGTATTA TATGGATGAA
AATGGCCAAA TGAGAAAAGG TTGGGTCAAA TATAACGGAG ATTATTACTT CTTTGGACCT
GATGGTGCAA TGAGAACTGG TTGGATAAAT GATGGTTGGA CAGATTATTA TTTAAAACCA
GATGGAACAC CTCATATCGG TTGGTTAACT TATGATAATA ATAAATACTA CTTAAATACC
AACGGATCTA TAGCTAAACA CTGGTTATCT TTAAATAATA ATTGGTACTA CTTTGAAAAT
AATGGAGTAA TGGTTCAAAA TTCTGAAAAA ACAGTTAATG GAAACATATA TAAATTTGAT
TCAGATGGTG TAATGATAAC TGATAAATGG TTTGGCTCTA CTTATGTAAA TAAAGATGGA
ATTGTATTAC ATGGTACTCC ATCTAGATCA CATTCATATA CTCAATACAA GTTATTTAAT
TATATGAGTA ATGAAGATAA TAGAGAATCT GTTCATTATG CTGCTATAGA CCTTCACGGT
GGAGAAACAA CTAATAACTG TGTTTATTTC ACTTCTGAAG CTTTAAGAAG AGCTGGAGTT
AAAATCCCTC TATACGTGGC TAACACATAT CAATTAGAAA GAGAGTTACT TTCTAGAGGA
TGGATAAAAT CAACTAACAC AAGTGATCTT AGACCTGGAG ATGTAGTATT TTCAGGATAT
AAACATTCAT TTACATTTAT GAATTGGTAT GATAAGGATT ATGCTTATAT AGTTGATAAT
CAAAAAAAAT ATTTTGATTC TGTTCTTCAT AAAAGACTAG TCTCAGTAGA TGATCCTATT
AATGATACTA TAAGAGCAAC TCATTTCTTT TACTTACCAG AATAA
 
Protein sequence
MNKKKLLLIL TTFLMINSSQ ITLAKNLDYS DAKMEIKKSL NNTNGWVNKD SKWYYLDSNG 
NIHKGWLYLN DLWYYLDNIN GEMKKGIQEI NGHKYYLDPS SGVMKTGWIK INDDYYFFGS
DGAMRTGWIN DGWTDYYLKP DGTIFKGWLD DGLNKYYMDE NGQMRKGWVK YNGDYYFFGP
DGAMRTGWIN DGWTDYYLKP DGTPHIGWLT YDNNKYYLNT NGSIAKHWLS LNNNWYYFEN
NGVMVQNSEK TVNGNIYKFD SDGVMITDKW FGSTYVNKDG IVLHGTPSRS HSYTQYKLFN
YMSNEDNRES VHYAAIDLHG GETTNNCVYF TSEALRRAGV KIPLYVANTY QLERELLSRG
WIKSTNTSDL RPGDVVFSGY KHSFTFMNWY DKDYAYIVDN QKKYFDSVLH KRLVSVDDPI
NDTIRATHFF YLPE