Gene CPR_1815 details

Gene Information

Locus tag: CPR_1815
Symbol:
ID: 4205919
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 2006280
End bp: 2008007
Gene Length: 1728 bp
Protein Length: 575 aa
Translation table: 11
GC content: 26%
IMG OID: 642566365
Product: adherence and virulence protein A
Protein accession: YP_699130
Protein GI: 110803979
COG category: [K] Transcription
COG ID: [COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP
TIGRFAM ID:
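
The coordinates and lengths listed above are mutually consistent, which a short sketch can confirm. It assumes 1-based, inclusive coordinates (the convention under which End bp - Start bp + 1 equals the stated gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(2006280, 2008007)
print(length)                     # 1728
print(protein_length_aa(length))  # 575
```

Both results match the Gene Length and Protein Length fields of this record.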


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTTTAG ACGGTATTTA TTTATATAAC TTAATAAATG AATTAAAAGA TTCATTAATA 
AATTCTAGAA TTGACAAAAT AAATCAACCT GAAAAGGATG AAATAATAAT AAATGTACGT
GGAAAGGAAA ATAAAAAACT TTTAATTTCC TCTAGTTCCA AATACCCTAG ATTACATTTT
ACTACAATAA GTAAGAATAA TCCATTACAA CCTCCTGTTT TTTGTATGGT ACTTAGAAAA
TATTTAACAG GTGGAAGAAT AATCGATATT TATCAACAAT CTACAGATAG AATTGTTTCA
ATTGATATAG CCAATAAAGA TGAAATGGGT TTTCATAGTG TATATACTTT AGTAGTAGAA
ATAATGGCTA GACACAGTAA TATTTCTCTT GTAAGGAAGA GAGATAATAA AATTATGGAA
TCAATAAAAC ATATAACAGC AAATAAAAAT AGTTTTAGAG TTCTATACCC TGGTGTAAGC
TATGTATTCC CTCCTGCATC TGAAAAATTA AATCCTTTTG ACTTTTCAAA GGAAGATTTA
AAGATAGAAT TAAGCAAAAA TAATAATGAA TTAGATGAAA AGATTTTTTC AAAGCTACTA
ACTGGTGTTG GTAAAAATCT TTCTCTTGAA ATGTATTCAT TATTTAAATC ACAATTTGGA
GATTCATATG TTTTTGATGA TATATTCAAT TTTATATGTA ATTACTTTAC TAACATATTT
AAAGATATAC AAAATATTAT ATTCTACAAG AATGAAAAAA TTATAGATTT TTATTTTAAA
GATTTATCTA TTTTAGACAA TTGTACTAAA GAAGTGTATG ATAATAGTAG TGAACTTTTA
GATGCTTTTT TTGCTAATAA AGATAAACAA GATAGATTAC ATGCAAAAAG TGCAGATATC
CAAAAATTAG TTAATACTAA TATAGACAGA TGCTTAAAAA AAATTAAGGT TCTTGAAAAA
ACCTTAGAAG AATGTTCTAA AAAAGAGGAA TTTAAGATTA AAGGTGAACT TTTAACCTCT
TATATTTATA GTATTAAAAA AGGAGATAAA TCTGTTGACC TTTTAAATTA TTATAGTGAG
GATGAAGAAT ACCTGACTAT TTCACTTGAT GAAAATAAAA CTCCATCTGA AAACATTCAA
TTTTACTTTA AAAAATATAA CAAACTAAAA AAAGCTGAAG AATCTGCCTT AGAACAACTA
GCTATAAATG AAGATGAGCT TAAATACTTA AATTCAGTTT CCTCAAGCAT ACAGGTTGCA
GATAACTATG AAGACATAGA TGCTATAAAA AATGAACTTA TAGAAACAGG CTATATAAGA
TTTAGAAGAA ATAATAATGG TAAAAAGAAA GAAAAACAAT CTAAGCCTTA TCACTATGTT
TCTTCTGATG GAATAGATAT TTATGTTGGT AAAAATAATA TCCAAAATGA TTATTTAACT
TTAAAGTTTG CTGACAAGAA TGATACTTGG CTTCACACTA AAGATATACC TGGTTCTCAC
GTAATTGTTA AAAGTTCAAA CATTCCTGAT AAAACCTTAG AAGAAGCTGC TAACTTAGCT
GTTTTCTATA GTAAAGGAAA AGGTGGCACT AAAATTCCTG TAGACTATAC TTTAGTTAAA
AATGTAAAGA AACCTTCTGG TTCTAAACCT GGAATGGTAA TCTACTCAAC TAACAAGACA
GTTTATATGG ATTCACCAAA GGAAATAACT TTAGAAAAAC TTAAGTAA
 
Protein sequence
MALDGIYLYN LINELKDSLI NSRIDKINQP EKDEIIINVR GKENKKLLIS SSSKYPRLHF 
TTISKNNPLQ PPVFCMVLRK YLTGGRIIDI YQQSTDRIVS IDIANKDEMG FHSVYTLVVE
IMARHSNISL VRKRDNKIME SIKHITANKN SFRVLYPGVS YVFPPASEKL NPFDFSKEDL
KIELSKNNNE LDEKIFSKLL TGVGKNLSLE MYSLFKSQFG DSYVFDDIFN FICNYFTNIF
KDIQNIIFYK NEKIIDFYFK DLSILDNCTK EVYDNSSELL DAFFANKDKQ DRLHAKSADI
QKLVNTNIDR CLKKIKVLEK TLEECSKKEE FKIKGELLTS YIYSIKKGDK SVDLLNYYSE
DEEYLTISLD ENKTPSENIQ FYFKKYNKLK KAEESALEQL AINEDELKYL NSVSSSIQVA
DNYEDIDAIK NELIETGYIR FRRNNNGKKK EKQSKPYHYV SSDGIDIYVG KNNIQNDYLT
LKFADKNDTW LHTKDIPGSH VIVKSSNIPD KTLEEAANLA VFYSKGKGGT KIPVDYTLVK
NVKKPSGSKP GMVIYSTNKT VYMDSPKEIT LEKLK
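
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that translation, not the tool used to produce this record. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the tables differ in permitted start codons, which this sketch ignores). The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Codons enumerated in TCAG order, third base varying fastest,
# paired with the standard amino-acid assignments ("*" marks a stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGCTTTAG ACGGTATTTA TTTATATAAC"))  # MALDGIYLYN
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence should give the full protein under the same assumptions.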