Gene CPR_0145 details

Gene Information

Locus tag: CPR_0145
Symbol:
ID: 4206494
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 177913
End bp: 179481
Gene Length: 1569 bp
Protein Length: 522 aa
Translation table: 11
GC content: 30%
IMG OID: 642564700
Product: putative surface protein
Protein accession: YP_697482
Protein GI: 110803469
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG4932] Predicted outer membrane protein
TIGRFAM ID:
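
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which is consistent with the reported gene length) and that the unspliced CDS ends with a single stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the record; function names are illustrative.
START_BP = 177913
END_BP = 179481


def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


gene_len = gene_length_bp(START_BP, END_BP)
print(gene_len)                     # 1569, matching "Gene Length"
print(protein_length_aa(gene_len))  # 522, matching "Protein Length"
```

Both results agree with the Gene Length and Protein Length fields reported for CPR_0145.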


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.875356
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAATA ATATATTAAA AAAGTTTAGT TTTATGATGG TGTTTATTTT TGTTTTTATT 
ACAACATCAA TACCTGTATT TGCGGCTACG CCATCAATTT CTAAAGATGC TCCTATAAAG
GGAAGTATCA CAATATCAAA AAAAGGTGCT ACATTTACTG CATATAAGTT ATTAGATGCA
ACTAAAAGTG GGGATGCATA TGAATACTCA GTAAATAGTG ATTTAAAAGA CTTCTTTAAT
AATTCTAATT ATGGTTCTTA TAGCCAAGAA TCAATTCAAA AATTAAGTGG AGAAGAAGTA
AAAGAGTTTG CTGTTAAATT ACATAAATAT GTTCTTGATA ATAAAAAGAG TGGAAAAGAA
CTTACAGATG GACAAAAAAA TACTGTTGAC TTAGGTTATT ACTTAGTTAC AGAAGCTTCA
AGTGATTCAG AGGGGGCAGC AGTTGCTTCA ACACCTATAA TAGTTTCAGT TCCTCAAGTT
TCAGGAGATT CATGGAATTA TGATGTAACT ATTAATCCAA AGGATAACAC TCCTATATTA
GAAAAAAACA TAGTTAAAGA GAATCAAAGA GTTAAAACTT CATCTGAGAA TATTGGAGAT
GTTGTTAAAT ACGAAGTAAA AGCTTCTATA CCAGTTTATC AAAAGAATGC ACAGGATATA
ATGTATAAAT TTACTGACAC TATGAGCAAG GGCTTAACAT ATGATGAGAA AACTGGCTTT
AAGGTGACTT CAGGAGATAA AGTTTTTGCT AAGGACAAGG ATTATACTGT AGAAGTTAAA
AAACAAGAAG ATGGAGAAAC AGTTATTACA ATAAACTTTG TATATGAGAA TATAAAAGCT
TATGCAGAGA CTGGAATAAC TTTAAATTAC CAAGCTACCT TAAATAAGGA TGTAGTTATT
AGTAATAAAG AAAACTTAGG AAATACTAAC AACATACAAT TAGACTATAC AAATAACCCA
CATGTTAAGG ATAGTTATAA GAAATTAACT GATAAGGTTA CTACTTATAC CTTTGGATTT
GGAATTACAA AGGTTGATTC AGAGCTTAAT TCTAAGCTTT TACAAGGTGC TGAGTTTTCA
GTAAAAGATG AAAGTGGAAA GACTGTAGCT AAATATACAT ATGATGAAAA AGGACAAGTG
GTTTCTTTAA GTGGAAATGG GGTAACTAAC TCAAAGGGAA TTACAACATT TGTAGGCTTA
AAAGAAGGAA AGTACCTTAT TACAGAGGAA GTAGCTCCAT CTGGATATAG CTTATTAAAA
AATCCAGTAG AAGTGACAAT TACAGCTAAT AAAGATGAGT CTGGAAAGTA CACAGGTGCT
GCAACTATAG AGATTTCTAA TGGAAATAAA GCAGGACAAA TAATAAATGA TATTTCAGAG
AATGATGGAA ATATATTATT TAATGTTCAA ATAGAAAACC ACGCTGGATT TTCTCTTCCA
TCAACAGGAG GATTAGGAAA TACAGGATTT ATAAAAATAG CTATTATCTT ATTAAGTATA
GTTTGTGTTT TATCTATACT AGGACTTGGC TACACTAAAT TTGAAAATAG CAGAAAAACT
AAGAACTAA
 
Protein sequence
MKNNILKKFS FMMVFIFVFI TTSIPVFAAT PSISKDAPIK GSITISKKGA TFTAYKLLDA 
TKSGDAYEYS VNSDLKDFFN NSNYGSYSQE SIQKLSGEEV KEFAVKLHKY VLDNKKSGKE
LTDGQKNTVD LGYYLVTEAS SDSEGAAVAS TPIIVSVPQV SGDSWNYDVT INPKDNTPIL
EKNIVKENQR VKTSSENIGD VVKYEVKASI PVYQKNAQDI MYKFTDTMSK GLTYDEKTGF
KVTSGDKVFA KDKDYTVEVK KQEDGETVIT INFVYENIKA YAETGITLNY QATLNKDVVI
SNKENLGNTN NIQLDYTNNP HVKDSYKKLT DKVTTYTFGF GITKVDSELN SKLLQGAEFS
VKDESGKTVA KYTYDEKGQV VSLSGNGVTN SKGITTFVGL KEGKYLITEE VAPSGYSLLK
NPVEVTITAN KDESGKYTGA ATIEISNGNK AGQIINDISE NDGNILFNVQ IENHAGFSLP
STGGLGNTGF IKIAIILLSI VCVLSILGLG YTKFENSRKT KN
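
The sequences above are laid out in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch, not a reference implementation: `join_blocks` and `gc_fraction` are hypothetical helper names, and only the first and last lines of the gene sequence are reproduced here for illustration.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAAAAATA ATATATTAAA AAAGTTTAGT TTTATGATGG TGTTTATTTT TGTTTTTATT"
last_line = "AAGAACTAA"

first = join_blocks(first_line)
last = join_blocks(last_line)

print(len(first))    # 60 bases on a full display line
print(first[:3])     # ATG, the start codon
print(last[-3:])     # TAA, a stop codon under translation table 11
```

Applying `join_blocks` to the full gene sequence and passing the result to `gc_fraction` would give a value to compare against the GC content field of the record.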