Gene CPR_1627 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1627 
Symbol 
ID4205044 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1816958 
End bp1818664 
Gene Length1707 bp 
Protein Length568 aa 
Translation table11 
GC content29% 
IMG OID642566178 
Producthypothetical protein 
Protein accessionYP_698943 
Protein GI110803235 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000849473 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACTTTA CTGCAAAGTA TTGTGATAGT TTTGAATCCA TAATAAATAG CGAAGGGTTT 
ATAAAAGTTT TAGAAACTTA TTTAAAAAAA ACTAAAAATA AGAAGAGTCA TAATTTTAGA
TTTTTAAGTG AAGCCATAGG TACTGAAGAT ATAAAGGTTA TATCTAGGTA TTTAATAAGT
GCATTAAAAC TATTATCTAT GATGGGGGCT GATGAGGTGA TAGTTGTTAA TGATGCTTTT
GAAGGTTTAC TTGAAGATAA AAAATCTTTT GCTGACATTA TAGAAGATGT TTATTCTTTT
TGGAGAAAAT TAGAAAGATA CACTGTTATT CAAAACAATA AAATAAAAGA TGGTATTGCA
GCAGTTGGAT TTATAGATGC TAATAAAAAT TTTAATGATT TAATTTTAAG ATTTTACAGG
AGACTCCAAA AAAATTTATT AGGAAGTATG CCTAATATAC TTAGACAAGT TTCTGCTGGT
GGAAATGCAA GTATTATGGT AAGTAATTTA ATATGGCCAA GTTCAAGGGA ATATTCCATA
TTAGAACATA TTCCATTTAT AGATGCAATA TCATTAGAAG CGCCATTTAT AACTTATCCA
AGTAAGAATA CTAGGGATGG TATATTCTTA GAAACAAGTG AAAATCCATT AAGTGGATCT
CATATAAATA GTGAAGAATG GTTTTGTTAT CCTGCTAAGG TAGGGGAATT ATTAGCATAT
GTATATTTCC ATAGAGATTT TATGTCTCAT GGAATAAGTT TATGTAATTT GTTTGATTTA
GCAACTGTAG AGGAATGTAG AGGGGTAAAA CCAGATATTA TTTATATCTT TGGAGCTAAA
GATGATGATG ATGAGTTAAA AACTTGTTTC TATGATGATG AAAAAAATAA CATAATGCTA
GGGTATGCTA ATTATAGTGA AGAAATAGAT TATTTTGGGT ATATGAAAAA GATGATATTA
ACTCTTCATA ATATAATAAT GATAAAAAGA GGATATATGC CTATTCATGG AGCTATGGTC
AATGTGGTTC TTAAAAATGG AAAAGAAGCT AATATTGTAA TAATGGGTGA TAGTGGAGCA
GGAAAATCTG AAAGCTTAGA AGCCTTTAGA GCACTGAGTG AAGAATATAT AAGCGATATG
ACCATAATTT TTGATGACAT GGGTGTATTT AAAAATGTAG ATGGCATAAT TAAGGGGTAT
GGAACTGAAA TAGGAGCCTT TGTAAGACTT GATGACTTAG ATCAAGGGTA TGCTTTTAAA
GAAATTGATA GAAGTATATT TATGAATCCT GATAAGATAA ATGCTAGACT TCTTATGCCA
GTATGTAAGT ATGACGATAT AATAAGGGGA TATGATGTAG ATCTTTTCCT TTATGCTAAT
AATTATGATG GATTAGATGA GGGAGAAAAA TCTATTGAAT ATTTTAATAA TCCAGAGGAA
GCTAAGAAAA TTTTTAAAGC TGGTGCTAGA ATGGCAAAGG GAACAACTAC TGAAAATGGC
TTAGTGGAAT CATATTTTGC TAATCCTTTT GGACCTGTGC AAAAGAAAGA AGAGATGGAT
TTAATAATAG ATAAATATTT TGAAGATATG TTTAATAATA AAGTGAAGGT TGGACAAATA
AAAACTTGTT TAGGAGTTTT AGGCCTTGAA AAGGAAGGAC CTAAAAAAGC AGCCATAGAA
CTTTTTAATA TAATTGAAAA AATGTAA
 
Protein sequence
MNFTAKYCDS FESIINSEGF IKVLETYLKK TKNKKSHNFR FLSEAIGTED IKVISRYLIS 
ALKLLSMMGA DEVIVVNDAF EGLLEDKKSF ADIIEDVYSF WRKLERYTVI QNNKIKDGIA
AVGFIDANKN FNDLILRFYR RLQKNLLGSM PNILRQVSAG GNASIMVSNL IWPSSREYSI
LEHIPFIDAI SLEAPFITYP SKNTRDGIFL ETSENPLSGS HINSEEWFCY PAKVGELLAY
VYFHRDFMSH GISLCNLFDL ATVEECRGVK PDIIYIFGAK DDDDELKTCF YDDEKNNIML
GYANYSEEID YFGYMKKMIL TLHNIIMIKR GYMPIHGAMV NVVLKNGKEA NIVIMGDSGA
GKSESLEAFR ALSEEYISDM TIIFDDMGVF KNVDGIIKGY GTEIGAFVRL DDLDQGYAFK
EIDRSIFMNP DKINARLLMP VCKYDDIIRG YDVDLFLYAN NYDGLDEGEK SIEYFNNPEE
AKKIFKAGAR MAKGTTTENG LVESYFANPF GPVQKKEEMD LIIDKYFEDM FNNKVKVGQI
KTCLGVLGLE KEGPKKAAIE LFNIIEKM