Gene CPR_0114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_0114 
Symbol 
ID4204965 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp143687 
End bp144859 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content23% 
IMG OID642564669 
Producthypothetical protein 
Protein accessionYP_697451 
Protein GI110802866 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATTTT GTTCTAAATG TGGTAATAAA TTATCTGATT CAATGAAATT TTGTAATAAG 
TGTGGTGCTA AAGTAAAATC AGTAGATAAA GATACTGAAA ACAATTCAAA TCCAGAGGAT
GCAAAAAATA TTTCTAACAT TGAAAATAAT ATTGATAAAA CCATAATATT AGATGCACCT
AAAATAAAAA AAGAAATTAA TACTGATATT AATGAAAATA TTAAAAATAA TTTGAAACCT
ATTCACAACT CTAATTTTAA TAATAAAGAA AAATGTTTAG ATGCAAGTTT TGATGATGAT
TTAGATGATG ATTTAGATGA AAATTTCTAT GATGAATCTT GCAATAATGA AAATACTGAT
AATTTTAAAC ATTCTAATAC CAAAAAAAAG ATTTTAATAA GTATTTTCAC ATTAGTAGCT
TTTGTAATAA TAGGTACTTC TATTTATTTT TTAAGATCAC CTTTATTATA TAAATACTAT
TATAACTCTG CACTTAAATC TTCATCAGTT ACTGAAAAAT TATCTTATTA TAACAGTGCT
TTAAAATACT CTAAAAATGA TGATTTATTA AATTCTATTT ATACAACTTT AAAGTCTGAC
TCTGATTTTG TTGATAATTC TTCAATATTA ACAAACTTAA ATAAATCTGA AAAAGATAAT
TTAATGTCAA AATTATATGT AAATAAAGCT ACTGTAGATT TTAAAAATAA AAATTACACC
GACTGTGATT CTGATTTAGA TCTAGCTACA AAATATGGAT ATAAAAAAGA AAACTTTTCA
CAATATGATG ATTTACAAAA AAAGCTTAAT GAAAGTAAAA ATTCTTCAAA TAACGATAAA
GTCGATAATG TCTATAGTTT TACAAATGAA AATCCATCAA AATTTTCTGG AAACATATAT
GATTATCCTT ATGACTTTAT AATGCCATAT AGTAATTCTT CCTATTTGTC AGCATCAGAT
CTTTCTAAAT ATAATAAAAG TACTCTTTCT TTAATGAGAA ACGAAATATA TGCACGTCAT
GGGTATGTGT TTAATACTAA TCCATTTAAG GAATATTTTA ATTCAAAATC ATGGTATCAT
CCTGATTCAT CCTTTAAAGG TGATGACAGT GAATTAAATG ATTATGAAAT AAAAAATGTA
CAAACCATAA AATCAGTTGA AAACTCCAAA TAA
 
Protein sequence
MKFCSKCGNK LSDSMKFCNK CGAKVKSVDK DTENNSNPED AKNISNIENN IDKTIILDAP 
KIKKEINTDI NENIKNNLKP IHNSNFNNKE KCLDASFDDD LDDDLDENFY DESCNNENTD
NFKHSNTKKK ILISIFTLVA FVIIGTSIYF LRSPLLYKYY YNSALKSSSV TEKLSYYNSA
LKYSKNDDLL NSIYTTLKSD SDFVDNSSIL TNLNKSEKDN LMSKLYVNKA TVDFKNKNYT
DCDSDLDLAT KYGYKKENFS QYDDLQKKLN ESKNSSNNDK VDNVYSFTNE NPSKFSGNIY
DYPYDFIMPY SNSSYLSASD LSKYNKSTLS LMRNEIYARH GYVFNTNPFK EYFNSKSWYH
PDSSFKGDDS ELNDYEIKNV QTIKSVENSK