Gene CPR_1697 details

Gene Information

Locus tag: CPR_1697
Symbol:
ID: 4204905
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 1892297
End bp: 1893505
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 27%
IMG OID: 642566247
Product: hypothetical protein
Protein accession: YP_699012
Protein GI: 110801954
COG category: [R] General function prediction only
COG ID: [COG1323] Predicted nucleotidyltransferase
TIGRFAM ID:
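
The Start bp, End bp, Gene Length and Protein Length values above are mutually consistent. A minimal sketch of that check follows, assuming 1-based inclusive coordinates and a CDS that ends with a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(1892297, 1893505)
    print(length, protein_length_aa(length))
```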


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATATAA CTGGCATAAT AACTGAATAT AACCCTTTTC ATCTAGGTCA TGAACTTCAT 
CTAAAAAATT CAAAAGAGAT TACAAATTGC GATGGAGTTA TTTGTGTTAT GAGTGGAAAC
TTTGTGCAAA GAGGTTTGCC TGCTTTAACA GACAAATGGA CTAGAACAAA AATGGCCTTA
GAAGCTGGAG TTGATTTAGT TGTAGAACTT CCAACTCTTT TTGCAACTTC TTCAGCAGAA
TTTTTTGCCT TTGGTGCAGT ATCTTTGCTT AATTCTTTAA ATGTAGTTAA TAATATTTGT
TTTGGATCAG AATGTGGAGA TATAGATTTA ATTAAAAAAC TTAGTGAAAT TATTGTCAAT
GAACCTCCTA TATTCAAAGA ATATTTAAAG GATTATTTAA AGGAAGGCCT TCCCTTTCCT
AAAGCTAGAA GTGAAGCTTT AATGAAGTAC TTAGATTATA ATAATTATAA AACTGATTTT
TCATACTTAG AAAAAGTTCT AAACTCTTCT AATAATATAT TAGCCATTGA ATATTGTAAA
AGCCTTTATA AGCTTCAAAG TACTATAAAA CCTTTTACTA TACAAAGATT AGGAGCAGAT
TACAACGATG AAGAACTGTC AAAAAATGAA ATAGCCTCTG CTTCTGCCAT AAGAAAAAGT
ATTTACACTT CAAATATAGA AGAAAGTCTT GATTTTATGC CTGAGTATAG CTATAACTTA
TTAAAAAATA CTTCTTTTAG TGATTTAGAC AAAATGTTTG ACTTAGTAAA ATACGCTATA
GTAAGCAATC CTAATATATT AAAAGAAATA CCAGAGGCTT CTGAAGGAAT AGATAATAAG
ATAATTCAAA ACATAGGAAA AGCTAATTCT TTAGATGAAT TAATAAACCT ATGCAAAAGT
AAGCGTTATT CATATACTAG ATTAAACAGA ATTTTATGTC ACATACTATT AAATGTAAAT
AAAGATCTTC TTTCTCTTAG AAAATATTCT CCTAATTATG TAAGAATCTT AGGATTTAAT
AATAAAGGAA GGGAAATTTT AAAAGAGATT AAGAAAAATT CTGAAATAAA TATCGTTAAT
AAATTATCAA AAGCTAAAAC AGATCCTTTG TTAGAATTTG ACATAAAAGC CACTAATATT
TATAGCTTTC TAAATCCATC AGTTAAAATT AACAGTGATT ATTTAATTAG TCCTATTATT
TTTAGATAA
 
Protein sequence
MNITGIITEY NPFHLGHELH LKNSKEITNC DGVICVMSGN FVQRGLPALT DKWTRTKMAL 
EAGVDLVVEL PTLFATSSAE FFAFGAVSLL NSLNVVNNIC FGSECGDIDL IKKLSEIIVN
EPPIFKEYLK DYLKEGLPFP KARSEALMKY LDYNNYKTDF SYLEKVLNSS NNILAIEYCK
SLYKLQSTIK PFTIQRLGAD YNDEELSKNE IASASAIRKS IYTSNIEESL DFMPEYSYNL
LKNTSFSDLD KMFDLVKYAI VSNPNILKEI PEASEGIDNK IIQNIGKANS LDELINLCKS
KRYSYTRLNR ILCHILLNVN KDLLSLRKYS PNYVRILGFN NKGREILKEI KKNSEINIVN
KLSKAKTDPL LEFDIKATNI YSFLNPSVKI NSDYLISPII FR
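
The protein sequence can be derived from the gene sequence, and the GC content recomputed, with a short script. The sketch below assumes the sequences are pasted in as shown, in space-separated blocks of ten. It uses the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons). The helper names `clean`, `translate` and `gc_content` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases into blocks and uppercase the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First and last blocks of the gene sequence listed above.
    print(translate("ATGAATATAA CTGGCATAAT AACTGAATAT"))
    print(translate("TTTAGATAA"))
```

Running `translate` on the full gene sequence and comparing the result with the listed protein sequence (after applying `clean` to it) is a quick way to confirm that the two records agree.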