Gene CPF_0965 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0965 
Symbol 
ID4202989 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1112564 
End bp1115776 
Gene Length3213 bp 
Protein Length1070 aa 
Translation table11 
GC content31% 
IMG OID638081847 
Producthypothetical protein 
Protein accessionYP_695412 
Protein GI110799867 
COG category[S] Function unknown 
COG ID[COG5412] Phage-related protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGATG CAGATTCGGT AGGGAAAATT GGTCTTGATT TAGAGATACA AGATGGTGAT 
ATAGGAAAAC AAATAGAAAA GATGGCTAGT GCTATAGGTA GTCAAATAAG TAAGTCGCTA
GAAGGAATAA CAGAAAAATT TGATTTTAAT TCAATAACAA AAGGAATTTC TGAATCTTTA
AATAAAGGAA TGAATAATAT TGATGAAACT ATAAAATCTA GTGTTGAGAA AAGTAAAGCT
AATATTCTTA AGACAATAGA AGAAATAAAA TCAAAAGCTT TAGATGCTAT AAGAAGTATA
ATAGCTAAAT CTAAAGAAAT AAAAATTCCT ATTCAGTTTT CTCCAGTTAG TAATATTGCA
ATGCCTAGTA GCAAGGTAGC AACGCAACCA ATAAGTAGAA GAGGACCACC AAAAAGTAAT
GTTGGAGATT TAGAATCTAT AAAATCTAAG ATTGAAAATC TTTCTAATAG TTTAGAGATA
ACTAATAGAT CAATAGAGCA GCAACAAGAA AAATTATCAG GATTGAAGGC TGCTTATAAT
TCTACATTTA ATCAAGCTAG AAAAAACAAA TTACAGGAGC AAATATTAAA AACAGAAGCT
GTTATAAATA AACTTATAGC TAAATCTGAT GCAACAGGGT TTAAATTAGC TGATTTAGAT
AGGCAGTTTG AGAAATTAGG TAATTCAGCT AAGAATTCTA CTTTAGGATT AAATGAAGCA
AGTAATAGTA TGAAGAGGCT TGAAAATACT ACAAGTAGAA CAAATAGAAA TTTAAGAAAT
GCTAATAACT CTACTAGACG ATATAGAGAA AATATGAATG GTGCTAGAAG TGCAACAGGG
ATGTTTATTG ATAGTATGTT TAGGTGGGGA ATAGTATTCC CTTTAGTAAT GAAGGGGATA
AACACTGTTG CTAGTTATAT AGGAAGTGCT TTAATGACTA ATGCTCAGTT TGCAAACAGT
TTAGCACAAA TTAGAACTAA TCTTATGGTT GCATTTATGC CAATCTATCA AGCAGTTCTA
CCAGCACTTA ATGCTCTTAT GAGTGCATTA GCAACAGTAA CCGCATATAT TGCAGCTTTT
ATAAGTGCTA TATTTGGTAA AACATATCAA GCTAGTTTTG GTGCTGCTAA AAGTATGAAT
GCTTCTATAG CTTCAATGAA GAATATGGAA AAGCAAGGTA AAAAAACATC TGGAGCAGTA
GATAAAATAG GAGATTCGGC AGAAAAGACA AAAAAGAAAA TACAAAGGTC CTTAGCTGGA
TTTGATGAAA TAAATAAATT AAGTATTCCA GATGATTCTG ATAAAGCTCC AAAGGCTCCA
AAAGGAGGAG GCGGTGGTGG AGGAATAGAT CCGATACCAA TGGTTGCTCC AGATATAGAT
TTAAGTCCAA CAAGTGTAGC AATGCAAAAA ATAAATGCTA TGGTAGAAAA GCTAAAAGAT
ATTATATCTA AAATATTTCA ACCTTTTAAA AATGCATGGG CAAGAGAAGG AGCTGCAACA
ATTGCAAGTA TTAAATATGC ATTACATGGA GTTTGGGAGC TTATAAAAGC TATAGGTATT
AGTTTTTTAG AAGTATGGAC TAATGGAACG GGAGAAAAAA TACTTGTAGT TATTCTACAA
ATTTTACAAA ACATATTTAA TATAGTTGGA GATATAGCAA TTACATTTGC AGATGCTTGG
AATGCTGGAG GAATAGGAAC AGCTATAGTT CAATCTTTAG CAAATGCTCT TTTAAATGCA
CTTACATTAA TTAAGCATAT GGGAGATTCT TTAAGGCAAG TTTGGGGAGA AATTGGTCCT
GGATTAGCAA CTACATTCAT GCAAATATTA AATGCAACAT CAGGAGTATT AGAAAATTTA
ACTCAAAAAT TAATTTATGT TTGGGATAAT GGAGGTAGTC ATTTATTCCA GGGATTTATA
AGGCTAGGTG CAAAAATATT TGAATTAGCT GGGTATATTT ATACTAATTT TGTTGCTCCT
ATGGTTAATT GGTTTGTAAA CATGATAGCT CCAGTTCTAG CTAAATTAGC AGATATATTA
GGAATTGTTT TAGATGCGTT TAGCAACTTA ATAAATTGGT TAATGGGTAG TGGAAAGCCA
GTATTAGATA CAATTATTAT TGTTTTAGGA AGTCTTGGTG CTTCTATACT AATAGTTAAA
GGAGCATTAA CTTTATGGAC AATAGCTCAA ACAATTTGGA CAACTGTAGC AAAAACAAGT
ACTATAGCAA CAACATTACT AGGTGGAGCA ATAGCATTTT TAACAAGTCC AATAGGAATT
GCAATAGTTG CTATAACAGC AATAATAGCT AGTGGAGTAG CTTTATATAA AAATTGGGAC
TTTGTAAAAG CTAAAGCTAT AGAAATATGG GGAAAAATAA AAGACATATT TAATAGCTTT
AAAGAATGGT TAAGGAATGT TTTCCAAACA GATTGGTCAA ATTGTTTTGG AGTATTAGGG
AATCTATTAA ATCTTTTCTT AAAAAATGTA GATAATGTTT TTCAATCTAT CAAAAAAATA
TTTGGTGGAA TAATAGACTT TGTAACCGGA GTATTTACTG GAAACTGGAG CAGAGCTTGG
CATGGCGTTG TAGATATTTT CAAAGGTATA ATGAGTGGAT TAGGTTCTGT AATTAAAGCG
CCTCTAAACT CCGTTATTGG GCTAATTAAT ATGGCTATAG ATGGTTTAAA CAAAATTAGT
TTTACTACTC CAGATTGGAT TCCTGGTATT GGTGGTAAGC ACTTTGGAGT TAACATAGCT
AAAATGCCTT ATTTGGCTAA AGGCGGTATA GTAGATAAAC CAACACAAGC CGTAATAGGA
GAGGCTGGAA CAGAGGCAGT AGTACCACTA GAAAATAATA CTGGTGGATT AAATTTACTT
GCTATTAAAC TTTCAGAAAG AATTAATAAT ATGTTATTAC TTTCTAATAA TGCATTAAAA
CAACCTGATT TAACAATGTT AGGTCAAAAT ATTAATAGTA ATGAAAAGAA GAGTATTAAT
GATCCAGAGT TCATAGAAAA AATAAAAGAA GTTATAATAG AAGCTATTTT AGAAGCGATG
AAGAATAAAA AAGATAATAG CTATAATAAT TCAGGGCCTC AAGAGAGTGG TGATTTAATA
TTAAGAATAA GAGATACTGA TTTAGGTAGA ATTGCAATAG AAGCTATAAA TAAAGTGAAT
AGACAAGCTG GAGAGCAATT ATTAAATCTT TAG
 
Protein sequence
MADADSVGKI GLDLEIQDGD IGKQIEKMAS AIGSQISKSL EGITEKFDFN SITKGISESL 
NKGMNNIDET IKSSVEKSKA NILKTIEEIK SKALDAIRSI IAKSKEIKIP IQFSPVSNIA
MPSSKVATQP ISRRGPPKSN VGDLESIKSK IENLSNSLEI TNRSIEQQQE KLSGLKAAYN
STFNQARKNK LQEQILKTEA VINKLIAKSD ATGFKLADLD RQFEKLGNSA KNSTLGLNEA
SNSMKRLENT TSRTNRNLRN ANNSTRRYRE NMNGARSATG MFIDSMFRWG IVFPLVMKGI
NTVASYIGSA LMTNAQFANS LAQIRTNLMV AFMPIYQAVL PALNALMSAL ATVTAYIAAF
ISAIFGKTYQ ASFGAAKSMN ASIASMKNME KQGKKTSGAV DKIGDSAEKT KKKIQRSLAG
FDEINKLSIP DDSDKAPKAP KGGGGGGGID PIPMVAPDID LSPTSVAMQK INAMVEKLKD
IISKIFQPFK NAWAREGAAT IASIKYALHG VWELIKAIGI SFLEVWTNGT GEKILVVILQ
ILQNIFNIVG DIAITFADAW NAGGIGTAIV QSLANALLNA LTLIKHMGDS LRQVWGEIGP
GLATTFMQIL NATSGVLENL TQKLIYVWDN GGSHLFQGFI RLGAKIFELA GYIYTNFVAP
MVNWFVNMIA PVLAKLADIL GIVLDAFSNL INWLMGSGKP VLDTIIIVLG SLGASILIVK
GALTLWTIAQ TIWTTVAKTS TIATTLLGGA IAFLTSPIGI AIVAITAIIA SGVALYKNWD
FVKAKAIEIW GKIKDIFNSF KEWLRNVFQT DWSNCFGVLG NLLNLFLKNV DNVFQSIKKI
FGGIIDFVTG VFTGNWSRAW HGVVDIFKGI MSGLGSVIKA PLNSVIGLIN MAIDGLNKIS
FTTPDWIPGI GGKHFGVNIA KMPYLAKGGI VDKPTQAVIG EAGTEAVVPL ENNTGGLNLL
AIKLSERINN MLLLSNNALK QPDLTMLGQN INSNEKKSIN DPEFIEKIKE VIIEAILEAM
KNKKDNSYNN SGPQESGDLI LRIRDTDLGR IAIEAINKVN RQAGEQLLNL