Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPF_0965 |
Symbol | |
ID | 4202989 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens ATCC 13124 |
Kingdom | Bacteria |
Replicon accession | NC_008261 |
Strand | + |
Start bp | 1112564 |
End bp | 1115776 |
Gene Length | 3213 bp |
Protein Length | 1070 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 638081847 |
Product | hypothetical protein |
Protein accession | YP_695412 |
Protein GI | 110799867 |
COG category | [S] Function unknown |
COG ID | [COG5412] Phage-related protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGATG CAGATTCGGT AGGGAAAATT GGTCTTGATT TAGAGATACA AGATGGTGAT ATAGGAAAAC AAATAGAAAA GATGGCTAGT GCTATAGGTA GTCAAATAAG TAAGTCGCTA GAAGGAATAA CAGAAAAATT TGATTTTAAT TCAATAACAA AAGGAATTTC TGAATCTTTA AATAAAGGAA TGAATAATAT TGATGAAACT ATAAAATCTA GTGTTGAGAA AAGTAAAGCT AATATTCTTA AGACAATAGA AGAAATAAAA TCAAAAGCTT TAGATGCTAT AAGAAGTATA ATAGCTAAAT CTAAAGAAAT AAAAATTCCT ATTCAGTTTT CTCCAGTTAG TAATATTGCA ATGCCTAGTA GCAAGGTAGC AACGCAACCA ATAAGTAGAA GAGGACCACC AAAAAGTAAT GTTGGAGATT TAGAATCTAT AAAATCTAAG ATTGAAAATC TTTCTAATAG TTTAGAGATA ACTAATAGAT CAATAGAGCA GCAACAAGAA AAATTATCAG GATTGAAGGC TGCTTATAAT TCTACATTTA ATCAAGCTAG AAAAAACAAA TTACAGGAGC AAATATTAAA AACAGAAGCT GTTATAAATA AACTTATAGC TAAATCTGAT GCAACAGGGT TTAAATTAGC TGATTTAGAT AGGCAGTTTG AGAAATTAGG TAATTCAGCT AAGAATTCTA CTTTAGGATT AAATGAAGCA AGTAATAGTA TGAAGAGGCT TGAAAATACT ACAAGTAGAA CAAATAGAAA TTTAAGAAAT GCTAATAACT CTACTAGACG ATATAGAGAA AATATGAATG GTGCTAGAAG TGCAACAGGG ATGTTTATTG ATAGTATGTT TAGGTGGGGA ATAGTATTCC CTTTAGTAAT GAAGGGGATA AACACTGTTG CTAGTTATAT AGGAAGTGCT TTAATGACTA ATGCTCAGTT TGCAAACAGT TTAGCACAAA TTAGAACTAA TCTTATGGTT GCATTTATGC CAATCTATCA AGCAGTTCTA CCAGCACTTA ATGCTCTTAT GAGTGCATTA GCAACAGTAA CCGCATATAT TGCAGCTTTT ATAAGTGCTA TATTTGGTAA AACATATCAA GCTAGTTTTG GTGCTGCTAA AAGTATGAAT GCTTCTATAG CTTCAATGAA GAATATGGAA AAGCAAGGTA AAAAAACATC TGGAGCAGTA GATAAAATAG GAGATTCGGC AGAAAAGACA AAAAAGAAAA TACAAAGGTC CTTAGCTGGA TTTGATGAAA TAAATAAATT AAGTATTCCA GATGATTCTG ATAAAGCTCC AAAGGCTCCA AAAGGAGGAG GCGGTGGTGG AGGAATAGAT CCGATACCAA TGGTTGCTCC AGATATAGAT TTAAGTCCAA CAAGTGTAGC AATGCAAAAA ATAAATGCTA TGGTAGAAAA GCTAAAAGAT ATTATATCTA AAATATTTCA ACCTTTTAAA AATGCATGGG CAAGAGAAGG AGCTGCAACA ATTGCAAGTA TTAAATATGC ATTACATGGA GTTTGGGAGC TTATAAAAGC TATAGGTATT AGTTTTTTAG AAGTATGGAC TAATGGAACG GGAGAAAAAA TACTTGTAGT TATTCTACAA ATTTTACAAA ACATATTTAA TATAGTTGGA GATATAGCAA TTACATTTGC AGATGCTTGG AATGCTGGAG GAATAGGAAC AGCTATAGTT CAATCTTTAG CAAATGCTCT TTTAAATGCA CTTACATTAA TTAAGCATAT GGGAGATTCT TTAAGGCAAG TTTGGGGAGA AATTGGTCCT GGATTAGCAA CTACATTCAT GCAAATATTA AATGCAACAT CAGGAGTATT AGAAAATTTA ACTCAAAAAT TAATTTATGT TTGGGATAAT GGAGGTAGTC ATTTATTCCA GGGATTTATA AGGCTAGGTG CAAAAATATT TGAATTAGCT GGGTATATTT ATACTAATTT TGTTGCTCCT ATGGTTAATT GGTTTGTAAA CATGATAGCT CCAGTTCTAG CTAAATTAGC AGATATATTA GGAATTGTTT TAGATGCGTT TAGCAACTTA ATAAATTGGT TAATGGGTAG TGGAAAGCCA GTATTAGATA CAATTATTAT TGTTTTAGGA AGTCTTGGTG CTTCTATACT AATAGTTAAA GGAGCATTAA CTTTATGGAC AATAGCTCAA ACAATTTGGA CAACTGTAGC AAAAACAAGT ACTATAGCAA CAACATTACT AGGTGGAGCA ATAGCATTTT TAACAAGTCC AATAGGAATT GCAATAGTTG CTATAACAGC AATAATAGCT AGTGGAGTAG CTTTATATAA AAATTGGGAC TTTGTAAAAG CTAAAGCTAT AGAAATATGG GGAAAAATAA AAGACATATT TAATAGCTTT AAAGAATGGT TAAGGAATGT TTTCCAAACA GATTGGTCAA ATTGTTTTGG AGTATTAGGG AATCTATTAA ATCTTTTCTT AAAAAATGTA GATAATGTTT TTCAATCTAT CAAAAAAATA TTTGGTGGAA TAATAGACTT TGTAACCGGA GTATTTACTG GAAACTGGAG CAGAGCTTGG CATGGCGTTG TAGATATTTT CAAAGGTATA ATGAGTGGAT TAGGTTCTGT AATTAAAGCG CCTCTAAACT CCGTTATTGG GCTAATTAAT ATGGCTATAG ATGGTTTAAA CAAAATTAGT TTTACTACTC CAGATTGGAT TCCTGGTATT GGTGGTAAGC ACTTTGGAGT TAACATAGCT AAAATGCCTT ATTTGGCTAA AGGCGGTATA GTAGATAAAC CAACACAAGC CGTAATAGGA GAGGCTGGAA CAGAGGCAGT AGTACCACTA GAAAATAATA CTGGTGGATT AAATTTACTT GCTATTAAAC TTTCAGAAAG AATTAATAAT ATGTTATTAC TTTCTAATAA TGCATTAAAA CAACCTGATT TAACAATGTT AGGTCAAAAT ATTAATAGTA ATGAAAAGAA GAGTATTAAT GATCCAGAGT TCATAGAAAA AATAAAAGAA GTTATAATAG AAGCTATTTT AGAAGCGATG AAGAATAAAA AAGATAATAG CTATAATAAT TCAGGGCCTC AAGAGAGTGG TGATTTAATA TTAAGAATAA GAGATACTGA TTTAGGTAGA ATTGCAATAG AAGCTATAAA TAAAGTGAAT AGACAAGCTG GAGAGCAATT ATTAAATCTT TAG
|
Protein sequence | MADADSVGKI GLDLEIQDGD IGKQIEKMAS AIGSQISKSL EGITEKFDFN SITKGISESL NKGMNNIDET IKSSVEKSKA NILKTIEEIK SKALDAIRSI IAKSKEIKIP IQFSPVSNIA MPSSKVATQP ISRRGPPKSN VGDLESIKSK IENLSNSLEI TNRSIEQQQE KLSGLKAAYN STFNQARKNK LQEQILKTEA VINKLIAKSD ATGFKLADLD RQFEKLGNSA KNSTLGLNEA SNSMKRLENT TSRTNRNLRN ANNSTRRYRE NMNGARSATG MFIDSMFRWG IVFPLVMKGI NTVASYIGSA LMTNAQFANS LAQIRTNLMV AFMPIYQAVL PALNALMSAL ATVTAYIAAF ISAIFGKTYQ ASFGAAKSMN ASIASMKNME KQGKKTSGAV DKIGDSAEKT KKKIQRSLAG FDEINKLSIP DDSDKAPKAP KGGGGGGGID PIPMVAPDID LSPTSVAMQK INAMVEKLKD IISKIFQPFK NAWAREGAAT IASIKYALHG VWELIKAIGI SFLEVWTNGT GEKILVVILQ ILQNIFNIVG DIAITFADAW NAGGIGTAIV QSLANALLNA LTLIKHMGDS LRQVWGEIGP GLATTFMQIL NATSGVLENL TQKLIYVWDN GGSHLFQGFI RLGAKIFELA GYIYTNFVAP MVNWFVNMIA PVLAKLADIL GIVLDAFSNL INWLMGSGKP VLDTIIIVLG SLGASILIVK GALTLWTIAQ TIWTTVAKTS TIATTLLGGA IAFLTSPIGI AIVAITAIIA SGVALYKNWD FVKAKAIEIW GKIKDIFNSF KEWLRNVFQT DWSNCFGVLG NLLNLFLKNV DNVFQSIKKI FGGIIDFVTG VFTGNWSRAW HGVVDIFKGI MSGLGSVIKA PLNSVIGLIN MAIDGLNKIS FTTPDWIPGI GGKHFGVNIA KMPYLAKGGI VDKPTQAVIG EAGTEAVVPL ENNTGGLNLL AIKLSERINN MLLLSNNALK QPDLTMLGQN INSNEKKSIN DPEFIEKIKE VIIEAILEAM KNKKDNSYNN SGPQESGDLI LRIRDTDLGR IAIEAINKVN RQAGEQLLNL
|
| |