Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPF_1574 |
Symbol | |
ID | 4201349 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens ATCC 13124 |
Kingdom | Bacteria |
Replicon accession | NC_008261 |
Strand | - |
Start bp | 1791790 |
End bp | 1794849 |
Gene Length | 3060 bp |
Protein Length | 1019 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 638082452 |
Product | putative phage structural protein |
Protein accession | YP_696017 |
Protein GI | 110801004 |
COG category | [S] Function unknown |
COG ID | [COG4926] Phage-related protein |
TIGRFAM ID | [TIGR01665] phage minor structural protein, N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGAAAG TACAATTATA TAAATTTAAT AATAAGAATT TTGTTAATAA TGGAGATATG ATATTAGAAC CTTCTAAATG TACTTTGAAA ATGGAATTAG TAACTGGATT AAATGAAGTA GCAATGGAGC ATGAATATGA TGAAGAGGAA AGATGGAAAT ATATTTCTAG AGATGATGTT ATAAAAGTTA GTACACCATA TAAAAAAATA CCAGAACAGC TTTATAGAAT CTACGATATA GAAAAAAACT TAGATTATAT GAGTGTAAAA GCAAGGCATA TATTCTATGA CTTAGTTGAC ATATATATTA AGGATTTAAC TAGAGAGGAA AATATATTTG ATGTTAGATG TGTAAATTGT AATGGACAAC AAGCATTAGA TAAAATTTTA AAAGGGACAG AGTTTAAAGG ACATTCAGAT ATAGAAAAGA TAGCAACATC TTATTATTTT AGAAAGAAGA TAGTACAAGC AATAGGTGGA GATGATAAAG AAAATTCTTT TTTAAGTAGA TGGGGAGGAG AATTATTTTT AAATAATTTT GATATTTACT TAAATAAAAG AGTAGGAGAA GATAATAATG TTAGAATTGC ATATAAAAAG AATTTAACAG GTTTAGTTGA AACAATAGAT ATGGATAGTT TAATAACTAG AGTAATACCA ACTGGTTATG ATGGTATCTG TATAAGTGGA ACAACTCCAT GGGTAGATTC TCCATTAATT AATAATTATA GTCATGTGTT TGAATCAGAG CAAAGATTTG AAGATATTAA GTTAAAAGGG ACACCAAACA ATAAAGGTGA AAATGCAGAG GGGTTTGATA CACAAGAGCA GGTTAATGAA GCGTTAAAAA ATAAAGGTAA GGAGCTTATA AGTAAAGGAA TAGATAAGCC GTTAGTAAAT TATAGTGTTC AATTTATTCC TTTATCAAGT ACTGAAGAAT ATAAAGAGTA TAAGTCACTA GAAGAGATTT TATTAGGAGA TACAGTATAT ATAGAACATA AACCATTAGA TATTAATATT AAAGCTAGAT GTATATCACT TGAATATGAT TGTTTAAATG AAGAAATGTT GAATTGTGAA ATAGGAAATT ATTTAGACAC ATATGCATCA GCACAAGCAG ATAAGAGTGT AACATTTGAT ACTATTGCAG GAAGCTTTGA TGGTGATGGG AATTTAGGTG GCGAAAATAT AATTGGAGCA ATAAATGCTA TGAAAGCTCC ATTATTAGCA CAAAGAGATA GAGCAAAAAA ATTAGATATA GTTGCTTGGA TTCAAGAAGT TTTGGATCCA AATGATCCTG ATTTTGGATG TGTTCAAGGT GGGACAAAAG GGATTCTTTT ATCTGATAAA AGATTAAGCG ATAATTCAGG GTGGGACTTT AAAACAGCTA TAACTCCAAA AGGAATAATA GCTGACGAAT TAATAGGTAT TTTAACAACA GTTTTAATAC GGAATATGGA TAAAAGCTTT GAGATAGATT TAAAGAAACC TGGTGGAGCT TTATTTAGAA ACAATGGAAA AGATGCAATA AAGATAGAAA ATAATATGAT TAAGCTTTAC AATTGGAAAA AGAACGGCGA TTATATAGGT GCGTTAATGT CATTGGTTCA AGGAGATGAT GAAAATAAGC CTTTGATTGG GTTAGCAAAT GATATTGATA GTGCAATGTC TCTTGGGTAT GCTGTTGAAG GTAAGACAAA AGTTCCTTCT TATATTGCGC TTGACAAATA TAACATTTTA GATGATTCAG GTGGAAAGCC AGTAAGAATA TATGAAGAAG TAGATTTTAA AGGGAACAAA GTTTATAACA TAGATATTCG TTCTGATAAT GGTACAAATA GCATTCATGT TGGAGATCAT TTTATTAATA TAACTACACC TGATAATGAA ATTGTAGTTG CAGGTTCAGG AACAAGGATA GGAAAAGATA AATCTTTATA TTATGATGCT AGAACTGGAG AATTGAGATG TAATGACCTA GTATTAGATG GTGTTATTAA AAATACAAGT GGTACTACTG TATTTGACCC AAACTCTCCT ATAGGAGGTG GTGTAGATAC CCTAGGGAAT GTTAGTAAAG GAATACCTTC AAGAAAATAT TTCAGGTATG TCAAAGGAAT AGAAGGCCTA CAACAATATC CAGGTAATAT TGGAGATGGC CAAATAACAT ATGGTTATGG AGTTACTAAA GCTAATGAGC CAACATACTT TGCTAAATTA GGTAATCCAC CTTGTTCTGA AGAAACTGCA TCTAAAGTTT TATTTGAATT AATACCAGAC AGATATGGCT CTTTAGTTAA AAATCAAATG CTTAAAGATG GTTTAGACCT TAGTAAAGTT AATATAAATG TTTTTGATGC ATTTGTAGAT TTATGCTATA ACTCAGGATA TTATAATTCT CGTATGTACA GAGCTTGGAT AAGGGGTGCT AGTATTGATG AAATTTATAA TGATTGGCTA ACATATGCAA CTATGCCTGG AACAATTTTT GAAAAAGGAT TAAAGCGTAG AAGAAAGGAA GAAGCTGAAA TGTTTAAAAA TGCTAACTAC ATTATGTCTC CTATTGGAAT TTTAAACGCA AGTGGAAATC AAATAGGCAC AGTAAAAGGT GATGGATACT TCCCACCTAT AGAAAGTAGT AACTTTAAAA CAATAAATAA TGAGTATGGT AATGGTTGGA TTATTCCAGT AAGTAATGGA CATGTAACAG CAACATTCCC TTATTATCCT TCAGGGGCTC AACATTCAGG AATAGATTTT GGTGTTCCTA TAGGTACGCC AGTTAGAGCT TCAAAGTCAG GTAAAGTTAT AAAAAGAAGA GAATTAACTA CAAGCTATGG TAAATATTTA TTTATAGATC ATGGCGGTGG ATTAGTTACT ATTTACGCTC ATAATAGCGA GTTGCTAGTA AATGAAGGTG ATACAGTAAA AGCAGGACAA GTTATATCTA GAAGTGGTAA TACTGGAAAT TCATCAGGCC CACATTGTCA TTGGGAACTT AGAGTTAATG GTACAGCTCA AAATATAGCT CCTTCTTTAA AAGTTGGAGA TTTAGTGTAA
|
Protein sequence | MGKVQLYKFN NKNFVNNGDM ILEPSKCTLK MELVTGLNEV AMEHEYDEEE RWKYISRDDV IKVSTPYKKI PEQLYRIYDI EKNLDYMSVK ARHIFYDLVD IYIKDLTREE NIFDVRCVNC NGQQALDKIL KGTEFKGHSD IEKIATSYYF RKKIVQAIGG DDKENSFLSR WGGELFLNNF DIYLNKRVGE DNNVRIAYKK NLTGLVETID MDSLITRVIP TGYDGICISG TTPWVDSPLI NNYSHVFESE QRFEDIKLKG TPNNKGENAE GFDTQEQVNE ALKNKGKELI SKGIDKPLVN YSVQFIPLSS TEEYKEYKSL EEILLGDTVY IEHKPLDINI KARCISLEYD CLNEEMLNCE IGNYLDTYAS AQADKSVTFD TIAGSFDGDG NLGGENIIGA INAMKAPLLA QRDRAKKLDI VAWIQEVLDP NDPDFGCVQG GTKGILLSDK RLSDNSGWDF KTAITPKGII ADELIGILTT VLIRNMDKSF EIDLKKPGGA LFRNNGKDAI KIENNMIKLY NWKKNGDYIG ALMSLVQGDD ENKPLIGLAN DIDSAMSLGY AVEGKTKVPS YIALDKYNIL DDSGGKPVRI YEEVDFKGNK VYNIDIRSDN GTNSIHVGDH FINITTPDNE IVVAGSGTRI GKDKSLYYDA RTGELRCNDL VLDGVIKNTS GTTVFDPNSP IGGGVDTLGN VSKGIPSRKY FRYVKGIEGL QQYPGNIGDG QITYGYGVTK ANEPTYFAKL GNPPCSEETA SKVLFELIPD RYGSLVKNQM LKDGLDLSKV NINVFDAFVD LCYNSGYYNS RMYRAWIRGA SIDEIYNDWL TYATMPGTIF EKGLKRRRKE EAEMFKNANY IMSPIGILNA SGNQIGTVKG DGYFPPIESS NFKTINNEYG NGWIIPVSNG HVTATFPYYP SGAQHSGIDF GVPIGTPVRA SKSGKVIKRR ELTTSYGKYL FIDHGGGLVT IYAHNSELLV NEGDTVKAGQ VISRSGNTGN SSGPHCHWEL RVNGTAQNIA PSLKVGDLV
|
| |