Gene CPF_1574 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1574 
Symbol 
ID4201349 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1791790 
End bp1794849 
Gene Length3060 bp 
Protein Length1019 aa 
Translation table11 
GC content31% 
IMG OID638082452 
Productputative phage structural protein 
Protein accessionYP_696017 
Protein GI110801004 
COG category[S] Function unknown 
COG ID[COG4926] Phage-related protein 
TIGRFAM ID[TIGR01665] phage minor structural protein, N-terminal region 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGAAAG TACAATTATA TAAATTTAAT AATAAGAATT TTGTTAATAA TGGAGATATG 
ATATTAGAAC CTTCTAAATG TACTTTGAAA ATGGAATTAG TAACTGGATT AAATGAAGTA
GCAATGGAGC ATGAATATGA TGAAGAGGAA AGATGGAAAT ATATTTCTAG AGATGATGTT
ATAAAAGTTA GTACACCATA TAAAAAAATA CCAGAACAGC TTTATAGAAT CTACGATATA
GAAAAAAACT TAGATTATAT GAGTGTAAAA GCAAGGCATA TATTCTATGA CTTAGTTGAC
ATATATATTA AGGATTTAAC TAGAGAGGAA AATATATTTG ATGTTAGATG TGTAAATTGT
AATGGACAAC AAGCATTAGA TAAAATTTTA AAAGGGACAG AGTTTAAAGG ACATTCAGAT
ATAGAAAAGA TAGCAACATC TTATTATTTT AGAAAGAAGA TAGTACAAGC AATAGGTGGA
GATGATAAAG AAAATTCTTT TTTAAGTAGA TGGGGAGGAG AATTATTTTT AAATAATTTT
GATATTTACT TAAATAAAAG AGTAGGAGAA GATAATAATG TTAGAATTGC ATATAAAAAG
AATTTAACAG GTTTAGTTGA AACAATAGAT ATGGATAGTT TAATAACTAG AGTAATACCA
ACTGGTTATG ATGGTATCTG TATAAGTGGA ACAACTCCAT GGGTAGATTC TCCATTAATT
AATAATTATA GTCATGTGTT TGAATCAGAG CAAAGATTTG AAGATATTAA GTTAAAAGGG
ACACCAAACA ATAAAGGTGA AAATGCAGAG GGGTTTGATA CACAAGAGCA GGTTAATGAA
GCGTTAAAAA ATAAAGGTAA GGAGCTTATA AGTAAAGGAA TAGATAAGCC GTTAGTAAAT
TATAGTGTTC AATTTATTCC TTTATCAAGT ACTGAAGAAT ATAAAGAGTA TAAGTCACTA
GAAGAGATTT TATTAGGAGA TACAGTATAT ATAGAACATA AACCATTAGA TATTAATATT
AAAGCTAGAT GTATATCACT TGAATATGAT TGTTTAAATG AAGAAATGTT GAATTGTGAA
ATAGGAAATT ATTTAGACAC ATATGCATCA GCACAAGCAG ATAAGAGTGT AACATTTGAT
ACTATTGCAG GAAGCTTTGA TGGTGATGGG AATTTAGGTG GCGAAAATAT AATTGGAGCA
ATAAATGCTA TGAAAGCTCC ATTATTAGCA CAAAGAGATA GAGCAAAAAA ATTAGATATA
GTTGCTTGGA TTCAAGAAGT TTTGGATCCA AATGATCCTG ATTTTGGATG TGTTCAAGGT
GGGACAAAAG GGATTCTTTT ATCTGATAAA AGATTAAGCG ATAATTCAGG GTGGGACTTT
AAAACAGCTA TAACTCCAAA AGGAATAATA GCTGACGAAT TAATAGGTAT TTTAACAACA
GTTTTAATAC GGAATATGGA TAAAAGCTTT GAGATAGATT TAAAGAAACC TGGTGGAGCT
TTATTTAGAA ACAATGGAAA AGATGCAATA AAGATAGAAA ATAATATGAT TAAGCTTTAC
AATTGGAAAA AGAACGGCGA TTATATAGGT GCGTTAATGT CATTGGTTCA AGGAGATGAT
GAAAATAAGC CTTTGATTGG GTTAGCAAAT GATATTGATA GTGCAATGTC TCTTGGGTAT
GCTGTTGAAG GTAAGACAAA AGTTCCTTCT TATATTGCGC TTGACAAATA TAACATTTTA
GATGATTCAG GTGGAAAGCC AGTAAGAATA TATGAAGAAG TAGATTTTAA AGGGAACAAA
GTTTATAACA TAGATATTCG TTCTGATAAT GGTACAAATA GCATTCATGT TGGAGATCAT
TTTATTAATA TAACTACACC TGATAATGAA ATTGTAGTTG CAGGTTCAGG AACAAGGATA
GGAAAAGATA AATCTTTATA TTATGATGCT AGAACTGGAG AATTGAGATG TAATGACCTA
GTATTAGATG GTGTTATTAA AAATACAAGT GGTACTACTG TATTTGACCC AAACTCTCCT
ATAGGAGGTG GTGTAGATAC CCTAGGGAAT GTTAGTAAAG GAATACCTTC AAGAAAATAT
TTCAGGTATG TCAAAGGAAT AGAAGGCCTA CAACAATATC CAGGTAATAT TGGAGATGGC
CAAATAACAT ATGGTTATGG AGTTACTAAA GCTAATGAGC CAACATACTT TGCTAAATTA
GGTAATCCAC CTTGTTCTGA AGAAACTGCA TCTAAAGTTT TATTTGAATT AATACCAGAC
AGATATGGCT CTTTAGTTAA AAATCAAATG CTTAAAGATG GTTTAGACCT TAGTAAAGTT
AATATAAATG TTTTTGATGC ATTTGTAGAT TTATGCTATA ACTCAGGATA TTATAATTCT
CGTATGTACA GAGCTTGGAT AAGGGGTGCT AGTATTGATG AAATTTATAA TGATTGGCTA
ACATATGCAA CTATGCCTGG AACAATTTTT GAAAAAGGAT TAAAGCGTAG AAGAAAGGAA
GAAGCTGAAA TGTTTAAAAA TGCTAACTAC ATTATGTCTC CTATTGGAAT TTTAAACGCA
AGTGGAAATC AAATAGGCAC AGTAAAAGGT GATGGATACT TCCCACCTAT AGAAAGTAGT
AACTTTAAAA CAATAAATAA TGAGTATGGT AATGGTTGGA TTATTCCAGT AAGTAATGGA
CATGTAACAG CAACATTCCC TTATTATCCT TCAGGGGCTC AACATTCAGG AATAGATTTT
GGTGTTCCTA TAGGTACGCC AGTTAGAGCT TCAAAGTCAG GTAAAGTTAT AAAAAGAAGA
GAATTAACTA CAAGCTATGG TAAATATTTA TTTATAGATC ATGGCGGTGG ATTAGTTACT
ATTTACGCTC ATAATAGCGA GTTGCTAGTA AATGAAGGTG ATACAGTAAA AGCAGGACAA
GTTATATCTA GAAGTGGTAA TACTGGAAAT TCATCAGGCC CACATTGTCA TTGGGAACTT
AGAGTTAATG GTACAGCTCA AAATATAGCT CCTTCTTTAA AAGTTGGAGA TTTAGTGTAA
 
Protein sequence
MGKVQLYKFN NKNFVNNGDM ILEPSKCTLK MELVTGLNEV AMEHEYDEEE RWKYISRDDV 
IKVSTPYKKI PEQLYRIYDI EKNLDYMSVK ARHIFYDLVD IYIKDLTREE NIFDVRCVNC
NGQQALDKIL KGTEFKGHSD IEKIATSYYF RKKIVQAIGG DDKENSFLSR WGGELFLNNF
DIYLNKRVGE DNNVRIAYKK NLTGLVETID MDSLITRVIP TGYDGICISG TTPWVDSPLI
NNYSHVFESE QRFEDIKLKG TPNNKGENAE GFDTQEQVNE ALKNKGKELI SKGIDKPLVN
YSVQFIPLSS TEEYKEYKSL EEILLGDTVY IEHKPLDINI KARCISLEYD CLNEEMLNCE
IGNYLDTYAS AQADKSVTFD TIAGSFDGDG NLGGENIIGA INAMKAPLLA QRDRAKKLDI
VAWIQEVLDP NDPDFGCVQG GTKGILLSDK RLSDNSGWDF KTAITPKGII ADELIGILTT
VLIRNMDKSF EIDLKKPGGA LFRNNGKDAI KIENNMIKLY NWKKNGDYIG ALMSLVQGDD
ENKPLIGLAN DIDSAMSLGY AVEGKTKVPS YIALDKYNIL DDSGGKPVRI YEEVDFKGNK
VYNIDIRSDN GTNSIHVGDH FINITTPDNE IVVAGSGTRI GKDKSLYYDA RTGELRCNDL
VLDGVIKNTS GTTVFDPNSP IGGGVDTLGN VSKGIPSRKY FRYVKGIEGL QQYPGNIGDG
QITYGYGVTK ANEPTYFAKL GNPPCSEETA SKVLFELIPD RYGSLVKNQM LKDGLDLSKV
NINVFDAFVD LCYNSGYYNS RMYRAWIRGA SIDEIYNDWL TYATMPGTIF EKGLKRRRKE
EAEMFKNANY IMSPIGILNA SGNQIGTVKG DGYFPPIESS NFKTINNEYG NGWIIPVSNG
HVTATFPYYP SGAQHSGIDF GVPIGTPVRA SKSGKVIKRR ELTTSYGKYL FIDHGGGLVT
IYAHNSELLV NEGDTVKAGQ VISRSGNTGN SSGPHCHWEL RVNGTAQNIA PSLKVGDLV