Gene CPF_1587 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1587 
Symbol 
ID4201712 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1804774 
End bp1806513 
Gene Length1740 bp 
Protein Length579 aa 
Translation table11 
GC content30% 
IMG OID638082465 
Productputative phage terminase, large subunit 
Protein accessionYP_696030 
Protein GI110800668 
COG category[R] General function prediction only 
COG ID[COG4626] Phage terminase-like protein, large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0773483 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATAATA CAGTTCTTGA AGAGCTTATT GATTATTCTA ATAAAATACT AAATTGTGAA 
ATTGTTGCTT GTAAAAGACA TAAACAAGCT TGTCAGAGAT TTCTAAATGA TTTAGAAAGA
ATGGAGCATG AAGATTTCGA GTATTATTGG GATGAAGAGG AAGCTCAAAA AATTGTTAAG
TGGTATAGTT ACTGTAAACA TTCAAAAGGA GTATTAGAGG GACAACCAAT AATATTAAAT
TCATGGTCAA AGTTTGTAAT TTGTAATATA GAAGCTTGGA AGCATAAGGA TACAAATTAT
AGAAGGTTTA GATTTGCTTT TATTCAAGTA GGGAGAAAAA ATGCAAAATC TCAAATGGAA
GCTGGAATGG CTGGTTATGA AATAGGAGCA AAAGGGTATA ATGCAGCAGA AGTTTATACT
TTAGGAGTTG AAAGAGATCA GGCTAAAATT GTTTTTGATG AATGGGAGCT AATGACTTCT
AAACCATTAA AGAAGAAATT TAAGTTTACT CAAAAAGAAA TACGACATAG AAATAGTAAT
AGTTTTATGA AGCATTTAAG TAAAAAAGCT GGTAAAACTG GTGATGGTAA GAATCCACAA
ATGGCTATTA TAGATGAGTA TCATGCACAT CCTAATTCAG ATATGTATGA TGTTATGAAA
TCAGGTATGA TGGCAAGAAC AGAGCCATTA TTAGTAATAA TAACTACGGC TGGAATGGAC
TACGAAGAAA CGGCTTGTTA TTATGAATAT TTAGATTGTT GTTCAATATT AGATGGAACT
TTTGAGAATG ATAAATACTT TGTAATGATT TGTGAGCTAG AAAAAGAAGA TGATCCTTTT
GATGAAGAAG TTTGGTTAAA GGCTAATCCA ATTTTATGTA CTTATCCTGA AGGAATACAA
AGCATGAGGG AAAATGCTAA ATTAGCTAAG AATACAAGTA ATGAAAAGAA GAGAATAGAG
TTCTTTACTA AGAATTGTAA TATATATGTT GCAGCAGGAG AAAAAAGGTA TGTTGATGTT
GAATACTGGA AAGCTTGTAA AAAGGAATTA ACCTTAGAGG ATTTCAGAGG ACATGATTGT
TATATTGGAA TAGATTTATC AAAGTCAGGG GATTTAACTT CAATTGCTTT TGAGTTTCCT
TATTTAGATG GGAATATTAG ACGATATGCT TTATTTGGAC AATCATTTAT ACCATCAGAA
GTAGTTAAAG AAAAAATGAT AACTGACAAT GTACCATATG AATTATGGAG TAAAAAAGGT
TGGTTAATAA AGACAGAAGC TAATGATGGT TTAATAGTAG ATTTTTGGGC AGTTCTAAAT
ACTATAGAAA GTATTGTAAA AGAATATGAA CTAAATGTTA TAGAAGTTAG TTATGACCCT
CATGGAGCTG CAATGTTAGT TGGAGAACTA GAAAGAAAGG ATTATACCTG TGTAGAATGT
GGACAAAGTT GTGCAAAACT AAATGAAGCT ACTGTAAATT TTAGAGATTT AATGAAAGTT
AAGCAACTTG AACATGATGA TAATAAACTT ATGACTTGGT GTGTTCAAAA TGCAGAGATT
GATTCCAACT CTTTTGGAGA AATAAAAATA AGTAAAAAAA GCAGATTTAA AAGAATTGAC
CCATTAGCAA GTTGTATATT CGCTCATGTT AGAGCTATAA CATATTGGAA AAGAGAAAAC
TTAAATGTGA GTGAATTTGC AGAAGAAGAT TTCTTAAAGA GATTATGGGG GAGAAAATAG
 
Protein sequence
MYNTVLEELI DYSNKILNCE IVACKRHKQA CQRFLNDLER MEHEDFEYYW DEEEAQKIVK 
WYSYCKHSKG VLEGQPIILN SWSKFVICNI EAWKHKDTNY RRFRFAFIQV GRKNAKSQME
AGMAGYEIGA KGYNAAEVYT LGVERDQAKI VFDEWELMTS KPLKKKFKFT QKEIRHRNSN
SFMKHLSKKA GKTGDGKNPQ MAIIDEYHAH PNSDMYDVMK SGMMARTEPL LVIITTAGMD
YEETACYYEY LDCCSILDGT FENDKYFVMI CELEKEDDPF DEEVWLKANP ILCTYPEGIQ
SMRENAKLAK NTSNEKKRIE FFTKNCNIYV AAGEKRYVDV EYWKACKKEL TLEDFRGHDC
YIGIDLSKSG DLTSIAFEFP YLDGNIRRYA LFGQSFIPSE VVKEKMITDN VPYELWSKKG
WLIKTEANDG LIVDFWAVLN TIESIVKEYE LNVIEVSYDP HGAAMLVGEL ERKDYTCVEC
GQSCAKLNEA TVNFRDLMKV KQLEHDDNKL MTWCVQNAEI DSNSFGEIKI SKKSRFKRID
PLASCIFAHV RAITYWKREN LNVSEFAEED FLKRLWGRK