Gene CPF_1586 details


Gene Information

Locus tag: CPF_1586
Symbol:
ID: 4202382
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 1803526
End bp: 1804773
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 27%
IMG OID: 638082464
Product: HK97 family phage portal protein
Protein accession: YP_696029
Protein GI: 110799190
COG category: [S] Function unknown
COG ID: [COG4695] Phage-related protein
TIGRFAM ID: [TIGR01537] phage portal protein, HK97 family
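
The coordinates and lengths listed above are mutually consistent, which a short sketch can check. It assumes 1-based inclusive coordinates (so the length is End minus Start plus 1) and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1803526, 1804773)
print(length, protein_length_aa(length))  # prints: 1248 415
```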


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00170734
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGTTTT TTAAAAAGTT ATTTAATAAA AGAAGTAATT ATGATGAAGA GATTGGTATT 
AATATATCTG ATTCTAACTT TTGGGAGAAG TTTGGTATTA AATTAAAATT TTTAATATCA
GGTAAGAGAG TATTAAAAGA AAATACAGTT TATATATGTA CTAAGGTAAG AGCTGAAAGC
ATAGGTAAAT TATCTTTAAA GATTTACAAG GATAGAGAAG AGTATAAAGA ACATGAACTT
TATTATCTTT TAAGATATAA GCCTAATCCA TTAATGAACT CAATTAATTT TTGGAAGTGC
TTAGAAGCAC AAAGAACTTT AAAAGGTAAT GCGTATACAT ATATAGAAAG AGATAGAAGA
GGAAAGATAA TTGGTTTATA TCCTATTGAT TCAGATAATG TAACTAAAGT TATGGATGAT
AATAACTTTT TAAGTAGTTT AACTAAAGTT TGGTATATAG TAACTGACAA TAAAGGGATT
AAACATAAGT TACTTCCTGA TGAAATACTA CATTTTATTG GAGATATTAC TTTAGATGGA
TTAATAGGAA TAGCTCCACT TGATTATTTG AAATGTACTA TTGAGAATGG AAGAGCTACT
CAGGAGTTTA TAAATAAATT CTTTAAAAGT GGATTAACTA CAAAAGGAAT AATTCAATAT
GTAGGAGAGC TAGACGAAAA GGCAAAGAAA ACTTTTATAA AAGAATTTGA ATCTATGAGT
AATGGTCTAG CAAATGCTCA TTCGGTTTCA TTACTTCCTT TAGGGTATCA ATTTCAACCT
TTGTCATTAA GCATGGCAGA TGCACAATTT TTAGAAAATG CAAAATTAAC TAAAAGAGAA
TTAGCAGCAG CATTTGGAAT GAAGTCATAT CATCTTAATG ATTTAGAGAG AGCAACATTT
AATAATCTTA CAGAACAACA GAAAGATTTT TATATAACAA CACTTCAACC ATCTCTTACT
AATTATGAAC AAGAGATGCA AGATAAATTA TTAAGTCAAT ATGAAACTTT AAATAATGTG
AAAATTGAGT TTAATGTAGA TAGTATTTTA AGAAGTGATA TAAAAACAAG ATATGAAGCT
TATAGAATTG GTATTCAAAG TGGATTTATA GCTTCCAATG AGGTGAGAAA AAAAGAAAAT
TTACCACCAA AAGATGGAGG AAATGAATTA CTTATAAATG GTAATATGAT GCCTATAGCT
ATGGCTGGAA AACAATATTT GAAAGGTGGT GATAATAGTG GAGCATAA
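
The GC content field of this record (27%) is the share of G and C bases in the gene sequence. The following is a minimal sketch of that calculation, assuming the sequence is pasted as displayed, with spaces and line breaks between 10-base blocks. `gc_percent` is an illustrative helper name, and only the first line of the sequence is used here, so the printed value is not expected to equal the whole-gene figure.

```python
def gc_percent(raw: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    seq = "".join(raw.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence only (not the full gene).
first_line = "ATGAAGTTTT TTAAAAAGTT ATTTAATAAA AGAAGTAATT ATGATGAAGA GATTGGTATT"
print(round(gc_percent(first_line), 1))
```

Passing the full 1248-base sequence as one string gives the whole-gene value to compare against the reported figure.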
 
Protein sequence
MKFFKKLFNK RSNYDEEIGI NISDSNFWEK FGIKLKFLIS GKRVLKENTV YICTKVRAES 
IGKLSLKIYK DREEYKEHEL YYLLRYKPNP LMNSINFWKC LEAQRTLKGN AYTYIERDRR
GKIIGLYPID SDNVTKVMDD NNFLSSLTKV WYIVTDNKGI KHKLLPDEIL HFIGDITLDG
LIGIAPLDYL KCTIENGRAT QEFINKFFKS GLTTKGIIQY VGELDEKAKK TFIKEFESMS
NGLANAHSVS LLPLGYQFQP LSLSMADAQF LENAKLTKRE LAAAFGMKSY HLNDLERATF
NNLTEQQKDF YITTLQPSLT NYEQEMQDKL LSQYETLNNV KIEFNVDSIL RSDIKTRYEA
YRIGIQSGFI ASNEVRKKEN LPPKDGGNEL LINGNMMPIA MAGKQYLKGG DNSGA
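
The protein sequence is the conceptual translation of the gene sequence under translation table 11. As a rough sketch of that relationship: table 11 assigns amino acids to codons in the same way as the standard code (it differs in the permitted start codons), so a standard codon table is used here. `translate` is an illustrative helper, not part of the record, and only the first 30 bases of the gene are translated.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAAGTTTT TTAAAAAGTT ATTTAATAAA"))  # prints: MKFFKKLFNK
```

The output matches the first ten residues of the protein sequence shown above.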