Gene CPF_0472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0472 
Symbol 
ID4201350 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp559699 
End bp561078 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content21% 
IMG OID638081354 
Productputative capK protein 
Protein accessionYP_694927 
Protein GI110801005 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1541] Coenzyme F390 synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000021402 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATAAAA GTTTTGTAGA TAAAATACCA GAAATATTTT TTGCTCCATT AGAATTTCTT 
AATAATAAAA TTTTTAAGAA ATTAATAAAA AGTAGAAAAT ATAAAAAAAT ATATAAGAAT
ACTATTAATG AAATAAAAAG ATTTGAGATA ATGAATTCAA GTGAAATAGA AGAATTTTCA
TTTAAAAAAT TAAAAAAATT ATTAATATAT GCTTTTGAAA ATGTACCATA TTACAATAAT
TTATTTAATT CAATAAACTT TAATCCTAAT TATATGAAAA ATATATCTGA AATAAAAAAA
ATACCTTTAT TAGATAAAAG AATAATAAAC GAAAATTATA ATAATCTTAT ATCAAATGAA
TTTAAGAATA ATAAATCAAA ATTAATTAAA TTATCCACTG GTGGAAGTAC AGGCAAGCCT
TTAGAATTTT TAAGTCTTAA ATATTATAAT GAAGCTAGAG AAAATGCATT TATAGATTAT
ATATTTAGCA AGAATGGATA TAAAAAAGAT ATGAGAACAA TAATTTTAAG AGGAGATAAA
GTAAGAAATA TAGATAAAAA AAAAGCTAAA AATATATATT GGAGAAAAAA AAGAGGAACA
AATGATCTTA TTATGTCTAC TTATATGCTT AATGATGTAA GTTGTATTTT CTACATAAAA
AAATTAATTA AATACAAGGC GAAGTGCATA AAGGCCTATC CAAGTGCTTT AGAATTATTA
GCTAATAAGA TGATTAGCAT GAATTTAAAA AATAAGGATG TAGAATTGAT AATTTGTGCA
TCAGAAAATT TAAGTTGTAT GCAAATAGAT ATCTTTAAAA AAGTTTTTTG TAATGCTAAA
ATATTTGATT TTTATGGTCA TACGGAACAT TGTTGCTTAG CAGAGTTAAA CAATAAAAAT
AAATACACTT TTATACCGAA TTATGGATAC GTTGAATTAG AAGAAAATGA TGATGGGAGT
TTTGAGATAG TTTGCTCTGG CTATAACAAT TATGTTATGC CTTTCATTAG ATATAAAACC
GGAGATTTAG TTAAAGATTT TAATTTTATA GATGGAAAAT TAGAGGTTAA TAAGATAGAA
GGTAGAAAAA AAGAGTATAT TATTGATAAA CTAGGTAATA AAATAGTTTT TACAGGATCA
TATAAAATAC TAGATTGTGT AATAAATAAG ATTTTAGCTG GACAAATAAT TCAAGAAAAC
ATAGGACATT TGTATGTTGA TGTTATTGTT AATGAAAAAT TTGAGGATAA TGATAAAAAT
TCTATAATTT TAGCATTTAA ATCTAAATAT AAAGATAGAT TTGATGTTGA TGTAAGGATA
GTAAAAGAGC TAAGAAAAAC CAGTAGAGGA AAGATTAATT TCTTTATTCA GTGTATTTAA
 
Protein sequence
MYKSFVDKIP EIFFAPLEFL NNKIFKKLIK SRKYKKIYKN TINEIKRFEI MNSSEIEEFS 
FKKLKKLLIY AFENVPYYNN LFNSINFNPN YMKNISEIKK IPLLDKRIIN ENYNNLISNE
FKNNKSKLIK LSTGGSTGKP LEFLSLKYYN EARENAFIDY IFSKNGYKKD MRTIILRGDK
VRNIDKKKAK NIYWRKKRGT NDLIMSTYML NDVSCIFYIK KLIKYKAKCI KAYPSALELL
ANKMISMNLK NKDVELIICA SENLSCMQID IFKKVFCNAK IFDFYGHTEH CCLAELNNKN
KYTFIPNYGY VELEENDDGS FEIVCSGYNN YVMPFIRYKT GDLVKDFNFI DGKLEVNKIE
GRKKEYIIDK LGNKIVFTGS YKILDCVINK ILAGQIIQEN IGHLYVDVIV NEKFEDNDKN
SIILAFKSKY KDRFDVDVRI VKELRKTSRG KINFFIQCI