Gene CPF_0156 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0156 
Symbol 
ID4203419 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp184140 
End bp185642 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content30% 
IMG OID638081037 
Productperfringolysin O 
Protein accessionYP_694620 
Protein GI110798884 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.772662 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAAGAT TTAAGAAAAC AAAATTAATA GCAAGTATTG CAATGGCTTT ATGTCTGTTT 
TCTCAACCAG TAATCAGTTT CTCAAAGGAT ATAACAGATA AAAATCAAAG TATTGATTCT
GGAATATCAA GTTTAAGTTA CAATAGAAAT GAAGTTTTAG CTAGTAATGG AGATAAAATT
GAAAGCTTTG TTCCAAAGGA AGGTAAAAAA GCTGGTAATA AATTTATAGT TGTAGAACGT
CAAAAAAGAT CCCTTACAAC ATCACCAGTA GATATATCAA TAATTGATTC TGTAAATGAC
CGTACATATC CAGGAGCATT ACAACTTGCA GATAAAGCAT TTGTGGAAAA TAGACCTACA
ATCTTAATGG TAAAAAGAAA GCCTATTAAC ATTAATATAG ATTTACCAGG ATTAAAGGGC
GAAAATAGTA TAAAGGTTGA TGATCCAACC TATGGAAAAG TTTCTGGAGC AATTGATGAA
TTAGTATCTA AGTGGAATGA AAAGTATTCA TCTACACATA CTTTACCAGC AAGAACTCAA
TATTCAGAAT CTATGGTTTA TAGTAAATCA CAAATATCAA GTGCCCTTAA TGTTAATGCT
AAAGTCCTTG AAAACTCACT TGGAGTAGAC TTTAATGCAG TAGCAAACAA TGAGAAAAAA
GTTATGATTT TAGCATATAA ACAAATATTC TATACAGTAA GTGCAGATTT ACCTAAGAAT
CCATCAGATC TTTTTGATGA CAGTGTTACA TTTAATGATT TAAAACAAAA GGGAGTAAGT
AATGAAGCAC CTCCACTTAT GGTTTCAAAT GTAGCTTATG GAAGAACAAT ATATGTTAAG
TTAGAAACTA CTTCTAGTAG TAAAGATGTA CAAGCTGCTT TCAAAGCTCT TATAAAGAAC
ACTGATATAA AAAATAGTCA ACAATATAAA GATATTTATG AAAATAGTTC CTTCACAGCA
GTAGTTTTAG GAGGAGATGC ACAAGAACAT AACAAAGTTG TAACTAAAGA CTTTGATGAA
ATAAGAAAAG TAATTAAAGA CAATGCAACT TTTAGTACAA AAAACCCAGC ATATCCAATA
TCTTATACTA GTGTTTTCTT AAAAGATAAC TCAGTTGCTG CTGTTCACAA TAAAACAGAT
TATATAGAAA CAACTTCTAC AGAGTATTCT AAGGGAAAAA TAAACTTAGA TCATAGTGGA
GCCTATGTTG CACAGTTTGA AGTAGCCTGG GATGAAGTTT CATATGACAA AGAAGGAAAT
GAAGTTTTAA CTCATAAAAC ATGGGATGGA AATTATCAAG ATAAAACAGC TCACTATTCA
ACAGTAATAC CTCTTGAAGC TAATGCAAGA AATATAAGAA TAAAAGCAAG AGAGTGTACA
GGCCTTGCTT GGGAATGGTG GAGAGATGTT ATAAGTGAAT ATGATGTTCC ATTAACAAAT
AATATAAATG TTTCAATATG GGGAACAACT TTATACCCTG GATCTAGTAT TACTTACAAT
TAA
 
Protein sequence
MIRFKKTKLI ASIAMALCLF SQPVISFSKD ITDKNQSIDS GISSLSYNRN EVLASNGDKI 
ESFVPKEGKK AGNKFIVVER QKRSLTTSPV DISIIDSVND RTYPGALQLA DKAFVENRPT
ILMVKRKPIN INIDLPGLKG ENSIKVDDPT YGKVSGAIDE LVSKWNEKYS STHTLPARTQ
YSESMVYSKS QISSALNVNA KVLENSLGVD FNAVANNEKK VMILAYKQIF YTVSADLPKN
PSDLFDDSVT FNDLKQKGVS NEAPPLMVSN VAYGRTIYVK LETTSSSKDV QAAFKALIKN
TDIKNSQQYK DIYENSSFTA VVLGGDAQEH NKVVTKDFDE IRKVIKDNAT FSTKNPAYPI
SYTSVFLKDN SVAAVHNKTD YIETTSTEYS KGKINLDHSG AYVAQFEVAW DEVSYDKEGN
EVLTHKTWDG NYQDKTAHYS TVIPLEANAR NIRIKARECT GLAWEWWRDV ISEYDVPLTN
NINVSIWGTT LYPGSSITYN