Gene CPF_2839 details


Gene Information

Locus tag: CPF_2839
Symbol:
ID: 4200953
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 3102687
End bp: 3104309
Gene Length: 1623 bp
Protein Length: 540 aa
Translation table: 11
GC content: 29%
IMG OID: 638083706
Product: hypothetical protein
Protein accession: YP_697203
Protein GI: 110798752
COG category:
COG ID:
TIGRFAM ID:
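
The length fields in this record agree with each other if the start and end positions are read as 1-based, inclusive coordinates and the CDS ends in a stop codon. The record does not state its coordinate convention, so that reading is an assumption; the function names below are illustrative only. A minimal sketch of the arithmetic:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a complete CDS: one codon per residue, minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(3102687, 3104309)
print(length)                     # 1623
print(protein_length_aa(length))  # 540
```

Both values match the Gene Length and Protein Length fields listed in the record.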


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGCACATA ATAAATCATA TAGAACTTTT ATAATATTAC AAGAAGATGA AAAGGGGCAT 
TCTATGGCTT CAGATAAGCC TTTAACTGGA TATGCCAAGA TAGAAACTAA GAATGATAAA
TGTAAAGTTT CTTTTTACGC TCAAAACTTA AAGAAAGAAT ACAAAGATTG TTATATGATG
CTTATTTGTA ACAAGAAAGA TTTTAAGAAG AATATAAATC TTGGTCCAAT GAACATAAAT
CAACAAGGAA AGGCAGAAAT GAGTTTAGAA TATGATTCTA TCAACATAGG AGATTTAAAT
GTATCTTATG AGAATATAGT AGGAGCTGCT ATAGGAAAAA ATATTAATGG TAGAACAGTA
TTCTTTATGT GTGGATTTTT AAATAACCAA ATGCCAAAGG ATAATTGGAA AAATTATGAG
ATTAAGATGA TATCAGGCAA AGAAAAGTCT CATTACGATA AAAATAATAA AGAAGACGAA
ATGAAAAAAG AATGTCCTAT TAACATGAAA GAGCATAAGG AAAAATATAA TGATGATAAA
GATTATTATA AGGACAAGAA AGACAAAGAT GAAAAAGATA AGTATAAAGA GTATCTAAAA
GAAAAGGACA AGGATTATTT AAAAGAAGAG AATAAAGAAA AGGAATGTAA AAAAGAATAT
TGTAAGGATA CAGAAGATAA AGAGGATAAA GATGATTGCA AAGAGCATTT AAAAGAAGAA
TATAAAGAGA AAAAAGATGA CTGTAGAGGA AAAGACAAAG AAGAGTGCAA GCACCATGAT
AAAGAAGAGA AGCATGAGGA AAAATGTGAA GAGGATAAAA AATATAAAGA TGATAAAGAG
GACAAATATA AGAAAGAAGA TAAATATAAG GATAAATATG AAGTAGATGA TGATTGTAAA
GATAAACATG AAGAGCATAA AGAATATAAG GATAAAGATA CGGATAAAGA TAATCATGAA
GAAAAGAAAA TAGAAGGTTA TAAAGACTGC TATAAGGAAA AATATCACCG CAATGATAAT
TGGGATTATA GATCAAAATT ACAAGAGTGT GATAGATTTA TAAGTAAAAT AGATTTAGAA
AGAGAATATG ATCCTTATGA TGGAGAACGT TATGAGCTAG GTAGAAGATT TGCAGAGTAC
GAAAATGAAA TAGAGCAAAT GAAACTTAGA GATTGTAAGG AGAAAGAAGA AAAAACTTAT
GAAGTTGACT TTGATTGTCC TATAGGTGAA GTTTTAATGG GAGCTTTAGA AGGATGTAAG
AAGGTTCCTA AATTTGCAGA GGATATAAAA AGATGTGCAT GGTATAAGGT TGATGTTAGA
AACTTCGATG ATATGTGTAA TATGTCAAAC TATAATAAAT ACACAATGAT GTATTATCCT
ATGATTAATT ACTATCCTTA TATAAGCAAA GAAGGTCATT TCTTCTTTGG TGTAAAGTGT
GATAAGGATG GAGATATAAA ATATATTTTA TATGCTATTC CAGGAACTAA GGATAGAAAA
GATCAACCAT ACGGTGGTAG AACTGGTTTC GTTACATGGG ACAGATATGG AGATAGAGAA
AATGGCTATT GGATAATGTT CTATGATTTT GAAAACTCAT CTGTAGTTAT CCCTATGAAA
TAA
 
Protein sequence
MAHNKSYRTF IILQEDEKGH SMASDKPLTG YAKIETKNDK CKVSFYAQNL KKEYKDCYMM 
LICNKKDFKK NINLGPMNIN QQGKAEMSLE YDSINIGDLN VSYENIVGAA IGKNINGRTV
FFMCGFLNNQ MPKDNWKNYE IKMISGKEKS HYDKNNKEDE MKKECPINMK EHKEKYNDDK
DYYKDKKDKD EKDKYKEYLK EKDKDYLKEE NKEKECKKEY CKDTEDKEDK DDCKEHLKEE
YKEKKDDCRG KDKEECKHHD KEEKHEEKCE EDKKYKDDKE DKYKKEDKYK DKYEVDDDCK
DKHEEHKEYK DKDTDKDNHE EKKIEGYKDC YKEKYHRNDN WDYRSKLQEC DRFISKIDLE
REYDPYDGER YELGRRFAEY ENEIEQMKLR DCKEKEEKTY EVDFDCPIGE VLMGALEGCK
KVPKFAEDIK RCAWYKVDVR NFDDMCNMSN YNKYTMMYYP MINYYPYISK EGHFFFGVKC
DKDGDIKYIL YAIPGTKDRK DQPYGGRTGF VTWDRYGDRE NGYWIMFYDF ENSSVVIPMK
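
The gene sequence begins with TTG, while the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is one of the permitted start codons, and an initiating codon is read as methionine even though TTG encodes leucine elsewhere in the reading frame. The sketch below illustrates this on the first and last codons of the sequence above. It is not the pipeline that produced this record; the helper name `translate_cds` and the constant names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; table 11 shares these
# assignments with the standard code and differs only in its start codons.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)
# Start codons permitted by translation table 11.
START_CODONS_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS (whitespace allowed) up to, not including, the first stop codon.

    A permitted start codon in the first position is read as M.
    Ambiguous bases such as N are not handled and raise KeyError.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS_11:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence: TTG is read as M at the start.
print(translate_cds("TTGGCACATA ATAAATCATA TAGAACTTTT"))  # MAHNKSYRTF
# Last 21 bases: translation stops at the terminal TAA.
print(translate_cds("GTAGTTAT CCCTATGAAA TAA"))           # VVIPMK
```

Both outputs match the corresponding ends of the protein sequence listed above.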