Gene CPF_1143 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1143 
Symbol 
ID4203761 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1304725 
End bp1306764 
Gene Length2040 bp 
Protein Length679 aa 
Translation table11 
GC content29% 
IMG OID638082024 
Producthypothetical protein 
Protein accessionYP_695589 
Protein GI110799978 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5337] Spore coat assembly protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.513503 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGATAAATG AAAAGGCAAG TAAATATATT TCTCTAATAA GTATTTCCTT AGCTGTCATA 
TTTGTTTTTG TATCCGTATT TGCTTTAAAG TCATTTAATT CTACAGAAGC ACTTAGTACT
AGTGGACAAT CTTCTACTAT TGATACGGTT TTAAATAAAA ATATTGTAAC TGATATAGAT
ATAAAGATAA AAGAAAGTGA TTGGGAATGG TTAATTGAAA ATGCTACAGA TGAGGAATAT
AGAAGTGCAG ATATAACTAT AAATGGAGAA ACTTTTTATA ATGTTGGGGT TAGACCTAAA
GGAAATTCAA GTTTATCCTC TGTTGCAAAT GATGATACAA CAGATAGATA TAGTTTAAAA
ATAGATTTTG GACAATATGT TGACGGACAG ACCTATCATG GAATAAGAAA ACTAGCTTTA
AATAATAATA TATCTGATGC CACTTATATG AAGGAAGCAA TATCTTATGA CATATATAAT
TTTCTTGGGA TTCCTACTCC AGAGTATTCT TATTCAAACA TAAAGATTAA TGGAGAACAG
TGGGGATTAT ATTTAGCATT AGAAGTAATT GAGGAGAGAT TTGTTGAAAA AAATTATGGT
GAATTAGAAG GTAATTTATA TAAACCTGAA ACTATGGGAG TAGGGGCAAA AAAAGATGAA
GGAAATAAAG ATGCCATGCC TGATATGAAA AATAATCAAG GAAGAGAAGG GGGAATGATG
AATCCACCTA ATATGCCTAA CAATGAAGGA AATAATAAAG AGGGTAATAA ACCTATGAAT
ATTCCTAATG AAAATCAAAA TATGGCTGGC AATATGAATA AAGAAAATGC AGGAATGGGA
CAAATGTCTC CTATGATGGG AGGAAAAAAT AATGAAGGTG CTGATTTTAA ATATATAGAT
GATAATATTA GCAGTTATTC TACTTTAAGA GATAGTGCAG TATTTAAGAG TACAACGGAT
GAAGATTTTG AAAATGTAAT TGAAATGATG AAAAGTTTAG AAAATGGTAG GGATATAGAG
AAATATTTAA ATGTTGATGA AGTTCTTAAG TACTTTGCAG TAAACACTTT CTTAGTTAAC
CTAGATAGTT ATTCAGGTGG AATGTATCAT AATTATTATC TTTACGAAAA TAATGGAGTT
TGTGAGATAC TTCCATGGGA CTTAAATTTA TCCTTTGGAG GATTTGCCAC AAATAGTGGA
AGTAGAGCTG TAAATTTCCC TATTGATTCT CCTGTTACAG GTAATTTAGA AAATTTCCCT
CTTATAGGAA AACTTTTAGA AAATGATGAA TATAAAGAGA AGTATCATGA GTATTTAGAT
AAGATAGTAA ATGAATATTT TAAAAGTGGT ATTTTTAGTA CTACAGTTAC CAATAATGAT
AAGTTAATAG GAGATTACGT AAAGATTGAT CCTACAGCAT TCTATACTTA TGATGAATAC
AAAAATGCCA TTAAGGAGTT ATTAGTTTTT GGAGAGGATA GAACAAAGAG TGTAGAAGCT
CAGTTAAATG GAGAGCAAAC ATCAACGGAA TATGGAAACA TTGAAACTTC ATTAAATTTA
AAAGCCTTAG GTGAACAAAA TATGGGTGGA AAAATGCCTA ATGATAAGAT GAATGAAGAA
AACTCAGTAA ATAATAATGA AGAAAATAAT AATGGACAAG CTATTCCAGA AGGTGGAAGA
CCTTTTAATG GAGGAAACAT GGGAGAAGCT CCTAATAATA TTAATAATCC TAATGGTAAT
ATGGATAATA AAATACCTAA TATGGGAAAC ATGCCTACAC AAGAAAATAT ACAAGAGGCT
ATGAAGATAT TGAATAACAG GGATTATTCA AGCTTAAGCG AAGAGGAAAA GAGACAATTA
AATGATTTAG GAATAAGTGA AGAAAATATA AATATGTTTA ATAATATTCC TAAACAAGGA
GAAAGAGGAG AAGTTAGAGA AACCTTTAAT AAAACATATT ATGTAATATT TGGAGGAGTT
ATCCTAACAC TATTAATTTC CCTAGTTTTT GTAACAAAGT ATAAAAGAAA AAGATACTAA
 
Protein sequence
MINEKASKYI SLISISLAVI FVFVSVFALK SFNSTEALST SGQSSTIDTV LNKNIVTDID 
IKIKESDWEW LIENATDEEY RSADITINGE TFYNVGVRPK GNSSLSSVAN DDTTDRYSLK
IDFGQYVDGQ TYHGIRKLAL NNNISDATYM KEAISYDIYN FLGIPTPEYS YSNIKINGEQ
WGLYLALEVI EERFVEKNYG ELEGNLYKPE TMGVGAKKDE GNKDAMPDMK NNQGREGGMM
NPPNMPNNEG NNKEGNKPMN IPNENQNMAG NMNKENAGMG QMSPMMGGKN NEGADFKYID
DNISSYSTLR DSAVFKSTTD EDFENVIEMM KSLENGRDIE KYLNVDEVLK YFAVNTFLVN
LDSYSGGMYH NYYLYENNGV CEILPWDLNL SFGGFATNSG SRAVNFPIDS PVTGNLENFP
LIGKLLENDE YKEKYHEYLD KIVNEYFKSG IFSTTVTNND KLIGDYVKID PTAFYTYDEY
KNAIKELLVF GEDRTKSVEA QLNGEQTSTE YGNIETSLNL KALGEQNMGG KMPNDKMNEE
NSVNNNEENN NGQAIPEGGR PFNGGNMGEA PNNINNPNGN MDNKIPNMGN MPTQENIQEA
MKILNNRDYS SLSEEEKRQL NDLGISEENI NMFNNIPKQG ERGEVRETFN KTYYVIFGGV
ILTLLISLVF VTKYKRKRY