Gene CPF_1867 details


Gene Information

Locus tag: CPF_1867
Symbol:
ID: 4201240
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 2100479
End bp: 2101804
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 29%
IMG OID: 638082737
Product: peptidase, M23/M37 family protein
Protein accession: YP_696301
Protein GI: 110798595
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0739] Membrane proteins related to metalloendopeptidases
TIGRFAM ID:
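
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the values listed here) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
print(gene_length(2100479, 2101804))  # 1326
print(protein_length(1326))           # 441
```

Both results match the Gene Length and Protein Length fields.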


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0957526
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAAGA GAAAATTAAC TGCAATTATA CTTAGTATAT CAATGATTTT TGCGGTTTCA 
ATAAATAGCA CTAATATTGT ACAAGCTAAG ACTACAGAAG AAGCTCAACA AGAAATTGAT
AATAATAAAA ATAAAATAGA TGATCTTAAA GATAAACAAA GTGATATAAA TTCTGAGAAG
AGTAAATCTC AAAGTAAATT AGATGAAATA CAAAAACAAG TAGCTGATAA AAATCAAAAG
TTACTTACTT CTCAAAAGAA GGTTGATGAA TATAAAGGTA ATATTGACTC CTTAAAGGAT
AGTATAGATA AACTTCAAGG ACAAATTAAT GATATTCAAA GCAATATAGA TAAGAAGAAA
AAGGAAGAAG AAGAAAAAGA AGAAATACTT TCAGGTAGAA TAAGAAGTGC TTATAAATCT
AATTTAAGCA ATCAGTTTTT ATACATAATG CTTGAATCAA AAAATGTAGG GGACTTTATA
AGTAATGTAT CAAGCATAAA ATATGTAGTG GATACAGATA ATAAGTTAAT TGATGATATA
AAGAAGGTTC AAAGTGAATT AAAAAGTGAA GAATCTCAAT TAAAAAGTCA AGAAGAAGAC
TTATCAAGTA AAAAAACTAA GTTAGAAAAT GAGAAGAAAG AATATGATAC CTTAGTAAGT
CAGTATCAAT CTCAATTAAA TGAATTAAAT TCTTTAGAAG AAGAAAAACA AGCTGAAATA
AATAGCTTAA GTGAAAAAGA AAGAACAGTA TTAGATGAAA TTAATAGTTA CGAAGAAGAT
AATGCTAATC TTAAAGATTA TATAAATAAT TTAATCAATG AGAAAAAAAG TGTTAAGGTA
AATAGTGATA ATAATAGTAA AAGTAGTACA AACAATAAAA GTACAGAGGA ATCAAGCGCA
TCTAATAATG AGGGAAATTC AGAGACAAAG GCTAATTCCT CAAGTGGATT TATGAGACCA
GCTCCAGGTG GAGTTACAGA TCCCTTTGGA CCTAGAGTGC ATCCTGTTAC AGGAAAAAGA
AGTGTTCACA CAGGGGCAGA TTTAGGAGCA TCTTATGGAA CACCTATTCT TGCATCAAAG
TCAGGTACTG TTGTTGAAGC AGGATGGAAT ACTGCTTATG GTAATATGGT TATAATAGAT
CATGGAGATG GAACAAGTAC TTTATATGGA CATTCATCTA GACTTGCTGT ACAAGCTGGT
CAACATGTAT CACAAGGACA AGTAATTGCT TATGTAGGAT CAACAGGATA TAGTACAGGA
CCTCACCTTC ATTTTGGTAT AATGATAAAT GGTGAATGGG TAAATCCTAT GAATTATATA
AGTTAA
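
The GC content field can be recomputed from the gene sequence. The following is a minimal sketch that assumes the sequence is supplied in the space-grouped, multi-line layout used above and contains only A, C, G and T; `gc_content` is a hypothetical helper name. Only the first sequence line is used here, so pass the full sequence to compare against the reported value.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases; whitespace and line breaks are ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First line of the gene sequence, copied from this record.
first_line = "ATGAACAAGA GAAAATTAAC TGCAATTATA CTTAGTATAT CAATGATTTT TGCGGTTTCA"
print(f"{gc_content(first_line):.1%}")
```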
 
Protein sequence
MNKRKLTAII LSISMIFAVS INSTNIVQAK TTEEAQQEID NNKNKIDDLK DKQSDINSEK 
SKSQSKLDEI QKQVADKNQK LLTSQKKVDE YKGNIDSLKD SIDKLQGQIN DIQSNIDKKK
KEEEEKEEIL SGRIRSAYKS NLSNQFLYIM LESKNVGDFI SNVSSIKYVV DTDNKLIDDI
KKVQSELKSE ESQLKSQEED LSSKKTKLEN EKKEYDTLVS QYQSQLNELN SLEEEKQAEI
NSLSEKERTV LDEINSYEED NANLKDYINN LINEKKSVKV NSDNNSKSST NNKSTEESSA
SNNEGNSETK ANSSSGFMRP APGGVTDPFG PRVHPVTGKR SVHTGADLGA SYGTPILASK
SGTVVEAGWN TAYGNMVIID HGDGTSTLYG HSSRLAVQAG QHVSQGQVIA YVGSTGYSTG
PHLHFGIMIN GEWVNPMNYI S
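
The protein sequence can be reproduced from the gene sequence by codon translation. Translation table 11 assigns the same amino acids to sense codons as the standard genetic code and differs only in which codons may act as starts; this gene begins with ATG, so the sketch below needs no special start handling. It is a hedged, dependency-free illustration, and `translate` and `CODON_TABLE` are names introduced here rather than anything from the source database.

```python
from itertools import product

# Compact codon table: codons enumerated in T, C, A, G order, paired with
# the amino acids of the standard code (sense codons identical in table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq: str) -> str:
    """Translate a CDS, ignoring whitespace and dropping one terminal stop."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First three 10-base groups of the gene sequence in this record.
print(translate("ATGAACAAGA GAAAATTAAC TGCAATTATA"))  # MNKRKLTAII
```

The printed residues match the first group of the protein sequence above.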