Gene CPF_1947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1947 
Symbol 
ID4201347 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2187208 
End bp2188215 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content32% 
IMG OID638082816 
Productputative membrane-associated zinc metalloprotease 
Protein accessionYP_696380 
Protein GI110801002 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0750] Predicted membrane-associated Zn-dependent proteases 1 
TIGRFAM ID[TIGR00054] RIP metalloprotease RseP 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTATATAA TTTTTGCTTT ATTAGCATTT AGTGCTTTAA TCTTAGTTCA TGAGCTTGGA 
CATTTTATTG TGGCTAAATT AAATGGGATT TATGTTGAGG AATTTGCTAT TGGTATGGGT
CCAAAGCTTT TTGGTGTAAA AGTTGGAGAA ACAGAATATA ACTTAAGAAT CCTTCCTTTT
GGTGGATTTG TTAAAATGCT TGGTGAAGAA GATGAAAGTG ATGATTCAGG AAGCTTAAAT
GCAAAAACTC CTATTCAAAG AATACTTGTT ATGGGAGCAG GGGCATTTAT GAATTATGTA
TTAGCCCTTA TAATATTTAT TGGGCTAGCT ATGAGTTCTG GCTTTGCAGA AAATAAAGTA
GCAAGTGTAG TACCTAATTC ACCAGCACAA GAAATAGGAA TTGAACAGGG AGATGAATTC
CTAAAAATAG ATGGAAACAA AATTCATACA ACTGATGATT TTAGAATGGG ATTAGCTTTA
GCTAAGGGAA ATCCAGTTGA ATTAGAGATA AAAAGAGGCA ATGATGTCTT AACAAAAACA
GTACAGCCGA TTTTAAATGA GAGTGGAATG TATCAAGTTG GAATAAGCTA TGCTTTAGTT
GAAAAACCTA CTTTGCTTCA AGGAATAAAG CAAGGATTTA ATGAAACAAG AAGTCTTGTA
TCTCAGTCAT TTATTGCATT AAAGACTATT GTTACAGGAG AAGCAAATTT AAAAACTGAT
GTAGGAGGTC CTGTTACAAT AATTAAAATG TCAGGGCAAG CAGCAAAAGC AGGAGCAAAT
ACTCTTTTAT GGTTTATGGC ATTTTTAAGT GTTCAATTAG CAGTATTTAA CCTTTTACCA
TTCCCAGCTT TAGATGGTGG AAGAATATTT ATAGAGCTTA TTCAAATGAT AATTAGAAAA
GAAATACCTG CTAAATATAT TGAAGCTGTA AATACCGTTG GATTCATGCT CCTTATGGGA
CTTATGGTAT TAGTAACTAT AAAAGATATA ATATTCCCGA TACTATAG
 
Protein sequence
MYIIFALLAF SALILVHELG HFIVAKLNGI YVEEFAIGMG PKLFGVKVGE TEYNLRILPF 
GGFVKMLGEE DESDDSGSLN AKTPIQRILV MGAGAFMNYV LALIIFIGLA MSSGFAENKV
ASVVPNSPAQ EIGIEQGDEF LKIDGNKIHT TDDFRMGLAL AKGNPVELEI KRGNDVLTKT
VQPILNESGM YQVGISYALV EKPTLLQGIK QGFNETRSLV SQSFIALKTI VTGEANLKTD
VGGPVTIIKM SGQAAKAGAN TLLWFMAFLS VQLAVFNLLP FPALDGGRIF IELIQMIIRK
EIPAKYIEAV NTVGFMLLMG LMVLVTIKDI IFPIL