Gene CPF_2117 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_2117 
SymbolspoVD 
ID4202985 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2347513 
End bp2349732 
Gene Length2220 bp 
Protein Length739 aa 
Translation table11 
GC content33% 
IMG OID638082982 
Productstage V sporulation protein D 
Protein accessionYP_696546 
Protein GI261876156 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0768] Cell division protein FtsI/penicillin-binding protein 2 
TIGRFAM ID[TIGR02214] stage V sporulation protein D 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAAAGA AGAAAAACAA AGGATTTAAA GATTATAAAA TAACAAAATC GCTTAGATTT 
GGAAGAACAT ATTGGACCAT GATAGTAGTA TGGGGATTAC TAGGAGTATT AACCTTAAGA
TTATCATACG TAATGATTTT TAAGCATAAA GAATATGGTT CAATGGCAGA AGAACAATGG
AAAAATGAAA TAAAGATAGG TGCTAAAAGA GGAGAAATAC TAGATAGAAA TGGAGCTCAA
TTAGCTGTTA GCGCTAATGT TTATAGAGTG GATTTAGATC TTAAAACTTT AAGAGAAGAT
ACATTTACTA AAGATGATAC AGATGATACA AAAAAAGAAA AGCTTAATAA AATTGCAGGG
GAGCTAGGTA CTGTTTTAGA TATGCCTAAA GAAGAGGTAT ATGATAAGAT AAATAGCACT
TTACCATCAG GATTGCCAGC TACTTCTGTT ACTTTAATTA GAAAGATAGA AAAAGATAAA
GCTGATAGTG CTAAGAATTT AAAGATAAGA GGAGTAATAG TTTCACAGGA TACAAAGAGA
TATTACCCAG ATAATAATTT CTTAGCTCAA GTATTAGGAA GAGTTGATGC AGATGGTATA
GGTCAAGGTG GTATAGAGCG TGAATATAAT GTGGAGTTAT CAGGTTTACC AGGAATGAGA
ATATCAGAAG TTGCAAGAAA TAGTAGTGGA ATACCTTATT CAAATTCTGA ATTTGCAAAG
CCTGTGGATG GTAAAGATGT CACACTAACC GTTGATGAAA CTATACAATA TTTTGCAGAA
AAAGTTGCAG AAGAAGGAAA AAAAGAATAT AAAGCAGATG GGGTTAGTAT AATAGTTATG
AATCCTAAAA ATGGAGAAAT ATTAGCTATG GCTAATAAAC CAGATTATAA TCCTAATGAA
CCTTATAAAG GGTATGAAAA TTTCCCTGGT AAAGATAAAA CTGAAAAGAT GGAAAATATG
TGGAAAAATG ATGCCGTATC AAATTCATTT GAGCCAGGAT CTATATTTAA AATGGTAACA
TCTTCAGCTG CCGTACAAGA AGGGATAGCA GGTGGAAATG AAACATATTT TTGTCCAGGT
GGAAAAAATG TATCTGGAAC TTATATAAAA TGTTGGAAGC CAGATGGGCA TGGAACTGAA
ACTTTTGACC AGATATTAGA AAACTCTTGT AACGTTGGGT TTATGGACAT AGGTCAAAAA
CTTGGAAAAG AAAAATTAAA TGAATATATA GAAAAGTTTG GATTTGGTAA GCAGACAGGA
ATAGATTTAC CTGGAGAAAC AACAGGTATA GTATTGCCAA ATGATAAAAT AGGTCCAGTA
GAACTTGCAA CCATATCATT TGGTCAAACA GATAGTGCTA GTTCAGTTCA AATGATGGCA
GCTATGAACA CAATTGCTAA TGGAGGAACA TGGATACAAC CTCATATAAT GAAGGAAATA
AGCCATGAAG ATACAAGTGG AGCAAGAGTA GTGGATAAAA CTTTTGTACC TAAAAAGATT
GATAATATAA TAGATCAAAA AACAGCTATG AGAGTTTCAG AGGCTCTAGA AAAAACTGTT
CATTTTGGAT CTCCTAAGAG AGCTTATATA GAGGGATATG GAATTGCAGG TAAAACTGGT
ACAGCTGAAA AGGTTAAGGC TAGTGGAGGA TATGGTGCAG GGTATGTAGC GTCCTTTGCT
GGATTTGCTC CATATAATGA TCCACAAGTT TCAGTTTTAA TATCTGTAGA TAATCCAAAG
GGGGAATACT TTGGAGGATT AGTAGCGGCA CCATTAGCTC ATGATTTATT TAGTGATATA
TTTAACTATA TGGAATTAGA TAGTTCAAAG ATAGACAAAA ATAAATCAAA AGAAGAGATT
CTGCCAGAAG TTAGAGGTAT GAGTTTAGAT AAGGCTAAAG CTATTTTAGA TAAAGATAAT
ATAAAATATT CTGTAGAAGA TGGTGGGAAT TCTGTAGTTG ATATGAATCC TAAGCCGGGA
TATACAATCA AAGAGGGAGA TGAAATTAAA CTTTACACTA AAACTACTTC AAATTATAAT
AAAGATGTTG TAGTACCAGA TTTTAATGGA CTTTCCATGG AAAAAGCAAA GGAAATTTTA
AACAAAATTG GCTTAAAGGG TACTTTTGCA GGAGAGGGAG TTATTAAGGA ACAAAGTGTA
GCTCAAGGTG ATGTAGTTAA AAGTGGTACA TCAATAGAAT TTAAATTAGA TAAGAAATAA
 
Protein sequence
MKKKKNKGFK DYKITKSLRF GRTYWTMIVV WGLLGVLTLR LSYVMIFKHK EYGSMAEEQW 
KNEIKIGAKR GEILDRNGAQ LAVSANVYRV DLDLKTLRED TFTKDDTDDT KKEKLNKIAG
ELGTVLDMPK EEVYDKINST LPSGLPATSV TLIRKIEKDK ADSAKNLKIR GVIVSQDTKR
YYPDNNFLAQ VLGRVDADGI GQGGIEREYN VELSGLPGMR ISEVARNSSG IPYSNSEFAK
PVDGKDVTLT VDETIQYFAE KVAEEGKKEY KADGVSIIVM NPKNGEILAM ANKPDYNPNE
PYKGYENFPG KDKTEKMENM WKNDAVSNSF EPGSIFKMVT SSAAVQEGIA GGNETYFCPG
GKNVSGTYIK CWKPDGHGTE TFDQILENSC NVGFMDIGQK LGKEKLNEYI EKFGFGKQTG
IDLPGETTGI VLPNDKIGPV ELATISFGQT DSASSVQMMA AMNTIANGGT WIQPHIMKEI
SHEDTSGARV VDKTFVPKKI DNIIDQKTAM RVSEALEKTV HFGSPKRAYI EGYGIAGKTG
TAEKVKASGG YGAGYVASFA GFAPYNDPQV SVLISVDNPK GEYFGGLVAA PLAHDLFSDI
FNYMELDSSK IDKNKSKEEI LPEVRGMSLD KAKAILDKDN IKYSVEDGGN SVVDMNPKPG
YTIKEGDEIK LYTKTTSNYN KDVVVPDFNG LSMEKAKEIL NKIGLKGTFA GEGVIKEQSV
AQGDVVKSGT SIEFKLDKK