Gene CPR_1831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1831 
Symbol 
ID4206596 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2021314 
End bp2023533 
Gene Length2220 bp 
Protein Length739 aa 
Translation table11 
GC content32% 
IMG OID642566381 
Productstage V sporulation protein D 
Protein accessionYP_699146 
Protein GI110801826 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0768] Cell division protein FtsI/penicillin-binding protein 2 
TIGRFAM ID[TIGR02214] stage V sporulation protein D 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAAATA AGAAAAACAA AGGATTTAAA GATTATAAAA TAACAAAGTC ACTTAGATTT 
AAAAGAACAT ATTGGACCAT GATAGTAGTA TGGGGATTAC TAGGAGTATT AACCTTAAGA
TTATCATACA TAATGATTTT TAAGCATAAA GAATATGGTT CAATGGCAGA AGAACAATGG
AAACATGAAA TAAAAATAGG AGCTAAAAGA GGAGAAATAC TAGATAGAAA TGGAGCTCAA
TTAGCTGTTA GTGCTAATGT TTATAGGGTA GATTTAGATC TCAAAACGCT AAGAGAAGAT
ACATTTACTA AAGATGATAC AGATGATACA AAAAAAGAAA AACTTAATAA AATTGCAGGG
GAGTTAGGTA CTGCTTTAGG CATGCCTAAA GAAGAGGTAT ATGATAAGAT AAATAGTACT
TTTCCATCAG GATTGCCAGC GACATCTGTT ACTTTAATCA GAAAGATAGA AAAAGATAAA
GCTGATAGTG TCAAGAATTT AAAGATAAGA GGAGTAATAG TTTCACAAGA TACAAAGAGA
TATTACCCAG ATAATAATTT CTTAGCACAA GTATTAGGAA GAGTTGATGC AGATGGTATA
GGTCAAGGTG GTATAGAGCG TGAATATAAT GTGGAGTTAT CAGGTTTACC AGGAATGAGA
ATATCAGAAG TTGCAAGAAA TAGTAGTGGG ATACCTTATT CAAATTCTGA GTTTGCAAAA
CCTGTTGATG GTAAAGATGT CACACTAACC GTTGATGAAA CTATACAGTA TTTTGCAGAA
AAAGTTGCAG AAGAAGGAAA AAAAGAATAT AAAGCAGATG GGGTTAGTAT AATAGTTATG
AATCCTAAAA ATGGAGAAAT ATTAGCTATG GCTAATAAAC CAGATTATAA TCCTAATGAA
CCTTATAAAG GGTATGAAAA TTTCCCTGGT AAAGATAAAA CTGAAAAGAT GGAAAATATG
TGGAAAAATG ATGCCGTATC AAATTCATTT GAGCCAGGAT CTATATTTAA AGTGGTAACA
TCTTCAGCTG CCGTACAAGA GGGGATAGCA GGTGGAAATG AAACATATTT TTGTCCAGGT
GGAAAAAATG TATCTGGAAC TTATATAAAA TGTTGGAAGC CAGATGGGCA TGGAACTGAA
ACTTTTGATC AAATATTAGA AAACTCTTGT AACGTTGGGT TTATGGACAT AGGTCAAAAA
CTTGGAAAAG AAAAATTAAA TGAATATATA GAAAAGTTTG GATTTGGTAA GCAGACAGGA
ATAGATTTAC CTGGAGAAAC AACAGGTATA GTATTGCCAA ATGATAAAAT AGGTCCAGTA
GAACTTGCAA CCATATCATT TGGTCAAACA GATAGTGCTA GTTCAGTTCA AATGATGGCA
GCTATGAACA CAATTGCTAA TGGAGGAACA TGGATACAAC CTCATATAAT GAAGGAAATA
AGCCATGAAG ATCCAAGTGG AGCAAGAGTA GTTGATAAAA CTTTTGTACC TAAAAAAATT
AATAATATAA TAGATAAAAA AACTGCTATG AGAGTTTCAG AAGCTTTAGA AAAAACTGTT
CATTTTGGAT CTCCAAAGAG AGCTTATATA GAAGGATATG GGATTGCTGG TAAAACTGGT
ACAGCTGAAA AGGTTAAAGC TAGTGGAGGA TATGGTGCAG GGTATGTAGC GTCATTTGCT
GGATTTGCTC CATATAATGA TCCACAAGTT TCAATTTTAA TATCTGTAGA TAATCCAAAG
GGGGAGTACT TTGGAGGATT AGTAGCAGCA CCATTAGCTC ATGATTTATT TAGCGATATA
TTTAATTATA TTGAATTAGA TAGTTCAGAT TTAGATAAAA ATAAAGTAAA AGAAGAGATT
GTACCAGATG TTAGAGGTAT GAATTTAGAT AAAGCTAAAT CTATTTTAGA TAAAGACAAT
ATTAAATATT CTGTAGAAGA TGCTGGAAAT TCTGTAGTTG ATATGAATCC TAAGCCAGGA
TATACAATTA AAGAGGGAGA TGAAATTAAA CTTTACACTA AAACTACTTC AAATTATAAT
AAAGATGTTG TAGTACCAGA TTTTAATGGA CTTTCAATGG AAAAAGCAAA GGAAATTTTA
AACAAAATTG GCTTAAAGGG TACTTTTGCA GGAGAGGGAG TTATTAAGGA ACAAAGTGTA
GCTCAAGGTG ATGTAGTTAA AAGTGGTACA TCAATAGAAT TTAAATTAGA TAAAAAATAA
 
Protein sequence
MKNKKNKGFK DYKITKSLRF KRTYWTMIVV WGLLGVLTLR LSYIMIFKHK EYGSMAEEQW 
KHEIKIGAKR GEILDRNGAQ LAVSANVYRV DLDLKTLRED TFTKDDTDDT KKEKLNKIAG
ELGTALGMPK EEVYDKINST FPSGLPATSV TLIRKIEKDK ADSVKNLKIR GVIVSQDTKR
YYPDNNFLAQ VLGRVDADGI GQGGIEREYN VELSGLPGMR ISEVARNSSG IPYSNSEFAK
PVDGKDVTLT VDETIQYFAE KVAEEGKKEY KADGVSIIVM NPKNGEILAM ANKPDYNPNE
PYKGYENFPG KDKTEKMENM WKNDAVSNSF EPGSIFKVVT SSAAVQEGIA GGNETYFCPG
GKNVSGTYIK CWKPDGHGTE TFDQILENSC NVGFMDIGQK LGKEKLNEYI EKFGFGKQTG
IDLPGETTGI VLPNDKIGPV ELATISFGQT DSASSVQMMA AMNTIANGGT WIQPHIMKEI
SHEDPSGARV VDKTFVPKKI NNIIDKKTAM RVSEALEKTV HFGSPKRAYI EGYGIAGKTG
TAEKVKASGG YGAGYVASFA GFAPYNDPQV SILISVDNPK GEYFGGLVAA PLAHDLFSDI
FNYIELDSSD LDKNKVKEEI VPDVRGMNLD KAKSILDKDN IKYSVEDAGN SVVDMNPKPG
YTIKEGDEIK LYTKTTSNYN KDVVVPDFNG LSMEKAKEIL NKIGLKGTFA GEGVIKEQSV
AQGDVVKSGT SIEFKLDKK