Gene CPR_2136 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_2136 
Symbol 
ID4205317 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2364922 
End bp2367084 
Gene Length2163 bp 
Protein Length720 aa 
Translation table11 
GC content30% 
IMG OID642566686 
Producttranscription accessory protein 
Protein accessionYP_699443 
Protein GI110803195 
COG category[K] Transcription 
COG ID[COG2183] Transcriptional accessory protein 
TIGRFAM ID[TIGR00426] competence protein ComEA helix-hairpin-helix repeat region 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACAATA TAAATCATAT TTTATCAAAG GAACTTGGTA TTTCATTAAA ACAGGTAATA 
AGTGTTATAG AAATGTTAGA TGAAGGAAAT ACAGTACCTT TCATAGCTAG ATATAGAAAA
GAAAGAACTG GTGGATTAAC AGATGAGGTT CTAAGAAAAT TTAATGAGAG ATTAACATAT
CTAAGAAATT TAGAAAGTAG AAAAGAAGAT GTTTTAAGAA TAATAGAGGA ACAAGAAAAA
TTAACTCCAG AATTAAAACT GAATATTGAG AAAGCTACTA CTCTTACTGA GGTTGAAGAT
ATATATAGAC CTTTTAAAGC TAAAAAGAGA ACAAGAGCTA CTATGGCTAT TGAAAAGGGA
TTAAAGCCAC TAGCTGAGCT TATTCTATCA GGAGAATTTA ATGGTGATAT AGTAGAAGAG
GCTAATAAAT ATATTAATGA AGAAAAAGGT GTTAAAAATG AAGAGGAAGC TCTTCAAGGG
GCTATGGATA TTATAAGTGA AATAATATCT GATAATGCTG ATTATAGAAA ATGGATTAGA
AATTTTGTTC AAAAGGATGG AATAATTCAG GTAAAAGGAA GCAGTGAAGA ACAAACACCT
TATGAAATGT ATTATGATTA TAAGGAACCT GTTAGAACAA TTCCATCTCA TAGAATATTA
GCTATAAATA GAGGAGAAAA GGAAAAAATT CTTTCTGTTA AAGTAACTTG TAATGATGAT
AAAATTATAG ATTATTTAAA TAAAAAAGTT TTAAAGGGAA ATAAAATTAC TGATAAGTAT
TTAGAAGAAA GTATAAAGGA CTCCTTTAAA AGACTTATAT ATCCTTCAAT AGAAAGGGAA
ATAAGAAGTG AGTTAACCTC TAAGGGAGAA GAGGGAGCCA TTGATATATT TAAGGCTAAT
TTAAAAGCTC TTTTAATGCA AGCGCCTATT AAGGGGAAAG TTGTAATGGG ATTTGACCCT
GGATTTAGAA CTGGATGTAA GGTTGCAATC TTAGACGAAA CAGGAAAATT TGTTGAGAAT
ACAACAGTTT ATCCAACAGC CCCTCAAAAT AGAATTGATG AAACAATAAG TACACTTAAA
AAACTTATTA AAAAACATGG AGTTCAAGTT ATTTCTTTAG GAAATGGAAC AGCTTCAAGA
GAATCAGAAG AAGTAATTGC AAAAATGCTT AAGGAAATAA AAGATGAAAC AGGAAAAGAG
TTATTCTATG TTATAGTTTC TGAGGCAGGA GCTTCTGTTT ATTCAGCATC AGAACTTGCA
AATAAGGAAT ATCCAGACTT AGATGTAACT GTAAGAGGCG CAATTTCTAT AGGAAGAAGA
CTTCAAGATC CATTAGCTGA GCTTGTTAAA ATAGATCCTA AGGCTATAGG AGTAGGACAA
TATCAACATG ATGTAACTCA GAAAAAACTT GATGAATCCT TAGCAGGGAT AGTTGAGGAT
TGTGTTAATA ATGTAGGAGT AGATTTAAAT ATAGCAACTC CATCACTATT AAGTTATATT
TCAGGTATAA ATGCTTCAAT AGCTAAAAAT ATTGTTGATT ATAGAGAAGA AAATGGTAAG
TTTAAAAGTA GAAAAGAACT TTTAAAAGTT AAAAGATTAG GACAAAAAGC TTATGAACAA
TGTGCAGGAT TCTTAAGAGT TATGGAAAGT AAAGAAGCTT TAGATAATAC CTCAGTTCAT
CCAGAGTCAT ATGGGGTAGC TAAGGAACTT ATAAAAACTT TAGGATATAC AGAAGAAGAT
TTAAAAAATG GCAAATTAGT AGATATAGAT GAGAGAGTAA AAGCAAAAGG AATTTCTAAC
TTAGCAAAAG AATTAGAAGT TGGAGAACCA ACACTTAATG ATATTATAAA GGAAATTAAA
AAGCCTGGAA GAGATCCAAG AGAGGAATTA CCTAAACCAA TATTTAAGTC TGGCGTAATA
GAAATGAAAG ATTTAAAACC AGGTATGATT TTAATGGGGA CAGTTAGAAA TGTATCTGAT
TTTGGTGCTT TTGTAGATAT TGGAGTTCAC CAAGATGGAT TAGTTCATAA GAGCCAAATG
GCAGATAGAT TTGTTAAACA TCCACTTGAT ATAGTTAAGG TTGGAGATAT AGTAGAAGTT
AGAATATTAG ATGTTGATTT AAAAAGAAAG AGAATTTCAT TATCAATGAA AAAAGAAGGT
TAA
 
Protein sequence
MDNINHILSK ELGISLKQVI SVIEMLDEGN TVPFIARYRK ERTGGLTDEV LRKFNERLTY 
LRNLESRKED VLRIIEEQEK LTPELKLNIE KATTLTEVED IYRPFKAKKR TRATMAIEKG
LKPLAELILS GEFNGDIVEE ANKYINEEKG VKNEEEALQG AMDIISEIIS DNADYRKWIR
NFVQKDGIIQ VKGSSEEQTP YEMYYDYKEP VRTIPSHRIL AINRGEKEKI LSVKVTCNDD
KIIDYLNKKV LKGNKITDKY LEESIKDSFK RLIYPSIERE IRSELTSKGE EGAIDIFKAN
LKALLMQAPI KGKVVMGFDP GFRTGCKVAI LDETGKFVEN TTVYPTAPQN RIDETISTLK
KLIKKHGVQV ISLGNGTASR ESEEVIAKML KEIKDETGKE LFYVIVSEAG ASVYSASELA
NKEYPDLDVT VRGAISIGRR LQDPLAELVK IDPKAIGVGQ YQHDVTQKKL DESLAGIVED
CVNNVGVDLN IATPSLLSYI SGINASIAKN IVDYREENGK FKSRKELLKV KRLGQKAYEQ
CAGFLRVMES KEALDNTSVH PESYGVAKEL IKTLGYTEED LKNGKLVDID ERVKAKGISN
LAKELEVGEP TLNDIIKEIK KPGRDPREEL PKPIFKSGVI EMKDLKPGMI LMGTVRNVSD
FGAFVDIGVH QDGLVHKSQM ADRFVKHPLD IVKVGDIVEV RILDVDLKRK RISLSMKKEG