Gene CPR_1432 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1432 
Symbol 
ID4206391 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1610586 
End bp1612646 
Gene Length2061 bp 
Protein Length686 aa 
Translation table11 
GC content27% 
IMG OID642565986 
ProductAraC family transcriptional regulator 
Protein accessionYP_698751 
Protein GI110803556 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00438113 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGAAAAG AATATGTTAA TTTCCCTTCA GACATACCTG TAACCATATC TTATGTAAAC 
ATAAAAAACT ATCCCTTGCA TTGGCATGAT GCAATAGAAA TACTTTATGT CCTTAAAGGC
AGCATAAAAG TAGATATAGA TACTGACAGT TATGAAATAC AAGAAGATGA AATTGAAATA
GTAAACACAG AACAAACTCA TAGAATTTAT TCTAATAAGG ATAACAGAGT TTTAATATTT
AAAATAGATC CACACTTTTT TGAAAAATAC TATAGCGACA TAGAAAATAT GTTTTTCTAT
ACTAATACCT CTGATGAAGG TGCTCAAAGT GATGAATCTT ACGATAAACT TAGAGTATTT
TTATCCATTA TATTATGTGA AGAAGCTCAA AAAGTAGATG ATTATGATAA ATATATAGAA
AAATCTCTAG TAGAGCTTTT ATTCCACCTT TTAAATAATT TCCACTATCT TTTATATGAT
AATGATGAAA TTCATGAGAA TAATATGCTC CTAGAAAGGT ACCATAGAAT TTCTAAGTAT
ATATATAATA ACTACAATAA GAATATAACC TTAAAAGATA TTGCTAATAC TGAGTTTCTT
AGTACGCACT ATCTTTCCCA TGAAATAAAA TATGCAACAG GACTAAGCTT TACAGACCTT
CTAAACTTAA CTAGAGTTGA AGAATCAGTA AAACTTCTTT TAGATACTGA TAAGTCCTTA
TCTGAAATAT CTTATGAAAT AGGCTTTTCT CATACTCGTT ATTTTAATAA ACATTTTAAG
GCCTATTATA ATTGCACTCC TCTTCAATTT AGAAAAAAAC ATAAAATAAG TGAAGAGGAA
TATAATAAAC AAAAAGAAAT AACATACTAT CCATTAACTG ATAGTCTTGA GGAACTTTCT
TATTACTTAG ATGATTACCC AAGATTTAAT TATGAAGATA AGATTCATAA GCTTACTTTT
AATATGAATA CTGAAGGCAC TGAATTTAAT AAGTACTTTA AAGAAGTTCT TAATGTAGGA
GATGCCTTTG ACTTATTATT AGAAGATAAC CAAGATATTG TTGAAGATCT TCAAGATCAT
ATAGGTTATA ACTATATAAG ACTTCTACAT GTATTCTCAT CAGATATGGG TATATTCCCT
GGTTCAAAAT TCTTTAACTG GACTAGAACC TTTGATATTT TTGAATATAT ATCTTCATTA
GACTTAATCC CTTTAATAGT ACTAGATGAT TTTGGATATT CTAAGGATAA TTTCTTAGAT
GTTATAAAAT CCTTCATAGA CTTTTTTAGC GAGGTAGAAA GTTTTGAATT AACAGATTTA
AAATTTCAAT TTACCTCTAC ATTTAATGAA GAACTAAAAA ATTCTTTAAT TGAATTATTT
GAATCTAAGG ATTTAACTTT AGTAAATGAA CTTTATACTC CAAATAATAA AATAGATTTA
ATATATGACA CAGCTTATAT GCTTCCATTT ATAATTCATA ATACTGTAAG CTCAGGAAGC
AAGTTAAACT TTATAAAAGC CTTTGATGCC TTAGATAGAC AAATTGATAT TACAAATGAA
GTTTTCTTTG GCTATCCTGC AATGGTAAAT GATAAAGGAA TTAAAAAGCC CTCTTATTAT
GCTTACTATT TTTTAAGCAA ACTAGGAGAC ACCCTTCTTT ATAAGGGAGA TGGATATATA
CTAACTAAGT CAGAGGATGA ATATCAGCTT CTTGTATATA CCTATAATGA TGAAATTAAT
TCCCTTATAG ATTTTAAAAA CTTTACTAAG TTAAGAGGGG TTAAAGACTT GGTAGATAAA
AAACTTTCCT TAAACTTACT TGATTTAGAT AGCGATGTTA GAATAACTAA ATACACCATA
GGGGAGAACT TTGGCTCTTC ATTTAATTAC TGGCTTTCCA TGGGAAAACC TAAAAGATTA
AGAAAAGCAG AAAAGGATAT ATTATTCCAA GCTTCCTATC CTAAAATAGA ATTTAAGTAC
GCTAAAAAAA ATACTATTTT AAATATACAA ACAACTCTTC AAGGATATTG TGCAGAACTA
TTTATACTAA AAAAAGTTTA A
 
Protein sequence
MRKEYVNFPS DIPVTISYVN IKNYPLHWHD AIEILYVLKG SIKVDIDTDS YEIQEDEIEI 
VNTEQTHRIY SNKDNRVLIF KIDPHFFEKY YSDIENMFFY TNTSDEGAQS DESYDKLRVF
LSIILCEEAQ KVDDYDKYIE KSLVELLFHL LNNFHYLLYD NDEIHENNML LERYHRISKY
IYNNYNKNIT LKDIANTEFL STHYLSHEIK YATGLSFTDL LNLTRVEESV KLLLDTDKSL
SEISYEIGFS HTRYFNKHFK AYYNCTPLQF RKKHKISEEE YNKQKEITYY PLTDSLEELS
YYLDDYPRFN YEDKIHKLTF NMNTEGTEFN KYFKEVLNVG DAFDLLLEDN QDIVEDLQDH
IGYNYIRLLH VFSSDMGIFP GSKFFNWTRT FDIFEYISSL DLIPLIVLDD FGYSKDNFLD
VIKSFIDFFS EVESFELTDL KFQFTSTFNE ELKNSLIELF ESKDLTLVNE LYTPNNKIDL
IYDTAYMLPF IIHNTVSSGS KLNFIKAFDA LDRQIDITNE VFFGYPAMVN DKGIKKPSYY
AYYFLSKLGD TLLYKGDGYI LTKSEDEYQL LVYTYNDEIN SLIDFKNFTK LRGVKDLVDK
KLSLNLLDLD SDVRITKYTI GENFGSSFNY WLSMGKPKRL RKAEKDILFQ ASYPKIEFKY
AKKNTILNIQ TTLQGYCAEL FILKKV