Gene CPR_0195 details

Gene Information

Locus tag: CPR_0195
Symbol:
ID: 4206435
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 242461
End bp: 244836
Gene Length: 2376 bp
Protein Length: 791 aa
Translation table: 11
GC content: 23%
IMG OID: 642564752
Product: sensory box histidine kinase
Protein accession: YP_697530
Protein GI: 110802656
COG category: [T] Signal transduction mechanisms
COG ID: [COG5002] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
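
The start, end, gene length, and protein length fields can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon (neither is stated in the record, though both are consistent with its numbers). The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 242461
END_BP = 244836
GENE_LENGTH_BP = 2376
PROTEIN_LENGTH_AA = 791


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive ends."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming it ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

If either assumption did not hold (for example, 0-based coordinates), the first assertion would be off by one, which makes this a quick sanity check when importing records like this one.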


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00184929
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATAATT CAATATTTTT TCATATTTTT AATGATGATA AAAATAAATT AGAAACTAAA 
AGAATAATAA AACTATCTAT TTTTATAGTT TTCTCTATAT TTTTTATATT AACACTAGAT
CTATCTTATA AAGTTCTTAT TAGAAAAAAT ATAGAGTTTA TTCCTAATAA TTCTATTCCA
AATTTCCCAT TAAGTCTATC ATTAATATTA GGAACAATGG CATACATAAG TTCATTAATA
TATTATTCAA GCACTAAAAA AGATGATTTT TTTATAATCT CTTTAATATA TATGAATTTA
TCTGTAGAAC TTTTAATTAC TAAAGGACAT AATCTAATAA TATTCGATAA GTTTATTTTT
ATACACGCAA TATTTAGGAT AATTTTGCTT TTTTATGTTG CCTTTAATAA GAAAGGAATA
TCACCTCTTA TTACTAAACA TAAAACAATT TCATCAGTAG CAGTATTTTT ATTTTCAGTT
ATAACACCTA TGATTAACTA TAAAATTTTT TCTAATAAGT TATTTACTAA AGATATTTAT
TTTTATGCTA CTTTAATGAC TATTATTATT ATCCTCTATA TAATTGCTTG CATATTTTTA
TCAAAGAAAT CTTTAGATGA TTGCGAGTTA ATATATTCAT TTATAATTGC TAGTATTCTT
TTAATAGCAT TAAGAGGATT ATACTGGATT TGTGAAGTAC TTCTTCCAAA TATAACACTT
TTAAAAACCA ATAATGTTGT TCTTCTACTT ACTATTCTGT CATTTTTATT GGCTATAAGT
GGAGTTTTTA ATGAAATTAC GACTAAAAAC AAAAAAAGTT CTTTACTACA AAATGAACTC
CAAGTTTTCT ATCATTTAGT TGAATTTAAT ACCAGTAGTT CTATAATTCT ATATGATAAT
AAAAAGAAGG TTATATATAC AAATAAAACA ATAAGAGAAC GCTATTGCAA ATCAACTGAA
TTAAAAGATC AACTTAAAGA GGTAGAAAAA TTATTTGTAG ATTCGATTTT TATAGATGAC
TCTGAAAAAA ATGCTACTAA ATCACTTTTT AATAAGGGCA ATTGGGAAGG TAAGATTATT
TTAAAAAACG ACAAAATAGT AAGTGCCTAC ATACAGATAT TAAATGTTGA AAATAAAAAT
TATTTTGCTG TGAATTTAAA AGATATAACT GAAGAATATA CCCTAACAAA AAATATTAAA
AGAAATGAAC AATTATTAAG TTGTATAAAT AATAACGTAC AGGATTTAAT AATAAGTGTT
GATAATAATG GTTTAATTAC ATATGTTAAT GATTCTGTAT TAGAAACATT AAATTATACC
TATGAAGAGA TCATAGGTAT GCCTATAATA AACCTTTTAG GTAAAAATGA TGAGATATTA
AATCAATTAA AACTAGAAGA TGAGGAAGAT AGTATTAAAT GTAAACTTGT TGGTAAACAC
TCTTTTGTTT ATGTAGAATC TATAATTAGA ACTTTAAGCG ATAATAATGA AATTCCTTAT
GGAAAAGTTA TAGTTGCAAA AAACTTAACA TCTAAAAAAC GTCTTGAAAA TTTAGCTATC
AAATTTAAGG AAGCTAAGGC TTATGAACAA ATAAGAAATG AATTTTTCGC CAATATATCA
CATGAACTTA GAACACCACT TAATATTATC TACTCTACAA TACAATTATT AAATTCTAAG
CATGAAACTA ACTATGTGAA TTTTAATGAT TTCTATGGCA AATATAAACA AGGTCTAAAA
ATAAATTGTT ATAGAATGCT TAGACTTATA AATAACCTTA TTGATGTTAG TAAAATCGAA
GTTGGATTTT TAAAAGCTGA TTTTACTAAT AGAGATATAG TATTTCTTGT AGAAAATATA
GTATCTTTGG TTATTCCTCA TTCTGAAAAT AAGGATATTA ATATAATCTT TGATACTAAT
GTTGAAGAAA ACATAATAAA ATGTGATCCT GTAAAAATTG AAAGATTAAT TCTTAACTTA
CTTTCAAATG CAATAAAATT CACCCAAAAT CATGGTAAAA TATTTGTGGA TTTAAACATC
TCAAAGGATT GGGTTAAAAT AAGCATAAAA GATAATGGAA TCGGTATTCC TAAAGAAATG
CAGGCATCAA TTTTTGATAG ATTTGTACAA GCTGATAAAT CTTTAAAAAG AAGAAATGAA
GGTAGTGGAA TAGGTCTTAG CATTGTAAAG TCTATTGCTG AACTACATGA TGGTAAAATT
GAACTTATAA GTGATGGAAT AAAAGGTTCA GAATTTATAG TATGGCTACC AAATGTAAAA
TTAAATTACA CAGAAGAAAG CAATAATTTA GTTGATTATA TAACAGATGA TAAAAATATA
GAGTTAGAGC TTTCTGATAT TTATGAAGTA CATTAA
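
The gene sequence above is printed in blocks of 10 bases, so the spacing has to be stripped before any computation such as the GC content reported in the record. The following is a rough sketch of that step; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGGATAATT CAATATTTTT TCATATTTTT AATGATGATA AAAATAAATT AGAAACTAAA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full sequence text instead of `first_line` would give the figure to compare against the GC content field.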
 
Protein sequence
MDNSIFFHIF NDDKNKLETK RIIKLSIFIV FSIFFILTLD LSYKVLIRKN IEFIPNNSIP 
NFPLSLSLIL GTMAYISSLI YYSSTKKDDF FIISLIYMNL SVELLITKGH NLIIFDKFIF
IHAIFRIILL FYVAFNKKGI SPLITKHKTI SSVAVFLFSV ITPMINYKIF SNKLFTKDIY
FYATLMTIII ILYIIACIFL SKKSLDDCEL IYSFIIASIL LIALRGLYWI CEVLLPNITL
LKTNNVVLLL TILSFLLAIS GVFNEITTKN KKSSLLQNEL QVFYHLVEFN TSSSIILYDN
KKKVIYTNKT IRERYCKSTE LKDQLKEVEK LFVDSIFIDD SEKNATKSLF NKGNWEGKII
LKNDKIVSAY IQILNVENKN YFAVNLKDIT EEYTLTKNIK RNEQLLSCIN NNVQDLIISV
DNNGLITYVN DSVLETLNYT YEEIIGMPII NLLGKNDEIL NQLKLEDEED SIKCKLVGKH
SFVYVESIIR TLSDNNEIPY GKVIVAKNLT SKKRLENLAI KFKEAKAYEQ IRNEFFANIS
HELRTPLNII YSTIQLLNSK HETNYVNFND FYGKYKQGLK INCYRMLRLI NNLIDVSKIE
VGFLKADFTN RDIVFLVENI VSLVIPHSEN KDINIIFDTN VEENIIKCDP VKIERLILNL
LSNAIKFTQN HGKIFVDLNI SKDWVKISIK DNGIGIPKEM QASIFDRFVQ ADKSLKRRNE
GSGIGLSIVK SIAELHDGKI ELISDGIKGS EFIVWLPNVK LNYTEESNNL VDYITDDKNI
ELELSDIYEV H
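
The protein sequence can be checked against the gene sequence by translating codons. The sketch below builds the codon table from the conventional TCAG ordering; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which start codons are permitted. The helper name `translate` is illustrative, and only the first line of the gene sequence is used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (standard code, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGGATAATT CAATATTTTT TCATATTTTT AATGATGATA AAAATAAATT AGAAACTAAA"
print(translate(first_line))  # MDNSIFFHIFNDDKNKLETK
```

The output matches the first 20 residues of the protein sequence above. This sketch does not handle alternative start codons, which table 11 allows; the gene here begins with ATG, so that case does not arise.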