Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_0195 |
Symbol | |
ID | 4206435 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | - |
Start bp | 242461 |
End bp | 244836 |
Gene Length | 2376 bp |
Protein Length | 791 aa |
Translation table | 11 |
GC content | 23% |
IMG OID | 642564752 |
Product | sensory box histidine kinase |
Protein accession | YP_697530 |
Protein GI | 110802656 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG5002] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00184929 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAATT CAATATTTTT TCATATTTTT AATGATGATA AAAATAAATT AGAAACTAAA AGAATAATAA AACTATCTAT TTTTATAGTT TTCTCTATAT TTTTTATATT AACACTAGAT CTATCTTATA AAGTTCTTAT TAGAAAAAAT ATAGAGTTTA TTCCTAATAA TTCTATTCCA AATTTCCCAT TAAGTCTATC ATTAATATTA GGAACAATGG CATACATAAG TTCATTAATA TATTATTCAA GCACTAAAAA AGATGATTTT TTTATAATCT CTTTAATATA TATGAATTTA TCTGTAGAAC TTTTAATTAC TAAAGGACAT AATCTAATAA TATTCGATAA GTTTATTTTT ATACACGCAA TATTTAGGAT AATTTTGCTT TTTTATGTTG CCTTTAATAA GAAAGGAATA TCACCTCTTA TTACTAAACA TAAAACAATT TCATCAGTAG CAGTATTTTT ATTTTCAGTT ATAACACCTA TGATTAACTA TAAAATTTTT TCTAATAAGT TATTTACTAA AGATATTTAT TTTTATGCTA CTTTAATGAC TATTATTATT ATCCTCTATA TAATTGCTTG CATATTTTTA TCAAAGAAAT CTTTAGATGA TTGCGAGTTA ATATATTCAT TTATAATTGC TAGTATTCTT TTAATAGCAT TAAGAGGATT ATACTGGATT TGTGAAGTAC TTCTTCCAAA TATAACACTT TTAAAAACCA ATAATGTTGT TCTTCTACTT ACTATTCTGT CATTTTTATT GGCTATAAGT GGAGTTTTTA ATGAAATTAC GACTAAAAAC AAAAAAAGTT CTTTACTACA AAATGAACTC CAAGTTTTCT ATCATTTAGT TGAATTTAAT ACCAGTAGTT CTATAATTCT ATATGATAAT AAAAAGAAGG TTATATATAC AAATAAAACA ATAAGAGAAC GCTATTGCAA ATCAACTGAA TTAAAAGATC AACTTAAAGA GGTAGAAAAA TTATTTGTAG ATTCGATTTT TATAGATGAC TCTGAAAAAA ATGCTACTAA ATCACTTTTT AATAAGGGCA ATTGGGAAGG TAAGATTATT TTAAAAAACG ACAAAATAGT AAGTGCCTAC ATACAGATAT TAAATGTTGA AAATAAAAAT TATTTTGCTG TGAATTTAAA AGATATAACT GAAGAATATA CCCTAACAAA AAATATTAAA AGAAATGAAC AATTATTAAG TTGTATAAAT AATAACGTAC AGGATTTAAT AATAAGTGTT GATAATAATG GTTTAATTAC ATATGTTAAT GATTCTGTAT TAGAAACATT AAATTATACC TATGAAGAGA TCATAGGTAT GCCTATAATA AACCTTTTAG GTAAAAATGA TGAGATATTA AATCAATTAA AACTAGAAGA TGAGGAAGAT AGTATTAAAT GTAAACTTGT TGGTAAACAC TCTTTTGTTT ATGTAGAATC TATAATTAGA ACTTTAAGCG ATAATAATGA AATTCCTTAT GGAAAAGTTA TAGTTGCAAA AAACTTAACA TCTAAAAAAC GTCTTGAAAA TTTAGCTATC AAATTTAAGG AAGCTAAGGC TTATGAACAA ATAAGAAATG AATTTTTCGC CAATATATCA CATGAACTTA GAACACCACT TAATATTATC TACTCTACAA TACAATTATT AAATTCTAAG CATGAAACTA ACTATGTGAA TTTTAATGAT TTCTATGGCA AATATAAACA AGGTCTAAAA ATAAATTGTT ATAGAATGCT TAGACTTATA AATAACCTTA TTGATGTTAG TAAAATCGAA GTTGGATTTT TAAAAGCTGA TTTTACTAAT AGAGATATAG TATTTCTTGT AGAAAATATA GTATCTTTGG TTATTCCTCA TTCTGAAAAT AAGGATATTA ATATAATCTT TGATACTAAT GTTGAAGAAA ACATAATAAA ATGTGATCCT GTAAAAATTG AAAGATTAAT TCTTAACTTA CTTTCAAATG CAATAAAATT CACCCAAAAT CATGGTAAAA TATTTGTGGA TTTAAACATC TCAAAGGATT GGGTTAAAAT AAGCATAAAA GATAATGGAA TCGGTATTCC TAAAGAAATG CAGGCATCAA TTTTTGATAG ATTTGTACAA GCTGATAAAT CTTTAAAAAG AAGAAATGAA GGTAGTGGAA TAGGTCTTAG CATTGTAAAG TCTATTGCTG AACTACATGA TGGTAAAATT GAACTTATAA GTGATGGAAT AAAAGGTTCA GAATTTATAG TATGGCTACC AAATGTAAAA TTAAATTACA CAGAAGAAAG CAATAATTTA GTTGATTATA TAACAGATGA TAAAAATATA GAGTTAGAGC TTTCTGATAT TTATGAAGTA CATTAA
|
Protein sequence | MDNSIFFHIF NDDKNKLETK RIIKLSIFIV FSIFFILTLD LSYKVLIRKN IEFIPNNSIP NFPLSLSLIL GTMAYISSLI YYSSTKKDDF FIISLIYMNL SVELLITKGH NLIIFDKFIF IHAIFRIILL FYVAFNKKGI SPLITKHKTI SSVAVFLFSV ITPMINYKIF SNKLFTKDIY FYATLMTIII ILYIIACIFL SKKSLDDCEL IYSFIIASIL LIALRGLYWI CEVLLPNITL LKTNNVVLLL TILSFLLAIS GVFNEITTKN KKSSLLQNEL QVFYHLVEFN TSSSIILYDN KKKVIYTNKT IRERYCKSTE LKDQLKEVEK LFVDSIFIDD SEKNATKSLF NKGNWEGKII LKNDKIVSAY IQILNVENKN YFAVNLKDIT EEYTLTKNIK RNEQLLSCIN NNVQDLIISV DNNGLITYVN DSVLETLNYT YEEIIGMPII NLLGKNDEIL NQLKLEDEED SIKCKLVGKH SFVYVESIIR TLSDNNEIPY GKVIVAKNLT SKKRLENLAI KFKEAKAYEQ IRNEFFANIS HELRTPLNII YSTIQLLNSK HETNYVNFND FYGKYKQGLK INCYRMLRLI NNLIDVSKIE VGFLKADFTN RDIVFLVENI VSLVIPHSEN KDINIIFDTN VEENIIKCDP VKIERLILNL LSNAIKFTQN HGKIFVDLNI SKDWVKISIK DNGIGIPKEM QASIFDRFVQ ADKSLKRRNE GSGIGLSIVK SIAELHDGKI ELISDGIKGS EFIVWLPNVK LNYTEESNNL VDYITDDKNI ELELSDIYEV H
|
| |