Gene CPF_0198 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0198 
Symbol 
ID4203663 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp240450 
End bp242825 
Gene Length2376 bp 
Protein Length791 aa 
Translation table11 
GC content24% 
IMG OID638081082 
Productsensory box histidine kinase 
Protein accessionYP_694661 
Protein GI110799765 
COG category[T] Signal transduction mechanisms 
COG ID[COG5002] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00155091 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATAACC CAATATTTTT TCATATTTTT AATGATGATA AAAATAAATT AGAAACTCAA 
AGAATAATAA AACTATCCAT TTTTATAGTT TTCTCTATAT TTTTTATATT GCTAATAGAT
TTATCCTATA AAGTGCTTAT TAGAAAAAAT ATAGAGTTTA TTCCTGGTAA TTCTATGCCA
AGTTTCTCAT TAAGTTTATC ATTAATATTA GGAACAATGG CATACATAAG TTCATTAATA
TACTATTCAA GCACCAAAAA AGATGATTTT TTTATAATCT CTTTAATATA TATGAATTTA
TCTGTAGAAC TTTTAATTAC TAAAGGACAT AACCTAATAA TATTCGATAA GTTTATTTTT
ATACACGCAA TATTTAGGAT AATTTTGCTT TTTTATGTTG CCTTTAATAA GAAAGGAATA
TCCCCTCTTA TTACTAAACA TAAAATAATT ACATCAATAG TAGTTTTTTT ATTTTCAGTT
ATAACACCTA TGATTAACTA TAGAATTTTT TCTAACAATT TATTTGCTAA AGATATTTAT
TTTTATGCTA CTTTAATGAC TATGATTATT ATCCTATACA TAATCGCTTG CATATTCTTA
TCAAAGAAAT CTTTAGATGA TTGTGAGTTA ATATATTCAT TTATAATTGC TAGTATTCTT
TTAATAGCCC TTAGAGGATT ATATTGGATT TGTGAAGTAC TTCTTCCAAA TATAACACTT
TTAAAAACCA ATAATGTTGT TCTTCTACTT ACCATACTAT CATTTTTATT GGCTATAAGT
GGAGTTTTTA ATGAAATTAC AGCTAAAAAC AAAAGAAGCT CTTTACTACA AAATGAACTT
CAAGTTTTTT ATCACTTAGT TGAATTTAAT ACTAGTAGTT CTATAATTTT ATATGATAAT
AAAAAGAAGG TTATATATAC AAACAAGACA ATAAGAGAAC GCTACTGCAA ATCAACTAAA
TTAAAAGATC AACTTAAAGA GGTAGAAAAA TTATTTGTAG ATTCGATTTT TATAGATGAC
TCTGAAAAAA ATGCTACTAA AGCACTTTTT AATAAGGGCA ACTGGGAAGG TAAGCTTATT
TTAAAAAATG GCAAAATAGT AAGTGCCTAC ATACAGATAT TAAATGTTGA AAATAAAAAT
TATTTTGCTG TAAATTTAAA AGATATAACC GAAGAATATA CCCTAACAAA AAATATTAAA
AGAAATGAAC AATTATTAAG TTGTATAAAT AATAACGTAC AGGATTTAAT AATAAGTGTT
GATAATAATG GTTTAATTAC ATATGTTAAT GATTCTGTAT TAAAAACATT AAATTATACC
TATGAAGAAA TTATAGGAAT GCCTATAATA AACCTTTTAG GTAAAAATGA TGAGATATTA
AATCAATTAA AACTAGAAGA TGAGGAAGAT AGTATTAAAT GTAAACTTGT TGGTAAACAT
TCCTTTGTAT ATGTAGAATC TATAATTAGA ACTTTAAATG ATAATAATGA AATTCCTTAT
GGAAAAGTTA TAGTTGCAAA AAACTTAACC TCTAAAAAAC GTCTTGAAAA TTTAGCTATA
AAATTTAAAG AAGCTAAGGC TTATGAACAA ATAAGAAATG AATTTTTCGC CAATATATCA
CATGAGCTTA GAACACCACT TAATATTATC TATTCTACAA TACAGTTATT AAATTCTAAG
CATGAAACTG ACCCTATGGA CTTTAATAAC TTCTATGATA AATATAAGCA AGGTCTTAAG
ATAAATTGTT ATAGAATGCT TAGACTTATA AATAACCTTA TTGATGTTAG TAAAATTGAA
GTTGGATTTT TAAAAGCTGA TTTTACTAAT AGAGATATAG TATTCCTTGT AGAAAATATA
GTATCTTTGG TTATTCCTCA TTCTGAAAAT AAGGATATTA ATATAATCTT TGATACTAAT
GTTGAAGAAA ACATAATAAA ATGTGATCCT GTAAAAATTG AAAGATTAAT TCTTAACTTA
CTTTCAAACG CAATAAAATT CACCCAAAAT CATGGTGAAA TATTTGTAGA TTTAAACATC
TCAAAGGATT GGGTTAAAAT AAGCATAAAA GATAATGGAA TTGGTATTCC CAAAGAAATG
CAAGCATCAA TTTTTGATAG ATTTGTACAA GCTGATAAAT CCTTAAAAAG AAGAAATGAA
GGTAGTGGAA TAGGTCTTAG CATTGTAAAG TCTATTGCAG AACTGCATGA TGGTAAAATT
GAACTTATAA GTGATGGAAT AAAAGGTTCA GAATTTATAG TATGGCTACC AAATGTAAAA
TTAAATTACA CAGAAGAAAG CAATAATTTA GTTGATTATA TAACAGATGA TAAAAATATA
GAGTTAGAGC TTTCTGATAT TTATGAAGTA CATTAA
 
Protein sequence
MDNPIFFHIF NDDKNKLETQ RIIKLSIFIV FSIFFILLID LSYKVLIRKN IEFIPGNSMP 
SFSLSLSLIL GTMAYISSLI YYSSTKKDDF FIISLIYMNL SVELLITKGH NLIIFDKFIF
IHAIFRIILL FYVAFNKKGI SPLITKHKII TSIVVFLFSV ITPMINYRIF SNNLFAKDIY
FYATLMTMII ILYIIACIFL SKKSLDDCEL IYSFIIASIL LIALRGLYWI CEVLLPNITL
LKTNNVVLLL TILSFLLAIS GVFNEITAKN KRSSLLQNEL QVFYHLVEFN TSSSIILYDN
KKKVIYTNKT IRERYCKSTK LKDQLKEVEK LFVDSIFIDD SEKNATKALF NKGNWEGKLI
LKNGKIVSAY IQILNVENKN YFAVNLKDIT EEYTLTKNIK RNEQLLSCIN NNVQDLIISV
DNNGLITYVN DSVLKTLNYT YEEIIGMPII NLLGKNDEIL NQLKLEDEED SIKCKLVGKH
SFVYVESIIR TLNDNNEIPY GKVIVAKNLT SKKRLENLAI KFKEAKAYEQ IRNEFFANIS
HELRTPLNII YSTIQLLNSK HETDPMDFNN FYDKYKQGLK INCYRMLRLI NNLIDVSKIE
VGFLKADFTN RDIVFLVENI VSLVIPHSEN KDINIIFDTN VEENIIKCDP VKIERLILNL
LSNAIKFTQN HGEIFVDLNI SKDWVKISIK DNGIGIPKEM QASIFDRFVQ ADKSLKRRNE
GSGIGLSIVK SIAELHDGKI ELISDGIKGS EFIVWLPNVK LNYTEESNNL VDYITDDKNI
ELELSDIYEV H