Gene CPR_1954 details


Gene Information

Locus tag: CPR_1954
Symbol:
ID: 4204118
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 2158913
End bp: 2160787
Gene Length: 1875 bp
Protein Length: 624 aa
Translation table: 11
GC content: 19%
IMG OID: 642566504
Product: sensor histidine kinase
Protein accession: YP_699264
Protein GI: 110801543
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
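
The coordinate and length fields above are arithmetically related. The following is a minimal Python sketch of that relationship. It assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is an untranslated stop codon. Both assumptions are consistent with the listed values, but the record itself does not state them.

```python
# Values copied from the Gene Information fields above.
start_bp = 2158913
end_bp = 2160787

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")    # 1875 bp
print(protein_length, "aa")  # 624 aa
```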


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000131402
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTATTTAA AACTTAGTAA AGTTAAAAAT TTAATTAATA TTATTTTAGT TATAGCAACT 
ATTTTGTATA TTGTATATTT TTTTAGACCA CTTACAGCAT TTTATTTACC AATAATAATT
GAAACTATAT TGGTGTTTAG TATAACATTT ATTGCTACAA CTGTATTTAA ATTTTCTAAT
AAAAAAATTT TTAAATTAAT AAGTATGTAT TTGCTGTTAA TTGCTATATT AAAATTTATA
TCACTTGTTT ATTTAATGTT TTTCTTTGGT GAAAAAGAAA TAATTCAGTG TTTAAATTTA
AAAATATTAG CTATTTGTAA TGTTTTAGAA AGTATTTATT GCTTCATGAT AATAGTATAT
GTAAAATATG TAAATTTAAA TATGAAATTA ATAGTTGTTT TTTTTACTAT AATATCCTTA
GTTAGTATAT GTTTATTTGA TGTAATTGTG TTTGGGGTGA TATCTACAGT ACTTCTTTTG
CTTATATTAT ATCTTTTAAA AGATTTTAAG ATTTTTAAGA ATAATCTATT AAATTACTTA
AAGTTATTCA TTATAATAAA TCTATCTTTA GTATTAATAT ATGCTCTAGT ATATATTTTT
AATTTAGAAG CTTTGCTATT GTTAATATAT ATATTAAAAA TATTAATATA TCTTATAATA
TACGTTTGGA TTTCTGAAAT GCTTATTACA AAACCATATA AAATTTTACA TCAAGATATA
ATTGATAAAA ATAAAAAACT TAAAATATTA AATAAAAAAA TGGAAGAGAG TAATAATGAA
TTTAAAGAAT TCAAGGAAAA GTTAAAAGAT AATGAAACTT ATTTTAAAAA TTTTATAAAC
AACGCTCCTA TGTCCATAGT AATATTAAAT AATCATAATT ATAGAATATT TTCCATAAAT
AAACAATTTT CAAATGAATT AAATTTAAAA AATAATAGAC AAGTAATAAA TAGAAATTTA
TTTAAAATAA TAAATATTGA AAATAAAGAC GAGTTTTTAA TAACAAAAAA AGGAGAAGCT
TCATATTCTT CAGGTAAAAT TGATATTTTT TGGCAATTAA ATATTTTATT ACAAACAAAA
GATTATTTAA TAATTTCAAT GAAAAATATA ACAGAATTTA AATTTTCAGA AAAGATAAAA
TTTGATTTAG GAAAAAGAAA ATTATCAGAA AAAATAAAAA ATGATTTTTT ATCAAGCATA
TCTCATGACT TAAAGACACC TATAAATGTC ATATATTCAT CAGTACAAGT TCAGGAGAAG
TTTTATGTTG ATGAAAACAT AAATAAAATA GGTCATTATA ATCAAGTTAA TAAGGAAAAT
TGTATTACAT TAATGAGATT AGCTAATAAT CTGATAGATT CATCAAAAAT TGATTATGAT
TATCTAAAGC CAAATTTTAA AGTTTATAAC ATAGTAACTT TAGTAGAAGA TTCTATATTA
AATTTAGCTG AGTATATAAA AGAAAAAAAA TTGACTTATA TATTTGATAC AGATGAAGAA
GAGTTATATG TTAAATGCGA TCAAGAATTC ATTCAAAGAA TTATTTTAAA TTTAATTTCT
AATTCAATAA AATATACCAA AAGAGGTGGA ATAAGAGTAC AAATAAAATC GAATAGAAAC
AAGGTTATTA TTGATTTTAC AGATACTGGA GAAGGTATGG ATAAAGATTT TATAGAAAAA
GCTTTTTTAA GATATAGTAA AGGAAATAAA AAAGGAATAA AAAATAAAAG TACAGGAATT
GGATTATATA TTGTTAAGAG TTTAGTTGAA CTACAAAATG GAACTATATA TATAAATTCT
ATTAAGGATG CAGGAACTAA TGTTAGATTA GAGTTTAGTA GGGAGAAAAA TTGTGGAATA
CAAAGTGGAA TATGA
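
A GC content figure like the one listed under Gene Information can be derived from a sequence in this space-separated layout. The sketch below shows one way to do it in Python. The helper name `gc_content` is hypothetical, and the record does not say how its own value was computed or rounded. The example runs only on the final line of the gene sequence, not on the whole gene.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base grouping and line breaks) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)


# Final line of the gene sequence above, ending in the TGA stop codon.
tail = "CAAAGTGGAA TATGA"
print(f"{gc_content(tail):.1%}")
```

Passing the full gene sequence as a single string gives the fraction for the whole gene.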
 
Protein sequence
MYLKLSKVKN LINIILVIAT ILYIVYFFRP LTAFYLPIII ETILVFSITF IATTVFKFSN 
KKIFKLISMY LLLIAILKFI SLVYLMFFFG EKEIIQCLNL KILAICNVLE SIYCFMIIVY
VKYVNLNMKL IVVFFTIISL VSICLFDVIV FGVISTVLLL LILYLLKDFK IFKNNLLNYL
KLFIIINLSL VLIYALVYIF NLEALLLLIY ILKILIYLII YVWISEMLIT KPYKILHQDI
IDKNKKLKIL NKKMEESNNE FKEFKEKLKD NETYFKNFIN NAPMSIVILN NHNYRIFSIN
KQFSNELNLK NNRQVINRNL FKIINIENKD EFLITKKGEA SYSSGKIDIF WQLNILLQTK
DYLIISMKNI TEFKFSEKIK FDLGKRKLSE KIKNDFLSSI SHDLKTPINV IYSSVQVQEK
FYVDENINKI GHYNQVNKEN CITLMRLANN LIDSSKIDYD YLKPNFKVYN IVTLVEDSIL
NLAEYIKEKK LTYIFDTDEE ELYVKCDQEF IQRIILNLIS NSIKYTKRGG IRVQIKSNRN
KVIIDFTDTG EGMDKDFIEK AFLRYSKGNK KGIKNKSTGI GLYIVKSLVE LQNGTIYINS
IKDAGTNVRL EFSREKNCGI QSGI