Gene CPR_1334 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1334 
SymbolprkA 
ID4204517 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1503714 
End bp1505636 
Gene Length1923 bp 
Protein Length640 aa 
Translation table11 
GC content29% 
IMG OID642565888 
Productserine protein kinase 
Protein accessionYP_698654 
Protein GI110802501 
COG category[T] Signal transduction mechanisms 
COG ID[COG2766] Putative Ser protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.175734 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATTTTA AAGAGTTTAT AAAAAGTGAT AGGGAAAAAC ATAATAAGCA GAAGTTTAAG 
GGAACATTTC TTGATTATTT AGAAATTGTA AAAAATAATC CTCATGTTGC AAAACTTTCT
CATAAAAGGA TTTATGATCT TATTATGGAT AAGGGCTTTG AAATATTAAG ACCTGAGGAA
AATGCAAAAA TTAAAAAGAT ATATGGTAAT GAAAAAATAA AGAAATATAA CTTTTTTAAA
GAAGATTTTT ATGGAATTGA TACAGTTATT ATGAAATTAA TGAATTATTT TCATTCAGCT
TCAATGAGGG GAGAAGAAGC AAGACAAGTA TTATACTTAG TAGGACCTGT GGGAGCTGGT
AAATCCTCTT TAGTAGAATC AATTAAAAGA GTTTTAGAAA CAGCACCGCC AGTATATGTT
TTAGATGGAT GTCCTATGCA TGAAGAGCCA CTACATTTAA TACCTAAACA TTTAAGAAAA
GAATTTGAGA ATATGTTAGG TGTCGAGATT GAAGGAGATT TATGTCCAGT TTGTAAGTAT
AGATTATTAA ATGAATTTGG TGGAGAATAT GAAAAGTTTC CAGTTTCAAC AAAGAATTTC
TCAATTCGTT CTAGGAAAGG AATTGGAGTT GTACCTCCAG TAGATCCTAA TAATCAAGAT
ACCTCAATTT TAACAGGATC TATTGATATT TCTAAAATGG ACATGTATCC AGAGGATGAT
CCAAGGATAT TCTCATTAAA CGGTGCATTT AATGTTGGTA ATAGAGGGTT AGTTGAATTT
ATTGAAGTAT TTAAAAATGA TGTGGAATAT TTACACACCA TAATAACTGC AACTCAAGAA
AAATCTATTC CTTCACCAGG TAAGGGATCA ATGATTTATT TTGATGGGCT TATAATAGCT
CACTCTAATG AAGCTGAGTG GAATAAATTT AAATCAGATC ATACTAATGA AGCAATCTTA
GATAGAATAG TAAAAATTGA AGTTCCATAT TGTTTAGAGC TTAGTGAAGA AATTAAAATT
TATGAAAAAA TATTAAAGAA ATCTAATTTT GATGCTCATA TAGCTCCTCA TACCATGGAA
ATTGCTTCAA TGTTTGCTAT ATTAACAAGA TTGCTTCCTT CTATGAAAGT TGATCCAATT
ACTAAACTTA AACTTTATAA TGGAGAGGAA ATTGTTGAAA AGGGAACAAC AAAGAGAATT
GATGTTTTAG AGTTAAAAGA AGAAGCTGGA ACTATGGAAG GCATGAAGGG AATTTCAACA
AGATTTATTA TAAAGGCAAT AGATAATGCT CTATCAAATG CAGAGCATAA GTGTATAAAT
CCATTAAGTG TTATGGAAAG TATAATAAAA TCTGTTAAGG ATATGGATAT ATCTCAAGAG
GATAAGAAGA AATATTTAGG ATATATTCAA GATACAATAA GAAAAGAATA TAATAAGATT
CTTGAAAAAG AAATTACAAA GGCATTTATT CACTCTTTTA AGGAGCAAGC AGAAAGTTTA
TTTAATAATT ATATAGATAA TGCAGAGGCT TATGTAAATA AAACAAAACT TAAGGATTCT
TCAACTAGTG AAGAGTTAGA GCCAGATGAA GAATTTATGA GAAGCATAGA GGAACAAATA
GGAATATCAG AATCTTCTTC CAAGGGGTTT AGGGCAGATG TAACATCATA CATGTTCTAT
ATAGTTAGAA GTGGCGGAAA AATGGACTAT AGATCTTATG AACCATTAAA AGAAGCCATT
GAAAAGAAAC TAACAGCTTC AGTTAAAGAT TTATCAAGAA TAATAACTAA ATCTAGAGTA
AGAGATAAAG ATCAAGATAG AAAGTATAAT TCAATGGTAA ATCAAATGGA GAAAAACGGT
TATTGTCCAC ATTGCTGCGA TGTAATATTA AAATATGCAG CCAATAACCT ATGGAAGGAT
TAA
 
Protein sequence
MDFKEFIKSD REKHNKQKFK GTFLDYLEIV KNNPHVAKLS HKRIYDLIMD KGFEILRPEE 
NAKIKKIYGN EKIKKYNFFK EDFYGIDTVI MKLMNYFHSA SMRGEEARQV LYLVGPVGAG
KSSLVESIKR VLETAPPVYV LDGCPMHEEP LHLIPKHLRK EFENMLGVEI EGDLCPVCKY
RLLNEFGGEY EKFPVSTKNF SIRSRKGIGV VPPVDPNNQD TSILTGSIDI SKMDMYPEDD
PRIFSLNGAF NVGNRGLVEF IEVFKNDVEY LHTIITATQE KSIPSPGKGS MIYFDGLIIA
HSNEAEWNKF KSDHTNEAIL DRIVKIEVPY CLELSEEIKI YEKILKKSNF DAHIAPHTME
IASMFAILTR LLPSMKVDPI TKLKLYNGEE IVEKGTTKRI DVLELKEEAG TMEGMKGIST
RFIIKAIDNA LSNAEHKCIN PLSVMESIIK SVKDMDISQE DKKKYLGYIQ DTIRKEYNKI
LEKEITKAFI HSFKEQAESL FNNYIDNAEA YVNKTKLKDS STSEELEPDE EFMRSIEEQI
GISESSSKGF RADVTSYMFY IVRSGGKMDY RSYEPLKEAI EKKLTASVKD LSRIITKSRV
RDKDQDRKYN SMVNQMEKNG YCPHCCDVIL KYAANNLWKD