Gene CPR_2005 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_2005 
SymboldnaK 
ID4205662 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2210616 
End bp2212475 
Gene Length1860 bp 
Protein Length619 aa 
Translation table11 
GC content34% 
IMG OID642566555 
Productmolecular chaperone DnaK 
Protein accessionYP_699314 
Protein GI110802609 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0443] Molecular chaperone 
TIGRFAM ID[TIGR02350] chaperone protein DnaK 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAAAA TAATCGGTAT AGATTTAGGA ACAACAAATT CATGCGTAGC TGTTATGGAA 
GGTGGAGAAC CAGTAGTTAT CACTAACTCA GAAGGTGCTA GAACAACTCC ATCAGTAGTT
TCATTCCAAG CAAACGGAGA AAGATTAGTA GGTCAAGTTG CTAAAAGACA AGCAATAACA
AATCCTGAAA AAACAATAAT GTCAATCAAA AGACATATGG GAACTGACTA TAAAGTTAAT
ATAGATGGAA AAGATTATAC ACCACAAGAA ATATCAGCAA TGATACTTCA AAAATTAAAA
GCAGATGCAG AAGCTTACTT AGGAGAAAAA GTAACAGAAG CTGTTATCAC AGTTCCAGCT
TACTTCAATG ATGCTGAAAG ACAAGCAACT AAGGATGCTG GTAGAATCGC TGGTTTAGAT
GTTAAGAGAA TAATAAACGA ACCAACAGCT GCATCATTAG CTTATGGATT AGATAAAATG
GATAGTGCTC ACAAAATCTT AGTATATGAC CTAGGTGGTG GTACTTTCGA TGTATCTATC
TTAGACTTAG GAGATGGAGT ATTTGAAGTT GTATCAACAA ACGGAGATGC TAGATTAGGT
GGAGATGACT TCGACCAAAG AATTATAGAT TATATAGCAG AAGATTTTAA AGCTCAAAAT
GGAATTGATT TAAGACAAGA TAAAATGGCT CTTCAAAGAT TAAAAGAAGC TGCTGAAAAA
GCTAAAATTG AGTTATCATC ATCAACTCAA ACATTAATCA ACTTACCATT TATAACTGCT
GATGCAACTG GTCCAAAACA CATAGATATG ACATTAACAA GAGCTAAATT CAATGAATTA
ACTCATGACT TAGTTGAAAG AACAATAGAC ATAATGAAAG AAGCCTTAAA ATCAGGTAAT
GTTTCATTAA ATGATATAGA TAAAGTAATC TTAGTTGGTG GATCAACAAG AATACCAGCA
GTTCAAGAAG CTGTTAAAAA CTTTACTGGA AAAGAACCTT CAAAAGGAGT TAACCCAGAT
GAGTGCGTAG CAATGGGTGC TGCTATCCAA GCTGGTGTAT TAACTGGTGA TGTTAAAGAT
GTATTATTAT TAGATGTTAC TCCATTAACA TTAGGAATCG AAACTTTAGG AGGAGTTGCA
ACTCCATTAA TCGAAAGAAA TACAACTATC CCTGCAAGAA AGAGCCAAAT ATTCTCAACT
GCAGCAGATA ACCAAACTTC AGTTGAAATT CACGTAGTAC AAGGTGAAAG ACAAATGGCA
GCTGATAACA AAACTTTAGG TAGATTTACT CTATCAGGAA TTGCTCCAGC TCCAAGAGGA
ATCCCTCAAA TAGAAGTTGC TTTCGATATA GATGCTAACG GTATAGTTAA AGTTTCAGCA
ACTGATAAAG CTACTGGAAA AGAAGCTAAC ATTACAATCA CAGCTTCAAC TAACTTAAGC
GATGCTGAAA TAGATAAGGC TGTAAAAGAA GCAGAACAAT TTGCTGAAGA AGATAAGAAG
AGAAAAGAAG CTATAGAAGT TAAAAATAAT GCTGAGCAAA TTGTTTACCA AACAGAAAAA
ACTTTAAATG AACTTGGCGA TAAAGTTTCA GCTGAAGAAA AATCAGAAAT AGAAGCTAAA
ATCGAAGAAG TTAAAAAAGT TAAAGATGGT GACGATATAG AAGCTATCAA GAAAGCTATG
GAAGATTTAA CTCAAGCATT CTACAAAATA TCAGAAAAAT TATACCAACA AAATGGTGGA
GCACAAGGTG AAGGGTTCGA TCCAAACAAC ATGGGTGGAG CTAATGCTGG AACAGGTGCT
GCAAATAGCA ACGATGACAA TGTTGTAGAT GCTGATTTCG AAGTTCAAGA TGATAAATAA
 
Protein sequence
MSKIIGIDLG TTNSCVAVME GGEPVVITNS EGARTTPSVV SFQANGERLV GQVAKRQAIT 
NPEKTIMSIK RHMGTDYKVN IDGKDYTPQE ISAMILQKLK ADAEAYLGEK VTEAVITVPA
YFNDAERQAT KDAGRIAGLD VKRIINEPTA ASLAYGLDKM DSAHKILVYD LGGGTFDVSI
LDLGDGVFEV VSTNGDARLG GDDFDQRIID YIAEDFKAQN GIDLRQDKMA LQRLKEAAEK
AKIELSSSTQ TLINLPFITA DATGPKHIDM TLTRAKFNEL THDLVERTID IMKEALKSGN
VSLNDIDKVI LVGGSTRIPA VQEAVKNFTG KEPSKGVNPD ECVAMGAAIQ AGVLTGDVKD
VLLLDVTPLT LGIETLGGVA TPLIERNTTI PARKSQIFST AADNQTSVEI HVVQGERQMA
ADNKTLGRFT LSGIAPAPRG IPQIEVAFDI DANGIVKVSA TDKATGKEAN ITITASTNLS
DAEIDKAVKE AEQFAEEDKK RKEAIEVKNN AEQIVYQTEK TLNELGDKVS AEEKSEIEAK
IEEVKKVKDG DDIEAIKKAM EDLTQAFYKI SEKLYQQNGG AQGEGFDPNN MGGANAGTGA
ANSNDDNVVD ADFEVQDDK