Gene Caul_0004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0004 
SymboldnaK 
ID5897716 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5803 
End bp7698 
Gene Length1896 bp 
Protein Length631 aa 
Translation table11 
GC content65% 
IMG OID641560487 
Productmolecular chaperone DnaK 
Protein accessionYP_001681640 
Protein GI167643977 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0443] Molecular chaperone 
TIGRFAM ID[TIGR02350] chaperone protein DnaK 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0827361 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAGA TTATCGGTAT CGACCTTGGC ACCACGAATT CGTGCGTGGC CATCATGGAC 
GGCAAGACCC CGAAGGTGAT CGAGAACGCC GAGGGCGCTC GCACCACCCC GTCGGTGGTG
GCCTTTCTCG AGGACGGCGA ACGCCTTGTC GGCCAGCCGG CCAAGCGCCA GGCCGTCACC
AACCCGACCA ACACCCTTTT CGCGATCAAG CGCCTGATCG GCCGTAACTT CGCCGATCCC
GTCGTGGCCA AGGACAAGGC CATGGTCCCC TACGAGATCG TCAAGGGTCC GACCGGCGAC
GCCTGGGTCA AGGCCCACGG CAAGGACTAC AGCCCGCAGG AAGTCTCCGC CTTCATCCTG
CAGAAGATGA AGGAAGCGGC CGAGAGCCAT CTGGGCGAGC CGGTGACCAA GGCGGTCATC
ACCGTTCCGG CCTATTTCAA CGACGCCCAG CGTCAGGCGA CCAAGGACGC CGGCAAGATC
GCCGGCCTGG AAGTCCTGCG CATCATCAAC GAGCCGACCG CGGCCGCCCT GGCCTACGGC
CTGGAAATGA ACGAAGGCAA GAAGATCGCC GTCTACGACC TGGGCGGCGG CACCTTCGAC
GTCTCGGTCC TGGAAATCGG CGACGGCGTC TTCGAAGTGA AGTCGACCAA CGGCGACACC
TTCCTGGGCG GCGAGGACTT CGACCTGCGG ATCGTCGACT ACCTGGCCGA CGAGTTCAAG
AAGGAGCAGG GCGTCGACCT GCGCAAGGAC AAGCTGGCCC TGCAGCGTCT GCGCGAAGAG
GCTGAAAAGG CCAAGAAGGA GCTGTCCTCG ACGGCTCAGT ACGAAGTCAA CCTGCCCTTC
ATCTCGATGA ACGCGTCGGG TCCGCTGCAT CTGAACATCA AGCTGTCGCG CTCCAAGCTC
GAAGCCCTGG TGGAAGACCT GATCACGCGC ACCATCGGTC CGTGCGAACA GGCCCTCAAG
GACGCCGGCC TGAAGAAGAG CGACATCGAC GAAGTGATCC TGGTCGGCGG CATGAGCCGC
ATGCCCAAGG TCCAGCAGGC GGTGCAGGAC TTCTTCGGCC GCGAGCCGCA CAAGGGCGTG
AACCCTGACG AAGTCGTGGC CCTGGGCGCC GCCGTTCAGG CCGGCGTGCT GCAAGGCGAC
GTCAAGGACG TGCTGCTGCT GGACGTGACC CCTCTGACCC TGGGCATCGA GACCCTGGGC
GGCGTGTTCA CCCCGCTGAT CGAGCGCAAC ACCACCATCC CGACCAAGCG CTCGCAGACC
TTCTCGACCG CCGACGACAA CCAGTCGGCG GTGACGATCC GCGCCTTCCA GGGCGAGCGT
CCGATGGCCG TCGACAACAA GTTCCTGGGT CAGTTCGACC TGCAGGGCAT TCCGCCGGCG
CCGCGCGGCG TGCCGCAGAT CGAGGTCACC TTCGACATCG ACGCCAACGG CATCGTCAAC
GTCCACGCCA AGGACAAGGC GACCAACAAG GAGCACTCGA TCCGCATCCA GGCCAACGGC
GGCCTGAGCG ACGCGGACAT CGAGCGTATG GTCAAGGAAG CCGAGGCCAA CAAGGCTTCG
GACGAGAAGA AGAAGGCGCT GGTCGAGGCC AAGAACCAGG GCGAGGCCAT CGTGCACTCG
ACCGAGAAGG CCTTCGCCGA ACACGGCGAC AAGATCGGCG GGGCCGAGAA GACCGCGATC
GAGACCGGCC TGACCGATCT GAAGGCGGCC CTGGAAGGCG AGGACGTCGA GGCCATCCAG
GCCAAGACCC AGGCCCTGAT CCAGGCGTCG ATGAAGCTCG GCGAAGCGAT GTACGGCGCC
CAGCAAGGCG CCGACGGCGG CGAGGAAGCC GCCCACGATG ACGGCGTCGT CGACGCCGAA
TTCGAGGAAG TCGACGACTC CAAGCCGTCG GCGTGA
 
Protein sequence
MSKIIGIDLG TTNSCVAIMD GKTPKVIENA EGARTTPSVV AFLEDGERLV GQPAKRQAVT 
NPTNTLFAIK RLIGRNFADP VVAKDKAMVP YEIVKGPTGD AWVKAHGKDY SPQEVSAFIL
QKMKEAAESH LGEPVTKAVI TVPAYFNDAQ RQATKDAGKI AGLEVLRIIN EPTAAALAYG
LEMNEGKKIA VYDLGGGTFD VSVLEIGDGV FEVKSTNGDT FLGGEDFDLR IVDYLADEFK
KEQGVDLRKD KLALQRLREE AEKAKKELSS TAQYEVNLPF ISMNASGPLH LNIKLSRSKL
EALVEDLITR TIGPCEQALK DAGLKKSDID EVILVGGMSR MPKVQQAVQD FFGREPHKGV
NPDEVVALGA AVQAGVLQGD VKDVLLLDVT PLTLGIETLG GVFTPLIERN TTIPTKRSQT
FSTADDNQSA VTIRAFQGER PMAVDNKFLG QFDLQGIPPA PRGVPQIEVT FDIDANGIVN
VHAKDKATNK EHSIRIQANG GLSDADIERM VKEAEANKAS DEKKKALVEA KNQGEAIVHS
TEKAFAEHGD KIGGAEKTAI ETGLTDLKAA LEGEDVEAIQ AKTQALIQAS MKLGEAMYGA
QQGADGGEEA AHDDGVVDAE FEEVDDSKPS A