Gene Cag_1893 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1893 
SymboldnaK 
ID3746792 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2403753 
End bp2405666 
Gene Length1914 bp 
Protein Length637 aa 
Translation table11 
GC content46% 
IMG OID637774430 
Productmolecular chaperone DnaK 
Protein accessionYP_380186 
Protein GI78189848 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0443] Molecular chaperone 
TIGRFAM ID[TIGR02350] chaperone protein DnaK 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.457552 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAAAAA TTATCGGCAT TGACCTTGGC ACAACCAACT CCTGCGTGGC GGTTATGCAG 
GGTTCACAGC CAACAGTTAT TGAAAATTCT GAAGGTAACC GCACTACTCC ATCGGTTGTT
GGTTTTACAA AAACGGGCGA TCGCCTTGTA GGGCAAGCCG CAAAACGCCA AGCTATTACC
AACCCAAAGA ACACCATTTT CTCTATTAAG CGCTTTATGG GACGCCGCTT TGATGAGGTT
GGAGAAGAAA AGAAAATGGC GCCGTATGAG CTTATTAACG ATAGCGGTGA AGCTCGCGTA
AAAATCAACG ATAAGGTGTA CTCCCCACAA GAAATTTCAG CGATGGTGCT TCAAAAAATG
AAGCAGACGG CTGAAGATTT TCTTGGTGAA AAAGTAACCG AAGCTGTTAT TACCGTACCA
GCCTACTTTA ATGATGCCCA ACGCCAAGCA ACAAAAGATG CGGGACGTAT TGCAGGGCTT
GAAGTAAAGC GCATTATTAA CGAACCAACG GCTGCGGCAC TTGCTTATGG CCTCGATAAA
AAGAATGCCA GTGAAAAAGT TGCCGTTTTT GATCTCGGTG GTGGTACCTT CGACATTTCC
ATTCTTGAGC TTGGCGAAGG CGTTTTTGAA GTAAAATCCA CCGATGGCGA CACCCATCTT
GGTGGCGACG ACTTCGACCA AAAAATTATT GACTACATTG CTGAAGAGTT TAAAAAGCAA
GAAGGCATTG ACTTACGTAA AGATGCCATT ACGCTTCAGC GCTTGAAAGA AGCTGCTGAA
AAAGCAAAAA TTGAGCTTTC ATCGCGTAGT GATACGGAAA TCAATTTGCC CTTTATTACT
GCAACGCAAG AAGGTCCAAA GCACTTGGTG ATTAACCTTA CTCGGGCGAA GTTTGAAGCT
ATTTCGGCTG ATTTATTTAA TAAAGTGTTG GATCCATGCC GCCGTGCCGT AAAAAACGCT
AAAATTGAAA TGCGTGAAAT TGACGAGGTG GTGCTTGTTG GTGGTTCAAC CCGAATTCCA
AAAATTCAAG CTCTTGTAAA AGAGTTCTTT GGCAAAGAGC CAAACAAAAG CGTGAACCCC
GATGAAGTGG TAGCAATTGG AGCGGCTATT CAAGGTGGCG TGCTCAAAGG CGATGTTACC
GATGTGTTGC TGCTCGACGT TACTCCACTT TCGCTTGGTA TTGAAACGCT TGGTGGCGTT
ATGACAAAGC TGATTGAAGC AAACACCACC ATTCCAACCA AAAAGCAAGA GGTATTCTCA
ACCGCAAGTG ATAACCAAAC CTCGGTTGAA GTGCATGTGT TGCAAGGTGA ACGCCCAATG
GCTGCCGACA ACAAAACGCT TGGTCGCTTC CACCTTGGCG ATATTCCACC CGCACCTCGT
GGCATTCCAC AAGTTGAGGT AATTTTTGAT ATTGATGCTA ACGGCATTTT ACACGTATCA
GCTAAAGACA AAGCAACCGG TAAAGAGCAA AGCATCCGCA TTGAAGCCAG CAGCAAGCTC
AGTGATGCTG AAATCAATAA AATGAAGGAT GATGCTAAGC AACATGCTGA TGAGGATAAA
AAGCGCAAAG AGGAGATTGA TATTAAGAAT AGCGCCGATG CTCTGATTTT CAGCACCGAA
AAACAGCTTA CCGAACTTGG TGAGAAAATT CCAACCGATA AGAAAAGTGC ACTGGAAGGT
TCCCTCGACA AGCTCCGTGA TGCCTACAAA AACGGCACCA CCGAATCCAT TAAGAGCGCT
ATGGATGACC TCAACAGCCA ATGGAACAGC ATTGCCTCCG ACCTTTACCA ATCAGGCGCA
GGTGCAGCGC AAGCACAACC TGAAGCACCG CAAAACAGTG GTAGCAGCCA AAGCTCAGGT
GGCGATGGCG CTGTAAATGC CGAGTACGAA GTTATTAACG ATGACAAAAA GTAA
 
Protein sequence
MGKIIGIDLG TTNSCVAVMQ GSQPTVIENS EGNRTTPSVV GFTKTGDRLV GQAAKRQAIT 
NPKNTIFSIK RFMGRRFDEV GEEKKMAPYE LINDSGEARV KINDKVYSPQ EISAMVLQKM
KQTAEDFLGE KVTEAVITVP AYFNDAQRQA TKDAGRIAGL EVKRIINEPT AAALAYGLDK
KNASEKVAVF DLGGGTFDIS ILELGEGVFE VKSTDGDTHL GGDDFDQKII DYIAEEFKKQ
EGIDLRKDAI TLQRLKEAAE KAKIELSSRS DTEINLPFIT ATQEGPKHLV INLTRAKFEA
ISADLFNKVL DPCRRAVKNA KIEMREIDEV VLVGGSTRIP KIQALVKEFF GKEPNKSVNP
DEVVAIGAAI QGGVLKGDVT DVLLLDVTPL SLGIETLGGV MTKLIEANTT IPTKKQEVFS
TASDNQTSVE VHVLQGERPM AADNKTLGRF HLGDIPPAPR GIPQVEVIFD IDANGILHVS
AKDKATGKEQ SIRIEASSKL SDAEINKMKD DAKQHADEDK KRKEEIDIKN SADALIFSTE
KQLTELGEKI PTDKKSALEG SLDKLRDAYK NGTTESIKSA MDDLNSQWNS IASDLYQSGA
GAAQAQPEAP QNSGSSQSSG GDGAVNAEYE VINDDKK