Gene Sare_0112 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0112 
SymboldnaK 
ID5707040 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp126060 
End bp127895 
Gene Length1836 bp 
Protein Length611 aa 
Translation table11 
GC content67% 
IMG OID641269638 
Productmolecular chaperone DnaK 
Protein accessionYP_001535038 
Protein GI159035785 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0443] Molecular chaperone 
TIGRFAM ID[TIGR02350] chaperone protein DnaK 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000334679 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCACGTG CGGTCGGTAT CGACCTCGGC ACGACGAACT CCTGCGTCAG CGTCCTGGAG 
GGCGGTGAGC CCACCGTCAT CGCCAACGCG GAGGGCTCGC GGACGACCCC GTCGATCGTC
GCGTTCGCCC GTAACGGCGA GGTGCTCGTC GGTGAGGTCG CCAAGCGTCA GGCAGTGACC
AACCCGGACC GGACGATCCG GTCGGTCAAG CGGGAGGTCG GCACCAACTG GTCCGTCGAC
ATCGACGACA AGAAGTACAC GCCGCAGGAG ATCTCGGCTC GCACGCTGAT GAAGCTCAAG
CGGGACGCCG AGTCGTACCT GGGCGAGCAG ATCACCGATG CGGTGATCAC CGTTCCGGCG
TACTTCAACG ACGGTCAGCG ACAGGCCACC AAGGAGGCCG GTGAGATCGC CGGCTTCAAC
GTGCTGCGGA TCGTCAACGA GCCGACCGCG GCTGCCCTGG CGTACGGGCT GGACAAGGGC
TCCAAGGAGC AGACCGTACT GGTCTTCGAC CTCGGCGGCG GCACCTTCGA CGTGTCGCTG
CTGGAACTGG CCGAGGGTGT CATCGAAGTC AAGTCGACCA GCGGCGACAA CCTCCTCGGC
GGTGACGACT GGGACCAGCG GATCATCGAC CACCTGGTCA AGACCTTCAA CGGTGAGCAC
GGCATCGACC TGTCCCAGGA CAAGATGGCG ATGCAGCGGC TCAAGGAGGC GGCTGAGAAG
GCGAAGATCG AGCTGTCCGC CGCTGCCACC AGCAACATCA ACCTGCCGTA CATCACCGCC
GGCGCCGCCG GCCCGCTGCA CCTCGACGTG ACGCTCACCC GGGCCGAGTT CCAGCGGATG
ACGCAGGACC TGCTGGACCG GTGCAAGGGC CCGTTCGAGC AGGCCGTCAA GGACGCCGGG
ATCAAGGTCG CCGACGTCGA ACACGTCATC CTGGTCGGCG GCTCGACCCG GATGCCGGCC
GTGACCGAGT TGGTCAAGGA CCTCACCGGC AGGGACCCCA ACAAGGGCGT GAACCCGGAC
GAGGTCGTCG CCGTCGGTGC CGCCCTGCAG GCCGGTGTGC TCAAGGGTGA GGTCAAGGAT
GTCCTGCTGC TCGACGTGAC CCCGCTGAGC CTCGGAATCG AGACCAAGGG GGGCATCTTC
ACCAAGCTGA TCGAGCGCAA CACCACCATC CCGACCAAGC GCTCCGAGGT CTTCACCACG
GCAGACGACA ACCAGCCGTC AGTGCTGATT CAGGTCTTCC AGGGTGAGCG TGAGATCGCC
GCCTACAACA AGAAGCTCGG CACCTTCGAG CTGACCGGCC TGCCGCCGGC GCCGCGCGGT
ATGCCGCAGA TCGAGGTCAC CTTCGACATC GACGCCAACG GCATCGTGAA CGTGCACGCG
AAGGACCTCG GCACCGGCAA GGAGCAGAAG ATGACGGTCA CCGCCGGCTC CTCGCTGCCG
AAGGAGGACA TCGAGCGGAT GCGTCGGGAC GCCGAGGAGC ACGCCGAGGA GGACAAGCGG
CGTCGCGAGG AGGCGGAGAC CCGCAACCTG GCCGAGGCGC TCCAGTGGCA GACCGAGAAG
TTCCTCGCCG AGAGCGGCGA CAAGCTTCCC ACCGAGTCCC GGGACCAGAT CAACGAGGCG
CTTGGCGAGC TGCGCAGCGC ACTCGGTGGT CAGGACATCG AAAAGATCAA GTCGGCGCAC
GCGCAGCTGG CCCAGGTCTC CCAGCAGGCC GGCTCCCAGC TCTACACCCA GCAGGGTGAG
CAGGCCGGTG CGACCGGAGC CCAGGCCGGC GGTGCGCAGG CCGGTGGCCC GGACGACGTG
GTCGACGCGG AGATCGTGGA CGAGGACAAG AAGTGA
 
Protein sequence
MARAVGIDLG TTNSCVSVLE GGEPTVIANA EGSRTTPSIV AFARNGEVLV GEVAKRQAVT 
NPDRTIRSVK REVGTNWSVD IDDKKYTPQE ISARTLMKLK RDAESYLGEQ ITDAVITVPA
YFNDGQRQAT KEAGEIAGFN VLRIVNEPTA AALAYGLDKG SKEQTVLVFD LGGGTFDVSL
LELAEGVIEV KSTSGDNLLG GDDWDQRIID HLVKTFNGEH GIDLSQDKMA MQRLKEAAEK
AKIELSAAAT SNINLPYITA GAAGPLHLDV TLTRAEFQRM TQDLLDRCKG PFEQAVKDAG
IKVADVEHVI LVGGSTRMPA VTELVKDLTG RDPNKGVNPD EVVAVGAALQ AGVLKGEVKD
VLLLDVTPLS LGIETKGGIF TKLIERNTTI PTKRSEVFTT ADDNQPSVLI QVFQGEREIA
AYNKKLGTFE LTGLPPAPRG MPQIEVTFDI DANGIVNVHA KDLGTGKEQK MTVTAGSSLP
KEDIERMRRD AEEHAEEDKR RREEAETRNL AEALQWQTEK FLAESGDKLP TESRDQINEA
LGELRSALGG QDIEKIKSAH AQLAQVSQQA GSQLYTQQGE QAGATGAQAG GAQAGGPDDV
VDAEIVDEDK K