Gene Mlg_1900 details


Gene Information

Locus tag: Mlg_1900
Symbol: dnaK
ID: 4270100
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2166125
End bp: 2168140
Gene Length: 2016 bp
Protein Length: 671 aa
Translation table: 11
GC content: 66%
IMG OID: 638126656
Product: molecular chaperone DnaK
Protein accession: YP_742734
Protein GI: 114321051
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0443] Molecular chaperone
TIGRFAM ID: [TIGR02350] chaperone protein DnaK
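
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that contributes no residue; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 2166125
END_BP = 2168140


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2016
print(protein_length(length_bp))  # 671
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.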


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.393588
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value: 0.969601
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGCCC GGCGCCGCGC CGGGGCAGGG TCCACGGACA ACAGCATTCA GGAGCATTTT 
ATGAGTAAGA TCATCGGCAT CGACCTGGGC ACCACCAACT CCTGCGTGGC CGTCATGGAC
GGTGGCAGCA CCCGGGTCAT CGAGAACAGC GAGGGCGATC GCACCACCCC CTCGGTGGTG
GCCTTCGCCG AAGACGGCGA GGTGCTCACC GGCGCGCCGG CCAAGCGCCA GGCGGTGACC
AACCCGGAAA ACACCGTGTT TGCGGTGAAG CGCCTGATCG GCCGCCGCTT CGAAGAGGAC
GTGGTGCAGC GCGACGTGCG CGAGATGCCC TACAAGATCG TCAAGGCCGA TAACGGGGAC
GCCTGGGTGG AGGTGCGCGG CAAGAAGATG GCGCCGCCGG AGATCTCCGC CCGCACCCTG
CAGAAGATGA AAAAGACCGC CGAGGACTAC CTGGGCGAGA CCGTCACCGA GGCGGTCATC
ACTGTCCCGG CCTACTTCAA CGACTCTCAG CGCCAGGCCA CCAAGGACGC CGGTAAGATC
GCCGGGCTGG AGGTCAAGCG CATCATCAAC GAGCCCACCG CGGCGGCCCT GGCCTACGGT
CTGGACAAGA AGGGCGGCGA TCGCAAGGTG GCCGTCTACG ACCTGGGCGG CGGCACCTTC
GACATCTCCA TCATCGAGAT CGCCGAGGTG GACGGCGAGC ACCAGTTCGA GGTGCTGGCC
ACCAACGGTG ACACCTTCCT GGGCGGCGAG GACTTCGACC GGGCGGTCAT CGACTACCTG
ATCGCCGAGT TCAAGAAGGA CCAGGGCATC GACCTGGGCG GCGACCGCCT GGCCATGCAG
CGCCTCAAGG AGGCCGCTGA GAAGGCCAAG ATTGAGCTCT CGTCGGCCCA GCAGACCGAA
GTGAATCTGC CCTACATCAC GGCCGATCAG GCCGGCCCGA AGCACCTGGC CATCAAGCTC
ACCCGGGCCA AGCTGGAGTC GCTGGTCGAG GGCCTGATCA AGCGCACCAT CGAGCCCTGC
AAGGTGGCCC TGAAGGATGC CGGCCTGTCC GCCAGCGATG TGGACGAGGT GATCCTGGTG
GGCGGCCAGA CCCGCATGCC CAAGGTCCAG GAGGCCGTCA CCCAGTTCTT CGGCAAGGAG
CCGCGCAAGG ACGTCAACCC GGACGAGGCC GTGGCCGTGG GTGCCGCCAT CCAGGGCGGT
GTGCTGGGCG GTGACGTCAA GGACGTGCTG CTGCTGGATG TCACCCCGCT CTCCCTGGGC
ATCGAGACCC TGGGCGGGGT GATGACCAAG CTGATCGAGA AGAACACCAC CATCCCGACC
AAGGCCTCGC AGACCTTCTC CACGGCCGAG GACAACCAGG GGGCGGTGAC GGTGCACGTG
CTCCAGGGTG AGCGCGAGAT GGCCAAGGAC AACAAGAGCC TGGGCCGCTT CGACCTGACC
GACATCCCGC CGGCACCGCG CGGCGTGCCG CAGATCGAGG TCACCTTCGA CATCGACGCC
AACGGCATCC TGCACGTCTC CGCCAAGGAC AAGGCCACCG GCAAGGAGAA CAAGATCGTC
ATCAAGGCCT CCTCCGGTCT GTCCGAGGAG GAGATCGAGA ACATGGTCAA GGACGCCGAG
GCCCACGCCG AGGAGGACCG CAAGGCGCGC GAGTTGGTGG AGGCCCGCAA CCAGGCCGAT
AACATGATCC ATGCCACCAA CAAGTCGCTG AGCGAGTTCG GCGACAAGAT CGACTCCGGC
GAGAAGCAGT CCATCGAGGA GGCGATCAAG GAGCTGGAAG AGGCCATGAA GGGCGATGAC
AAGGAGGCCA TCGAGGCTAA GACCCAGCAG CTGGCCGAGC GCTCCGGCAA GCTGGCCGAG
AAGATGTACG CCGCCCAGGG TGGCGAGGAG GCCGCGGAGC AGGCGGCCGG CGGCGAGCAG
CAGCAGGCCG GTGGCAGCGG CAAGTCCGAG GACGACGTGG TCGACGCGGA GTTCGAGGAG
GTCAAGGATC AGGACGAGGA CAAGGACCGC AAGTAA
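
The sequence is displayed in space-separated blocks of ten bases, so it has to be cleaned before any computation. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; the helper names are hypothetical, and only the first row of the sequence is used here to keep the example short, so the printed value describes that row rather than the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied as displayed above.
first_row = "ATGGCAGCCC GGCGCCGCGC CGGGGCAGGG TCCACGGACA ACAGCATTCA GGAGCATTTT"

seq = clean_sequence(first_row)
print(len(seq))
print(f"{100 * gc_fraction(seq):.1f}% GC")
```

Passing the complete gene sequence through the same two functions gives the figure to compare with the GC content field of the record.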
 
Protein sequence
MAARRRAGAG STDNSIQEHF MSKIIGIDLG TTNSCVAVMD GGSTRVIENS EGDRTTPSVV 
AFAEDGEVLT GAPAKRQAVT NPENTVFAVK RLIGRRFEED VVQRDVREMP YKIVKADNGD
AWVEVRGKKM APPEISARTL QKMKKTAEDY LGETVTEAVI TVPAYFNDSQ RQATKDAGKI
AGLEVKRIIN EPTAAALAYG LDKKGGDRKV AVYDLGGGTF DISIIEIAEV DGEHQFEVLA
TNGDTFLGGE DFDRAVIDYL IAEFKKDQGI DLGGDRLAMQ RLKEAAEKAK IELSSAQQTE
VNLPYITADQ AGPKHLAIKL TRAKLESLVE GLIKRTIEPC KVALKDAGLS ASDVDEVILV
GGQTRMPKVQ EAVTQFFGKE PRKDVNPDEA VAVGAAIQGG VLGGDVKDVL LLDVTPLSLG
IETLGGVMTK LIEKNTTIPT KASQTFSTAE DNQGAVTVHV LQGEREMAKD NKSLGRFDLT
DIPPAPRGVP QIEVTFDIDA NGILHVSAKD KATGKENKIV IKASSGLSEE EIENMVKDAE
AHAEEDRKAR ELVEARNQAD NMIHATNKSL SEFGDKIDSG EKQSIEEAIK ELEEAMKGDD
KEAIEAKTQQ LAERSGKLAE KMYAAQGGEE AAEQAAGGEQ QQAGGSGKSE DDVVDAEFEE
VKDQDEDKDR K
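
The relationship between the two sequences can be checked by translating the gene sequence codon by codon. The sketch below builds the codon table from the conventional TCAG ordering; translation table 11 assigns the same amino acids to codons as the standard code and differs mainly in which start codons it allows, which does not matter here because the gene starts with ATG. The function and variable names are illustrative, and only the first and last rows of the gene sequence are translated.

```python
from itertools import product

# Amino acids for all 64 codons in TCAG order (codon assignments shared by
# the standard code and translation table 11); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last rows of the gene sequence, copied as displayed above.
first_row = "ATGGCAGCCC GGCGCCGCGC CGGGGCAGGG TCCACGGACA ACAGCATTCA GGAGCATTTT"
last_row = "GTCAAGGATC AGGACGAGGA CAAGGACCGC AAGTAA"

print(translate(first_row))  # MAARRRAGAGSTDNSIQEHF
print(translate(last_row))   # VKDQDEDKDRK
```

Both outputs match the corresponding ends of the protein sequence listed above. The last row can be translated on its own because the 33 rows before it hold 60 bases each, so it begins in frame.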