Gene Saro_2054 details


Gene Information

Locus tag: Saro_2054
Symbol: dnaK
ID: 3917701
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2195435
End bp: 2197342
Gene Length: 1908 bp
Protein Length: 635 aa
Translation table: 11
GC content: 64%
IMG OID: 640444806
Product: molecular chaperone DnaK
Protein accession: YP_497327
Protein GI: 87200070
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0443] Molecular chaperone
TIGRFAM ID: [TIGR02350] chaperone protein DnaK
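
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes one terminal stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2195435
END_BP = 2197342
GENE_LENGTH_BP = 1908
PROTEIN_LENGTH_AA = 635


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Amino acid count for an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which is not translated.
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions the listed gene length (1908 bp) and protein length (635 aa) are mutually consistent: 1908 / 3 gives 636 codons, one of which is the stop codon.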


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAAAAG TTATCGGCAT CGACCTGGGT ACGACCAACA GCTGCGTTGC GGTGATGGAC 
GGGGGCACGC CCAAGGTCAT TGAGAACTCG GAAGGTGCGC GCACCACGCC GTCGATCGTC
GCCTTCACCA AGGATGGCGA GCGTCTGATC GGCCAGCCGG CCAAGCGCCA GGCGGTGACG
AACCCGGACA ACACGATTTT CGCGGTGAAG CGCCTCATCG GCCGCCGCTT TGACGATCCG
ATGACCCAGA AGGACACGGA ACTCGTCCCC TACACCATCA CCAAGGGCAA GAACGGCGAC
GCCTGGGTCA AGGCGGGCGG GCAGGACTAC AGCCCTTCGC AGATCTCGGC CTTCACCCTG
CAGAAGATGA AGGAAACCGC CGAGGCCTAT CTCGGCGAGA CCGTGACGCA GGCGGTGATC
ACCGTTCCGG CATACTTCAA CGACGCGCAG CGCCAGGCGA CCAAGGACGC CGGCCAGATC
GCGGGCCTCG AAGTGCTGCG CATCATCAAC GAGCCGACCG CGGCGGCGCT GGCCTATGGC
CTCGACAAGC AGGACGGCAA GACGATCGCG GTCTATGACC TTGGCGGCGG CACCTTCGAC
ATCTCGATCC TCGAGATCGG CGATGGCGTG TTCGAGGTGA AGTCGACCAA CGGCGACACC
TTCCTCGGCG GCGAAGACTT CGACACCGCG GTGGTCGAGT ATCTGGCGGA CAAGTTCAAG
GCCAAGGAAG GCATGGACCT GAAGACCGAC AAGCTCGCCC TGCAGCGCCT GAAGGAAGCG
GCGGAAAAGG CCAAGATCGA GCTTTCGTCG GCACAGACGA CCGAGATCAA CCTGCCGTTC
ATCACGGCCC GCATGGAAGG CGGCGCGACC ACGCCGCTGC ACCTGGTGGA AACCGTCACC
CGCGCCGACC TTGAAAAGCT TGTCGCCGGC CTGATCCAGC GCACGCTCGA TCCGTGCAAG
AAGGCGCTGG CCGATGCCGG CATCTCGGCC AAGGAGATCG ACGACGTCGT TCTCGTGGGC
GGCATGACCC GCATGCCCAA GGTCCGCGAA GTGGTGAAGG ACTTCTTCGG CAAGGAACCG
CACACCGGCG TGAATCCTGA CGAAGTCGTG GCGATGGGCG CGGCAATCCA GGCCGGCGTT
CTCCAGGGCG ACGTCAAGGA CGTGCTGCTT CTCGACGTGA CCCCGCTTTC GCTGGGCATC
GAGACGCTGG GCGGCATCAT GACCAAGATG ATCGACCGCA ACACCACGAT CCCGACCAAG
AAGAGCCAGG TCTATTCGAC TGCCGAGGAC AATCAGCAGG CGGTGACGAT CCGGGTCTTC
CAGGGCGAAC GCGAAATGGC GCAGGACAAC AAGCTCCTTG GCCAGTTCGA CCTCGTCGGC
ATCCCGCCCG CACGGCGCGG CGTGCCGCAG ATCGAGGTGA CGTTCGATAT CGACGCCAAT
GGCATCGTCA ACGTCTCGGC CAAGGACAAG GGCACCGGCA AGGAGCAGCA GATCCGCATC
CAGGCCTCGG GCGGTCTTTC GGACGCAGAC ATCGACCAGA TGGTCCGCGA TGCCGAGAAG
TTCGCTGAAG AGGACAAGAA GCGCCGTGCG GCGGCCGAGG CGAAGAACAA CGCCGAAAGC
CTGATCCATG CGACCGAGCG CCAGCTTGAG GAAAACGGGG ACAAGGTCGA CGCGGGCCTC
AAGGCCGAGA TCGAAGCGGC CATCGCCGAG GCGAAGACAG CCGTCGAGAG CGGCGACATC
GACGCCATGA ACGCCAAGGC GCAGGCCCTG ACGGACAAGG CCATGAAGAT GGGCCAGGCC
ATCTACGAGA AGGAGCAGGC AACTGCGGCT TCTCCGGGTG CCGAAGCCCC GAAGGCCGAC
GATGACGTCG TCGACGCCGA GTTCTCGGAA GTCGACGAGA ACAAGTGA
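
The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be joined before its length or GC content can be compared with the Gene Length and GC content fields. The sketch below shows one possible way to do that. It is an illustration rather than the tool that produced this record, and `join_blocks` and `gc_fraction` are hypothetical helper names.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from this record.
first_line = (
    "ATGGCAAAAG TTATCGGCAT CGACCTGGGT ACGACCAACA GCTGCGTTGC GGTGATGGAC"
)
joined = join_blocks(first_line)
print(len(joined))          # six blocks of 10 bases each
print(gc_fraction(joined))  # GC fraction of this line only, not the whole gene
```

Applied to the full gene sequence, the joined length should match the Gene Length field (1908 bp), and the GC fraction can be compared with the rounded GC content value listed above.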
 
Protein sequence
MAKVIGIDLG TTNSCVAVMD GGTPKVIENS EGARTTPSIV AFTKDGERLI GQPAKRQAVT 
NPDNTIFAVK RLIGRRFDDP MTQKDTELVP YTITKGKNGD AWVKAGGQDY SPSQISAFTL
QKMKETAEAY LGETVTQAVI TVPAYFNDAQ RQATKDAGQI AGLEVLRIIN EPTAAALAYG
LDKQDGKTIA VYDLGGGTFD ISILEIGDGV FEVKSTNGDT FLGGEDFDTA VVEYLADKFK
AKEGMDLKTD KLALQRLKEA AEKAKIELSS AQTTEINLPF ITARMEGGAT TPLHLVETVT
RADLEKLVAG LIQRTLDPCK KALADAGISA KEIDDVVLVG GMTRMPKVRE VVKDFFGKEP
HTGVNPDEVV AMGAAIQAGV LQGDVKDVLL LDVTPLSLGI ETLGGIMTKM IDRNTTIPTK
KSQVYSTAED NQQAVTIRVF QGEREMAQDN KLLGQFDLVG IPPARRGVPQ IEVTFDIDAN
GIVNVSAKDK GTGKEQQIRI QASGGLSDAD IDQMVRDAEK FAEEDKKRRA AAEAKNNAES
LIHATERQLE ENGDKVDAGL KAEIEAAIAE AKTAVESGDI DAMNAKAQAL TDKAMKMGQA
IYEKEQATAA SPGAEAPKAD DDVVDAEFSE VDENK