Gene Arth_3808 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3808 
SymboldnaK 
ID4447734 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4292276 
End bp4294144 
Gene Length1869 bp 
Protein Length622 aa 
Translation table11 
GC content64% 
IMG OID639691632 
Productmolecular chaperone DnaK 
Protein accessionYP_833283 
Protein GI116672350 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0443] Molecular chaperone 
TIGRFAM ID[TIGR02350] chaperone protein DnaK 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCACGTG CAGTAGGTAT CGACCTCGGA ACCACTAACT CCGTCGTCTC CGTTCTCGAA 
GGTGGCGAGC CCACCGTTAT TGCCAACGCC GAGGGTGGCC GCACCACGCC GTCCGTCGTT
GCGTTCTCCA AGTCCGGCGA AGTCCTGGTC GGCGAGATCG CCAAGCGCCA GGCCGTCAAC
AACATCGATC GCACCATCGC CTCCGTCAAG CGCCACATGG GCACTGACTG GAACGTCGCC
ATCGATGACA AAAAGTACAC CGCGCAGGAA ATCTCCGCGC GCATCCTGAT GAAGCTGAAG
AACGACGCCG AGTCCTACCT CGGCGAAAAG GTCACCGATG CCGTGGTCAC CGTTCCCGCG
TACTTCAACG ACGCCCAGCG CCAGGCCACC AAGGAAGCCG GCGAAATCGC GGGCCTCAAC
GTCCTCCGCA TTGTCAACGA GCCCACCGCG GCGGCGTTGG CCTACGGCCT GGACAAGGGC
AAGGAAGATG AACTCATCCT CGTCTTCGAC CTCGGCGGCG GAACCTTCGA CGTCTCCCTG
CTGGAAGTAG GCAAGGACGA AGACAACTTC TCCACCATCC AGGTGCGCTC CACCGCCGGT
GACAACCACC TCGGCGGCGA CGACTGGGAC CAGCGCGTGG TCAACTACCT GCTGAACCAG
CTCAAGGTCA AGGGCATCGA CCTCTCCAAG GACAAGATCG CCCTGCAGCG CCTCCGTGAA
GCTGCCGAGC AGGCCAAGAA GGAACTTTCC TCCTCCACGA GCACCAACGT CTCCCTCCAG
TACCTCTCCG TCACCCCCGA CGGCCCGGTC CACCTGGACG AGCAGCTGAC CCGTGCCAAG
TTCCAGGACC TCACCAAGGA CCTCCTGGAC CGCACCAAGA AGCCGTTCCA GGACGTCATC
AAGGAAGCCG GCATCAAGCT CTCCGAGATC GACCACATCG TGCTCGTGGG CGGTTCCACC
CGCATGCCCG CCGTGTACGA GCTCGTCAAG GAACTCGCCG GCGGCAAGGA GCCCAACAAG
GGCGTCAACC CGGATGAGGT CGTCGCGGTC GGCGCCGCAC TGCAGGCAGG TGTCCTGAAG
GGTGAGCGCA AGGACGTGCT GCTGATCGAC GTCACCCCGC TGTCCCTCGG CATTGAAACC
AAGGGCGGTG TCATGACGCA CCTGATCGAG CGCAACACGG CCATCCCCAC CAAGCGGTCC
GAGACCTTCA CCACGGCTGA CGACAACCAG CCGTCCGTGG CCATCCAGGT CTTCCAGGGT
GAGCGCGAGT TCACCCGCGA CAACAAGCCG CTGGGCACGT TCGAGCTGAC CGGCATCGCT
CCGGCTCCGC GCGGCGTCCC GCAGGTCGAG GTTACCTTCG ACATCGACGC CAACGGCATC
GTCCACGTCT CGGCGAAGGA CAAGGGCACC GGCAAGGAAC AGTCCATGAC CATCACCGGT
GGCACCGCGC TCTCCAAGGA AGACATTGAC CGCATGGTCA AGGACGCCGA GGAGCACGCA
GCCGAGGACA AGGCCCGCCG CGAGGCAACC GACACCCGCA ACACCGCGGA GCAGCTCGCC
TACTCCGTGG ACAAGCTGAT CGCGGACAAC GCCGACAAGC TGCCTGAAGA GGTCAAGACC
GAGGTCAAGG CCGACGTCGA CGCCCTCAAG AAGGCCCTCG AAGGCACCGA TGACGCCGCC
GTGAAGACCG CCTTTGAGAA GCTGCAGGCT TCCCAGAGCA AGCTCGGCGA AGCCATCTAC
GCGCAGGCCG GTTCGCCGGA CGGTGCCACG GGTCCTGCAG GCGCAGAGAG TGCCGCAGGC
TCTGAAGGTG CCAAGGCTGA TGAGGACATC GTCGACGCCG AGATCGTTGA CGAAGACGAG
AAGAAGTAA
 
Protein sequence
MSRAVGIDLG TTNSVVSVLE GGEPTVIANA EGGRTTPSVV AFSKSGEVLV GEIAKRQAVN 
NIDRTIASVK RHMGTDWNVA IDDKKYTAQE ISARILMKLK NDAESYLGEK VTDAVVTVPA
YFNDAQRQAT KEAGEIAGLN VLRIVNEPTA AALAYGLDKG KEDELILVFD LGGGTFDVSL
LEVGKDEDNF STIQVRSTAG DNHLGGDDWD QRVVNYLLNQ LKVKGIDLSK DKIALQRLRE
AAEQAKKELS SSTSTNVSLQ YLSVTPDGPV HLDEQLTRAK FQDLTKDLLD RTKKPFQDVI
KEAGIKLSEI DHIVLVGGST RMPAVYELVK ELAGGKEPNK GVNPDEVVAV GAALQAGVLK
GERKDVLLID VTPLSLGIET KGGVMTHLIE RNTAIPTKRS ETFTTADDNQ PSVAIQVFQG
EREFTRDNKP LGTFELTGIA PAPRGVPQVE VTFDIDANGI VHVSAKDKGT GKEQSMTITG
GTALSKEDID RMVKDAEEHA AEDKARREAT DTRNTAEQLA YSVDKLIADN ADKLPEEVKT
EVKADVDALK KALEGTDDAA VKTAFEKLQA SQSKLGEAIY AQAGSPDGAT GPAGAESAAG
SEGAKADEDI VDAEIVDEDE KK