Gene Francci3_4352 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4352 
SymboldnaK 
ID3907324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5197168 
End bp5199018 
Gene Length1851 bp 
Protein Length616 aa 
Translation table11 
GC content68% 
IMG OID637881683 
Productmolecular chaperone DnaK 
Protein accessionYP_483427 
Protein GI86743027 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0443] Molecular chaperone 
TIGRFAM ID[TIGR02350] chaperone protein DnaK 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.971406 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGAAA GGCACACAGA CATGGCACGA GCGGTCGGTA TCGACCTCGG CACCACGAAC 
TCCGTCGTGA GCGTGCTCGA GGGCGGCGAA CCCACGGTGA TCGCCAACGC CGAGGGCTCC
CGGACGACCC CGTCGGTCGT GGCTTTCGCG AAGAACGGCG AGGTCCTGGT CGGCGAGGTC
GCCAAGCGTC AGGCGGTGAC GAACGTCGAG CGCACCATCC GGTCGGTCAA ACGCCACATG
GGCACCGACT GGAAGATGAA GGTCGACGCG AAGGATCTCA CGCCCCAGGA GATCAGCGCG
TTCATCCTGC AGAAGCTGAA GCGGGACGCC GAGTCCTACC TCGGTGAGAC GGTTACGGAC
GCGGTCATCA CCGTCCCGGC GTACTTCGAC GACGCCCAGC GTCAGGCCAC CACCGAGGCG
GGCACGATCG CCGGGCTCAA CGTCCTGCGG ATTGTCAACG AGCCGACCGC CGCGGCCCTC
GCCTACGGGT TGGACAAGGG CGAGAAGGAG CAGACCATCC TGGTCTTCGA CCTCGGTGGC
GGCACCTTCG ACGTCTCCCT GCTGGAGATC GGCGACGGTG TCGTCGAGGT GAAGTCCACC
TCCGGTGACA CCCACCTCGG TGGTGACGAC TGGGATCAGC GCATCACCGA CCACCTGATC
AAGACCTTCA ACGGCCAGCA CGGGGTCGAC CTCGGCAAGG ACAAGATGGC CCTGCAGCGG
CTGCGGGAGG CCGCCGAGAA GGCGAAGATC GAGCTTTCGC AGTCCACCCA GACCGCGATC
AACCTGCCCT ACATCACCGC GTCGGCCGAG GGTCCCCTGC ACCTCGACGT CTCGCTGTCG
CGCGCCGAGT TCCAGCGGAT GACCTCCGAC CTCATCGACC GCTGCAGGCA CCCCTTCCAG
CAGGCGGTGA AGGACGCGGG GATCAAGGTC AGCGACATTG ACCACGTCGT CCTCGTCGGT
GGTTCCACCC GGATGCCCGC CGTCGTCGAG CTGGTCCGCG AGCTGACCGG GGGCAAGGAG
CCGAACAAGG GCGTCAACCC GGACGAGGTC GTCGCCGTCG GCGCCTCGCT GCAGGCCGGC
GTGCTCAAGG GCGAAGTCAA GGACGTCCTG CTCCTCGACG TCACCCCGCT CTCGCTCGGC
ATCGAGACGA AGGGGGGGAT CATGACCAAG CTCATCGAGC GCAACACCAC GATCCCCACC
AAGCGCTCGG AGATCTTCAC CACCGCCGAG GACAGCCAGC CCTCCGTGCA GATCCAGGTC
TTCCAGGGCG AGCGCGAGAT GGCCGCCTAC AACAAGAAGC TCGGCATGTT CGAGCTGACC
GGCCTGCCGC CCGCCCCGCG GGGCATGCCC CAGATCGAGG TCACCTTCGA CATCGACGCG
AACGGCATCG TGCACGTGTC CGCGAAGGAC CTGGGCACCG GCAAGGAGCA GTCGATGACC
ATCACCGGCG GCTCGGCGCT GCCGAAGGAC GACATCGACC GCATGGTCCG CGACGCCGAG
CAGTACGCCG AGGAGGACCG CAGGCGCCGG GAGGAGGCCG AGACCCGCAA CAAGGCCGAG
ACGCTCGTCT ACTCGACCGA GCGGTTCCTC GCCGAGAACG GCGAAAAGAT CGACGCCGCG
GTGAAGAGCG ACGTGGAGGA GAAGCTCGGC ACGCTGCGGG GCGCCGTCGC CGGCACCGAC
ACGGCGGCGA TCCGCGAGGC GGCCGACGCG CTGGCCGCGG CGAGCCAGGC GATGGGCCAG
GCCATGTACG CGCAGGCCGC CCAGGAGGGC GCGGGTGCCT CCGGCGGCGC CGGAGCAGCC
GACGACGACG TGGTCGACGC GGAGATCGTG GATGAGGAGG ACAAGAAGTG A
 
Protein sequence
MIERHTDMAR AVGIDLGTTN SVVSVLEGGE PTVIANAEGS RTTPSVVAFA KNGEVLVGEV 
AKRQAVTNVE RTIRSVKRHM GTDWKMKVDA KDLTPQEISA FILQKLKRDA ESYLGETVTD
AVITVPAYFD DAQRQATTEA GTIAGLNVLR IVNEPTAAAL AYGLDKGEKE QTILVFDLGG
GTFDVSLLEI GDGVVEVKST SGDTHLGGDD WDQRITDHLI KTFNGQHGVD LGKDKMALQR
LREAAEKAKI ELSQSTQTAI NLPYITASAE GPLHLDVSLS RAEFQRMTSD LIDRCRHPFQ
QAVKDAGIKV SDIDHVVLVG GSTRMPAVVE LVRELTGGKE PNKGVNPDEV VAVGASLQAG
VLKGEVKDVL LLDVTPLSLG IETKGGIMTK LIERNTTIPT KRSEIFTTAE DSQPSVQIQV
FQGEREMAAY NKKLGMFELT GLPPAPRGMP QIEVTFDIDA NGIVHVSAKD LGTGKEQSMT
ITGGSALPKD DIDRMVRDAE QYAEEDRRRR EEAETRNKAE TLVYSTERFL AENGEKIDAA
VKSDVEEKLG TLRGAVAGTD TAAIREAADA LAAASQAMGQ AMYAQAAQEG AGASGGAGAA
DDDVVDAEIV DEEDKK