Gene Francci3_2173 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2173 
SymboldnaK 
ID3906773 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2545187 
End bp2547112 
Gene Length1926 bp 
Protein Length641 aa 
Translation table11 
GC content70% 
IMG OID637879506 
Productmolecular chaperone DnaK 
Protein accessionYP_481272 
Protein GI86740872 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0443] Molecular chaperone 
TIGRFAM ID[TIGR02350] chaperone protein DnaK 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.232061 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.194401 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAAGG CAGTCGGCAT TGATCTGGGG ACGACCAATT CGGTGATCGC CGCCTGGGAG 
GGTGGCGAGC CGAGGGTCAT CCCGAACGCG GAGGGTGCGC GGACGACACC GTCGGTGGTG
GCGTTCACCG AGAGCGGTGA ACGCCTGGTC GGGCAGCTGG CGCGACGGCA GGCGATCCTG
AACCCGAAGG GCACGATCTA CTCGGCCAAG CGGTTCATCG GCCGCCGCTA TGACGAGGTG
ACCGAGGAGG CCAAAGCGGT CGCCTTCGAC GTCGTCCCGG GCCCGGAGGG CGCCGCCCGG
TTCAGAATCC GGGACAAGCT CTACGCGCCG GAGGAGATCA GCGCGCTGGT GCTGCGCAAG
CTGGCCGACG ACGCTGGGCG GAACCTGGGC GAGAAGGTCA CCGAAGCGGT CATCACGGTG
CCGGCCTACT TCAACGACGC GCAGCGCACC GCGACGAAGG ACGCCGGCCG GATCGCCGGC
CTGAACGTGC TGCGGATCAT CAACGAGCCG ACCGCCGCCG CATTGGCGTA CGGGCTGGAC
AAGAAGGGCC ACGAGACCGT CCTGGTCTTC GATCTCGGCG GCGGCACGTT CGACGTGAGC
ATCCTCGACG TCGGCGACGG GGTGGTCGAG GTCCGGGCCA CGGCCGGGGA CAGCCACCTG
GGCGGGGACG ACTTCGACCG CCGGCTGGTC GACTACCTCG CCGACGGGTT CCACCGGGAA
CAGGGCATCG ACCTGCGCCG GGACCCCCAG GCACTGCAAC GGCTGTTCGA GGCCGCGGAG
AAGGCCAAGG TCGAACTGAG CGCGGTCACC CAGACCCAGG TGAACCTGCC GTTCATCACC
GCCGACGCCA CCGGGCCGCG GCACCTGACC GCCACGGTCA TGCGCTCCAC CTTCGAACAG
ATCACCGCCG ACCTGGTCGA ACGGACCATG GGACCGGTCC GCCAGGCGAT GGAGGACTCC
AAGATCAGTG CCGACGACCT CGACGAGGTG ATCCTCGTCG GCGGCGCCAC CCGGATGCCC
GCGGTGCAGG CACTGGTCCG GCGGCTGACC GGCGGCAAGG AACCGAACAT GAGCGTCAAC
CCCGACGAGG TCGTCGCCCT GGGCGCGGCG ATCCAGGCCG GGGTGCTCAA GGGCGAAGTC
TCCGACGTGC TCCTGCTCGA CGTCACCCCG CTCTCCCTCG GGGTGGAGAC CCGCGGCGGC
CTGATGACGA AAATCGTCGA ACGCAACACC ACGATCCCGG TGCGACGCTC CGAGGTGTTC
TCCACCGCGG AGGACAACCA GCCGGCCGTG GACATCGTGG TACTGCAGGG CGAACGCGAA
CGCGCCGCCG ACAACCGGGT ACTCGGCCGC TTCCGCCTGG AAAACATCCG CCCGGCGCCC
CGCGGCGAAC CGCAGGTCGA GGTCACCTTC GACATCGACG CCAACGGCAT CCTCAACGTC
ACCGCCCGCG ACAAGGACAC CGGAGCCCAG CAGTCGATCA CGATCAGTGA GACCTCCAAC
CTCCCCAAGG AGGAGGTCGA ACGGATGATC ACCGAGGCGG AGCGTAACCG CACCACCGAC
CAGGCGCTAC GCGCCGCAAT CGACGCCCGC AACGAGCTGG ACTCGGTCGC CTACCGCGTC
GAACGGCTGC TGGCCGAACT CGGTGACGCC GCCCCGCCCC ACGAGAAGGC CCGCGCCGAC
CTGCTGGTCG CCGACGCCCG CCAGGCCATC AAAGAGGAGG CTCCGGTGGA CCGGGTCCGC
GCCCTGACCT CGGATCTCCA ACAGATCTAC CACGGCCTGG CCGCCGCCCG AACCGGCCAG
GCCGCCCCCA CCGAGGGAAC CGGCGGCAGC CCGACCGACA CCGCCCCGGG CGGCCCACAG
CCACCACCTG GCGGCGGCAC CCCTGGCGGC GACGACGTGG TAGACGCCGA GTTCGACCGG
GGCTGA
 
Protein sequence
MAKAVGIDLG TTNSVIAAWE GGEPRVIPNA EGARTTPSVV AFTESGERLV GQLARRQAIL 
NPKGTIYSAK RFIGRRYDEV TEEAKAVAFD VVPGPEGAAR FRIRDKLYAP EEISALVLRK
LADDAGRNLG EKVTEAVITV PAYFNDAQRT ATKDAGRIAG LNVLRIINEP TAAALAYGLD
KKGHETVLVF DLGGGTFDVS ILDVGDGVVE VRATAGDSHL GGDDFDRRLV DYLADGFHRE
QGIDLRRDPQ ALQRLFEAAE KAKVELSAVT QTQVNLPFIT ADATGPRHLT ATVMRSTFEQ
ITADLVERTM GPVRQAMEDS KISADDLDEV ILVGGATRMP AVQALVRRLT GGKEPNMSVN
PDEVVALGAA IQAGVLKGEV SDVLLLDVTP LSLGVETRGG LMTKIVERNT TIPVRRSEVF
STAEDNQPAV DIVVLQGERE RAADNRVLGR FRLENIRPAP RGEPQVEVTF DIDANGILNV
TARDKDTGAQ QSITISETSN LPKEEVERMI TEAERNRTTD QALRAAIDAR NELDSVAYRV
ERLLAELGDA APPHEKARAD LLVADARQAI KEEAPVDRVR ALTSDLQQIY HGLAAARTGQ
AAPTEGTGGS PTDTAPGGPQ PPPGGGTPGG DDVVDAEFDR G