Gene Rsph17029_3939 details


Gene Information

Locus tag: Rsph17029_3939
Symbol:
ID: 4898241
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009050
Strand:
Start bp: 1072308
End bp: 1073552
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 74%
IMG OID: 640114542
Product: molecular chaperone, DnaK
Protein accession: YP_001045789
Protein GI: 126464676
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0443] Molecular chaperone
TIGRFAM ID:
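
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive (an assumption that agrees with the reported 1245 bp) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of the record.

```python
# Coordinates copied from the record; helper names are hypothetical.
START_BP = 1072308
END_BP = 1073552


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1245 414
```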


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0350575
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAGCCG ACACGCTCGC CATCGACTTC GGAACCTCGA ACTCCGCCGC CGCGGTGCTC 
GAGGAGGGCC GCGTGCAGAG GCTCGCGCTC GAACCGGGTG CCGACACGCT GCCGACGGCG
GTCTTCTTCC CCGCCGACCG CGGCCCGATG CGGATCGGCG CCGCGGCGGC CGAGGCGCTG
ATCGCGGGCG AGGAGGGGCG CTACATGCGC GCGCTGAAAA GCGTGCTGGG CGCGCCGCTC
TTTCACGAAG CGCGGCTGGT GGGCGGGCGG CGGCGTACGC TGGCCCAAGT GGTGACGGCC
TTCCTCGCCG AGGTGAAAGC GCGTGCCGAG CAGACCACGG GCCGCCCGTT CCGCCGCGCC
CTCTCGGGAC GCCCCGTCCG CTTTCATGCC GATCCGGCGC GCGACGCGCG CGCCGAAGAG
GATCTGCGCG CCTGCTATCT CGCGGCCGGT TTCGAGGAGG TCTCGTTCCT CTACGAGCCA
GAGGCCGCCG CGCTCGCGAG CCACGCGCTC GGCCGCGACG GCGCGCCGGG GCTGATCGTG
GACATCGGCG GCGGCACCTC GGACTTCTCG CTCTTCCGCG CCGAGGCCGG GCAGGTGGGC
ATCCTCGCCA GCCACGGCAT CCGGCTCGGC GGGACCGACT TCGACCGCTC GGTCTCGCTC
GCCCATGCCA TGCCCTGCCT CGGCCATGGC GGGCGGCTGC GGCGCGAGAT GGGGCCGGGG
CTTCTGCCCA TGCCGAACGA CCTCTTCGTC GATCTCGCGA CCTGGGCCAA GATCCCGTTC
CTCTATACCC GCGAAACCCG CTCGGCGGTG GCCGAGATGG TGAAGCGCGC CGAGGAGCCG
CGGCTGCTGC GCCGTCTGGC CGAGGTGCTG GAGCATGAGC TCGGGCACGA TCTGGCATTC
GCGGTCGAGC GCGGCAAGAT CGAGGCCAAT GCGGACAGGC CCGGCGCCGC GATCCGCATG
GGCCGGATCG AGCGCGGGCT CTACGCCGAG ATCAGCCGGG CCTCGCTGGC CGAGGCGCTG
GCACGGCACC GCGCCGACCT CCGGGCCGCC GCCGAAGAGA CCTGCCGCCG CGCGGGCGTG
GCGCCGCAGG AGGTCGAGAC GGTGATCCTC GTCGGCGGCT CGAGCCTCAT GCGGCTGGTG
GCCGAGGAGG CCCGCGCCCT CTGCCCCGCA GCCGAACTGC GGAGCGCCGA GGCCTTCACG
GCGGTGGTGG ACGGGCTGGC GCTGGCGATC GGCCGGCCCG GCTGA
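
The gene sequence is displayed in blocks of 10 bases, so the spacing has to be removed before any computation. As a rough sketch, the snippet below strips the display whitespace and computes a GC fraction. It is run on the first display line only, so its result is not expected to equal the 74% reported for the whole gene. Applying it to the full sequence is one way to compare against that figure. The helper names are illustrative.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First display line of the gene sequence, used here as a short excerpt.
excerpt = "ATGCAAGCCG ACACGCTCGC CATCGACTTC GGAACCTCGA ACTCCGCCGC CGCGGTGCTC"
seq = clean_sequence(excerpt)
print(len(seq))  # 60
print(f"GC fraction of excerpt: {gc_fraction(seq):.1%}")
```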
 
Protein sequence
MQADTLAIDF GTSNSAAAVL EEGRVQRLAL EPGADTLPTA VFFPADRGPM RIGAAAAEAL 
IAGEEGRYMR ALKSVLGAPL FHEARLVGGR RRTLAQVVTA FLAEVKARAE QTTGRPFRRA
LSGRPVRFHA DPARDARAEE DLRACYLAAG FEEVSFLYEP EAAALASHAL GRDGAPGLIV
DIGGGTSDFS LFRAEAGQVG ILASHGIRLG GTDFDRSVSL AHAMPCLGHG GRLRREMGPG
LLPMPNDLFV DLATWAKIPF LYTRETRSAV AEMVKRAEEP RLLRRLAEVL EHELGHDLAF
AVERGKIEAN ADRPGAAIRM GRIERGLYAE ISRASLAEAL ARHRADLRAA AEETCRRAGV
APQEVETVIL VGGSSLMRLV AEEARALCPA AELRSAEAFT AVVDGLALAI GRPG
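
The protein sequence can be cross-checked against the gene sequence by translating it with translation table 11. The sketch below builds the codon table from the compact NCBI listing (table 11 assigns the same amino acids as the standard code and differs in its permitted start codons) and translates the first and last codons of the gene. It does not handle alternative start codons such as GTG or TTG, which is sufficient here because this gene starts with ATG. The function name is illustrative.

```python
from itertools import product

# Codon assignments in NCBI order: bases T, C, A, G with the third position
# varying fastest. '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display spacing
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence: matches the start of the protein.
print(translate("ATGCAAGCCG ACACGCTCGC CATCGACTTC"))  # MQADTLAIDF
# Last 15 bases of the gene sequence, ending in the TGA stop codon.
print(translate("GGCCGGCCCG GCTGA"))  # GRPG
```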