Gene Rsph17025_2645 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_2645 
Symbol 
ID5085069 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp2686155 
End bp2687144 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content68% 
IMG OID640484208 
ProductHsp33 protein 
Protein accessionYP_001168837 
Protein GI146278678 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1281] Disulfide bond chaperones of the HSP33 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0673391 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.801183 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATCG GTTCACAGAT CGCCTGGGAC GACACCGTCC TGCCCTTCCA GCTTGACCGC 
TCGGACATCC GCGGCCGTGT GGTGCGGCTG GATGGCGTGC TCGAGGAAGT GCTGTCGAAG
CATGATTATC CGCCGCAGAT CGAGGCGCTG GTGGCCGAGG CGTCGCTGCT GACCGCGCTG
ATCGGGCAGG CCATCAAGCT GCGCTGGAAA CTGTCGCTGC AGATCCGCGG CAATGGGGCG
GTGCGGATGA TTGCCACCGA CTATTACAGC CCTCCGGAAG ATGGCGAGCC CGCGCGGATC
CGTGCCTATG CGAGCTATGT CGCCGAGGAT CTGGCCCCCG GCGCCGCCGC CTTCGACCAG
CTGGGTGAGG GCTATTTTGC CATCCTGATC GATCAGGGGC AGGGGATGGT GCCCTATCAG
GGCATCACTC CGATCGCGGG CGGGTCGTTG ACCGCCTGCG CCGAGACCTA TTTCGCCCAG
TCCGAGCAGC TTCCCACCCG CTTCGCCCTG TCCTTCGGCC AGTCCACGGC CAACGGCGCG
ACTCACTGGC GGGGCGGCGG CGTGATGCTT CAGCACATGC CCAAGGCCTC GCCCGGCGTG
GCGGGCGAGG GCGGATCGGG CGAGGGCGGG CTTCTGCAGC ACCACGACCT GCTCGAGGGC
GACGAGGGCG AGAACTGGAC GCGCGCGAAC CTGCTGCTCG ACACGGTCGA GGATCTCGAA
CTGGTGGGTC CGTCGGTTCA GCCGCCGGAC CTGCTCGTGC GGCTGTTCCA CGAAGAGGAA
CCGCGCGTGT TCGAGGCGCA GACCCTGCGC TTTGGCTGCT CCTGTTCGGC GGATCGGGTG
CGCGAGTCGC TGTCGATCTA CGCGCCCGAG GAGATCGCCG AGATGACGAC GGATGAGGGC
ATCCTTACCG CCGACTGCCA GTTCTGCGGC GCGCATTACG AGTTCGACCC CGCGACGCTG
GGGACCGGCG CAGGGAGCGG CGATGCCTGA
 
Protein sequence
MTIGSQIAWD DTVLPFQLDR SDIRGRVVRL DGVLEEVLSK HDYPPQIEAL VAEASLLTAL 
IGQAIKLRWK LSLQIRGNGA VRMIATDYYS PPEDGEPARI RAYASYVAED LAPGAAAFDQ
LGEGYFAILI DQGQGMVPYQ GITPIAGGSL TACAETYFAQ SEQLPTRFAL SFGQSTANGA
THWRGGGVML QHMPKASPGV AGEGGSGEGG LLQHHDLLEG DEGENWTRAN LLLDTVEDLE
LVGPSVQPPD LLVRLFHEEE PRVFEAQTLR FGCSCSADRV RESLSIYAPE EIAEMTTDEG
ILTADCQFCG AHYEFDPATL GTGAGSGDA