Gene Hhal_1475 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1475 
Symbol 
ID4710004 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1595172 
End bp1596329 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content67% 
IMG OID639855942 
Productchaperone protein DnaJ 
Protein accessionYP_001003044 
Protein GI121998257 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0484] DnaJ-class molecular chaperone with C-terminal Zn finger domain 
TIGRFAM ID[TIGR02349] chaperone protein DnaJ 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGAAGC GTGACTACTA CGAGGTGTTG GGGGTCAACA AGAATGCCTC CGATGCCGAG 
ATCAAGAAGG CCTACCGGCG CATGGCCCAG AAGTTCCACC CGGACCGCAA CCCGGGGGAT
GAGGAATCGG CCGAGCGCTT TAAGGAGGTC AAAGAGGCCT ACGAGGTCCT CTCCGACGCG
CAGAAGCGCG CCGCCTACGA TCAGTTCGGC CACGCCGGAG TGGATCCGTC GGCTGGCGGC
GGGCCGCGTG GCTACGGTGG CGGGGCCGGT CCGGGCGGTC CCGATTTCTC GGATATCTTC
TCCGATGTCT TCGGCGATAT CTTCGGGTCG GGGGGTGGTC GCGGCGGTGG CCGCAGCCGC
GCCTTCCGCG GCGCCGATCT GCGCTACACC CTGGAGCTGA GCCTCGAGGA CGCCGTGCGG
GGTACCGAGG AGCAGATCCA GGTGCCGACC CACGTCGAGT GCGACGCCTG CAAGGGCTCG
GGTTCGCGGG CCGGCTCCAA GCCGCAGACC TGCCCGACGT GTAAGGGCCA CGGCGATGTG
CGCGTCCAAC AGGGCTTCTT CTCGATCCAG CAGACCTGCC CGCGCTGTGG TGGTGAAGGG
ACCATGGTGA CCGATCCGTG TCCCAAGTGC CGGGGACGGG GTCGGGTCGA GGATCGCAAG
ACGCTCAATG TGCGCATCCC CGCCGGTGTC GATACCGGCG ACCGAATCCG CCTCTCCGGT
GAAGGTGAGC CGGGTGAGCG GGGGGGGCCG CCCGGTGATC TTTACGTGCA GGTGGCCGTG
CGCGAGCACG AGTTCTTCGA GCGCGACGGC GCGGATCTGC ACTGTCAGGT GCCGGTGGAT
ATCGTCACCG CGGCCCTGGG TGGTGAGGTG GAGGTCCCCA CCCTCGACGG TCGCGTCAAC
CTGCGCATCC CGCCGGGGAC GCAGCCCAAC CAAGTCTTCC GTCTGCGCGG CAAGGGCGTG
AAGCCGGTGC GCGGCAACCG TCAGGGTGAT CTGCTCTGCC GGATCCACGT CGAGACACCG
GTCAACCTGA CCAAGCGCCA GCGTGAGCTG CTTGAGGAGT TCCAGGCGAC CCTGCAGGAC
ACCGGCGGCA AGCACCATCC GCACACCTCG TCGTGGCTGG ACAAGGTCAA GCGCTTCATC
GAGGAGTGGC GGATATGA
 
Protein sequence
MAKRDYYEVL GVNKNASDAE IKKAYRRMAQ KFHPDRNPGD EESAERFKEV KEAYEVLSDA 
QKRAAYDQFG HAGVDPSAGG GPRGYGGGAG PGGPDFSDIF SDVFGDIFGS GGGRGGGRSR
AFRGADLRYT LELSLEDAVR GTEEQIQVPT HVECDACKGS GSRAGSKPQT CPTCKGHGDV
RVQQGFFSIQ QTCPRCGGEG TMVTDPCPKC RGRGRVEDRK TLNVRIPAGV DTGDRIRLSG
EGEPGERGGP PGDLYVQVAV REHEFFERDG ADLHCQVPVD IVTAALGGEV EVPTLDGRVN
LRIPPGTQPN QVFRLRGKGV KPVRGNRQGD LLCRIHVETP VNLTKRQREL LEEFQATLQD
TGGKHHPHTS SWLDKVKRFI EEWRI