Gene Hlac_0681 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0681 
Symbol 
ID7401816 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp698231 
End bp699391 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content70% 
IMG OID643707747 
Productchaperone protein DnaJ 
Protein accessionYP_002565353 
Protein GI222479116 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0484] DnaJ-class molecular chaperone with C-terminal Zn finger domain 
TIGRFAM ID[TIGR02349] chaperone protein DnaJ 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.034617 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.293226 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGATA ACTTCTACGA CGTCCTCGGG GTATCGCGGG ACGCCAGCGA GGAGGAGATC 
AAGAAGGCGT ACCGCAAGCA GGCCGCCGAA CACCACCCGG ACGTCAGCGA CGACGACGAC
GCCGAGGAGC GGTTCAAGGC GATCCAGAAG GCCAAGGAGG TGCTCACCGA CGAGCAGAAG
CGACAGCAGT ACGACCAGTT GGGTCACGAC CGCTTCACCG AGGCCGACAA GCGCGGCGCG
ACCGGCGGGG GCGGCCCCGG CGGCGCCGGT GGTCCCTTCG GCGGCGCGGG GGGTGCCGGC
GGCGCCGGCG GCTTCGAGGA CATCTTCAAC CAATTCTTCG GCGGTGGCGG CGGCCGCGGC
GGTGGCGGCG GCAACCGACC GCGTCAGGGA CAGGACCTCC GCACCGGGCT CACGATCGAC
TTAGAGGAGG CGTTCGAGGG CGCGACCAAG GAGGTCACAC TCACGCGACC GACCCAATGC
GACACCTGCG ACGGCGCGGG CCACCCGCCC GACGCCGACG TGGAGACCTG TTCGCAGTGT
AACGGGCGCG GACAGGTCCA GCAGGTCCAG CAGACGCCGC TCGGCCGCGT CCAGCAGACC
TCGACGTGTC CCCGGTGTGA AGGATCTGGC GAGCTGTACA GCGAGGACTG CGCCGACTGC
GGCGGCGACG GCGTCGTCCG CGAGGAGGCG ACCCTCTCGG TCGAGATTCC GGCGGGGATC
CGCTCGGGAC AGAGCCTCCG GATGGAACGC GAGGGCGCAC CGGGCGAGAA TGGCGGCCCC
AACGGCGACC TGCTGATCGA GGTCGACGTC GACGTCGGCG ACCGGTTCGA GCGCGACGGC
GACGACCTCC GGGTGAACGA GGCCGTCTCC TTCCCGCAGG CCGTCTTCGG CGACACGATC
GAGGTCGAGA CGGTCGACGG CAGCGTCGAG ATGGACGTGC CGACCGGGAC TCAGAGCGGC
GAGACGTTCC GTCTCAAGGG GAAGGGAATG CCCCGCCTGC GCCGGCGCGG GCGCGGCGAC
CTCTACGTGA AAGTGGGCGT CGTGATTCCC GACTCGCTCA ACGAGGAGCA GCGCGAGGCG
CTGGAGGCGT TCGCCGAGGC CGGCGGCGAA GACGTCGATG TCGGCGGCGG GTTCTTCAAG
AAGCTGAAGA GTTCCTTCTA G
 
Protein sequence
MSDNFYDVLG VSRDASEEEI KKAYRKQAAE HHPDVSDDDD AEERFKAIQK AKEVLTDEQK 
RQQYDQLGHD RFTEADKRGA TGGGGPGGAG GPFGGAGGAG GAGGFEDIFN QFFGGGGGRG
GGGGNRPRQG QDLRTGLTID LEEAFEGATK EVTLTRPTQC DTCDGAGHPP DADVETCSQC
NGRGQVQQVQ QTPLGRVQQT STCPRCEGSG ELYSEDCADC GGDGVVREEA TLSVEIPAGI
RSGQSLRMER EGAPGENGGP NGDLLIEVDV DVGDRFERDG DDLRVNEAVS FPQAVFGDTI
EVETVDGSVE MDVPTGTQSG ETFRLKGKGM PRLRRRGRGD LYVKVGVVIP DSLNEEQREA
LEAFAEAGGE DVDVGGGFFK KLKSSF