Gene Hlac_0416 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0416 
Symbol 
ID7401033 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp436055 
End bp437701 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content68% 
IMG OID643707480 
Productthermosome 
Protein accessionYP_002565089 
Protein GI222478852 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02339] thermosome, various subunits, archaeal 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.213711 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCAGG GTCAGCCGAT GATCATCATG GGCGACGACG CCCAGCGCGT CAAGGACAAG 
GACGCACAGG AGTACAACAT CTCCGCGGCG CGCGGCGTCG CCGAATCCGT ACGTTCGACG
CTCGGGCCGA AAGGGATGGA CAAGATGCTC GTCGACTCGA CGGGCGGCGT CACCATCACG
AACGATGGCG TCACCATCCT GCAGACGATG GACATCGACA ACCCGACCGC CGAGATGATC
GTCGAAGTCG CCCAGACTCA GGAGGACGAG GCCGGCGACG GCACGACGAG CGCGGTCGCG
ATCGCGGGCG AGCTCCTGAA GAACGCCGAG GACCTTCTCG AACAGGATAT TCACCCGACG
GCCGTCATCA AGGGCTTCAA CCTCGCGAGC GAGTACGCCC GCGAGCAGGT CGACGAGGTC
GCCACACAGG TCGACCCCGA CGACACCGAG ACCCTGAAAA GCGTCGCCGA GACGTCGATG
ACCGGCAAGG GCGCGGAGCT CGATAAGGAC ACCCTCGCTG ACCTCGTCGT CCGCGCGATT
CAGGGCGTCA CCGTCGAGGC CGACGACGGC TCGCACGTCG TCGATCTGGC GAACCTCAAC
ATCGAGACGC GCACCGGCCG CGCGGCCGGA CAGTCCCGCC TGCTCTCGGG TGCCGTCATC
GACAAGGACC CGGTCCACGA GGACATGCCG ACCGACTTCG AGGACGCGAA CGTCCTCCTC
CTCAACGACC CGATCGAGGT CGAGGAGGCC GACGTTGACA CCGCCGTCAA CGTCGAGTCG
CCCGACCAGC TCCAGCGCTT CCTCGATCAG GAAGAAGAGC AGCTCCGCGA CAAAGTCGAC
AAAATCGTCG AATCTGGCGC CGACGTGGTC TTCTGTCAGA AGGGGATCGA CGACCTCGCC
CAGCACTACC TCGCGAAGGA GGGCATCCTC GCCGTCCGCC GGACGAAGAA GTCCGACCTG
ACCTTCCTCA AGAACGTCCT CGGCGCGCCC ATCGTCAGCG ACCTCGATTC GCTCACCGCC
GACGACCTCG CGATCGGTAC GGTCAGCCGT GACACCGAGG AGGGGCTGTT CTACGTCGAG
GGCGAGGACG CCCACGGCGT CACCCTCCTG CTGTACGGCA CCACCGAGCA CGTCGTCGAC
GAGCTCGAAC GCGGCATCCA GGACGCCATC GACGTGGTCT CGACGACCGT CTCCGACGGG
CGGACGCTCC CCGGCGGCGG CGCCATCGAG GTCGAGCTCG CGAGCCGTCT GCGCGACTAC
GCCGACACCG TCTCGGGCCG CGAGCAGCTC GCCGTCGAGG CGTTCGCCGA CTCGCTCGAA
CTGATCCCCC GCGTGCTCGC CGAGAACGCC GGGCTCGACG CCATCGACCT GCTCGTCGAC
CTCCGCGCGG CCCACGAGGC CGGCGACACC GAGGCCGGAC TGAACGTCTT CTCCGGCGAA
GTCGAGAACA CGACCGAGGC CGGCGTCGTC GAGACGGCTC ACGCGAAAGA GCAGGCGATC
GCCTCCGCCG CCGAGGCCGC GAACCTCGTG CTGAAAATCG ACGACATCAT CTCTGCGGGC
GACCTGTCGA CCGGCGGCGA CGGCGACGAG GGCGGCGCCC CCGCCGGCGG CATGGGCGGC
ATGGGCGGTA TGGGCGGTGC GATGTAA
 
Protein sequence
MQQGQPMIIM GDDAQRVKDK DAQEYNISAA RGVAESVRST LGPKGMDKML VDSTGGVTIT 
NDGVTILQTM DIDNPTAEMI VEVAQTQEDE AGDGTTSAVA IAGELLKNAE DLLEQDIHPT
AVIKGFNLAS EYAREQVDEV ATQVDPDDTE TLKSVAETSM TGKGAELDKD TLADLVVRAI
QGVTVEADDG SHVVDLANLN IETRTGRAAG QSRLLSGAVI DKDPVHEDMP TDFEDANVLL
LNDPIEVEEA DVDTAVNVES PDQLQRFLDQ EEEQLRDKVD KIVESGADVV FCQKGIDDLA
QHYLAKEGIL AVRRTKKSDL TFLKNVLGAP IVSDLDSLTA DDLAIGTVSR DTEEGLFYVE
GEDAHGVTLL LYGTTEHVVD ELERGIQDAI DVVSTTVSDG RTLPGGGAIE VELASRLRDY
ADTVSGREQL AVEAFADSLE LIPRVLAENA GLDAIDLLVD LRAAHEAGDT EAGLNVFSGE
VENTTEAGVV ETAHAKEQAI ASAAEAANLV LKIDDIISAG DLSTGGDGDE GGAPAGGMGG
MGGMGGAM