Gene Hlac_2662 details


Gene Information

Locus tag: Hlac_2662
Symbol:
ID: 7400867
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 2645397
End bp: 2647088
Gene Length: 1692 bp
Protein Length: 563 aa
Translation table: 11
GC content: 68%
IMG OID: 643709734
Product: thermosome
Protein accession: YP_002567303
Protein GI: 222481066
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0459] Chaperonin GroEL (HSP60 family)
TIGRFAM ID: [TIGR02339] thermosome, various subunits, archaeal
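
The coordinates and lengths listed above can be cross-checked with a few lines of Python. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2645397, 2647088)
print(length)                           # 1692
print(expected_protein_length(length))  # 563
```

Both values agree with the Gene Length and Protein Length fields of this record.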


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTGTAA TGTCACAGCG CCAGCGGATG GGCAATCAGC CCATGATCGT ACTTTCCGAG 
GAGTCACAGC GTACCTCCGG AAAGGACGCT CAGAACATGA ATATCACGGC CGGCAAGGCG
GTCGCGGAGT CCGTCCGCAC CACGCTCGGC CCGAAAGGGA TGGACAAGAT GCTCGTCGAC
TCCGGCGGGT CCGTCGTCGT CACGAACGAC GGCGTCACCA TCCTGAAGGA GATGGACATC
GACCACCCGG CGGCCAACAT GATCGTCGAG GTCAGCGAGA CGCAGGAGGA GGAAGTCGGT
GACGGCACCA CCTCCGCTGT CGTCGTCGCC GGTGAGCTCC TCGATCAGGC CGAGGAGCTC
CTCGATCAGG ATATCCACGC GACCACGCTC GCGCAGGGAT ACCGTCAGGC CGCCGAGAAG
GCCAAGGATA TCCTCGAAGA GGAGGCCATC GAGGTCTCCG AGGACGACCG CGACACCCTC
GTCCAGATCG CCGAGACGGC GATGACGGGC AAGGGCGCCG AGAACTCCAA GGACCTCCTC
GCCGAACTCG TCGTCGACTC CGTCCTCGCG GTTCAGGACG ACGACAGCAT CGACACGGAC
AACGTCTCCG TCGAGAAGGT CGTCGGCAGC TCCATCGACA AGTCCGAGCT CGTCGAGGGC
GTCATCGTCG ACAAAGAGCG CGTCGACGAG AACATGCCCT TCGCGGTCGA GGACGCCGAC
GTGGCGCTGT TCGACGGCGC CATCGAAGTG AAGGAGACGG AGATCGACGC CGAAGTCAAC
GTCACGGACC CCGACCAGCT CCAGCAGTTC CTCGACCAGG AAGAGGAGCA GCTCCGCGAG
ATGGTCGACC ACCTCGTCGA CATCGGTGCC GACGTCGTCT TCGTCGGTGA CGGCATCGAC
GACATGGCAC AGCACTACCT CGCGCAGGAG GGCATCCTCG CCGTCCGCCG CGCGAAGTCC
GACGACCTCA AGCGCCTCGC CCGCGCGACC GGCGGCCGCG TCGTCTCCAA CCTCGACGAC
ATCGAGACGG ACGACCTCGG CTTCGCCGGC TCCGTCGCAC AGAAGGACAT CGGCGGCGAC
GAGCGCATCT TCGTCGAGGA CGTCGAAGAG GCCAAGTCCG TCACGCTCAT CCTCCGCGGC
GGCACCGAAC ACGTCGTTGA CGAGGTCGAG CGCGCCATCG AGGACTCGCT CGGCGTCGTC
CGCACGACGC TGCTCGACGG GAAGGTGCTG CCCGGCGGCG GCGCCCCCGA GGCCGAGCTC
GCGCTGCAGC TCCGCGACTT CGCCGACTCC GTCGGCGGCC GCGAGCAGCT CGCCGTCGAG
GCGTTCGCCG ACGCGCTGGA AGTCATCCCG CGCACCCTCG CCGAGAACGC GGGTCTCGAT
CCCATCGACT CGCTGGTCGA CCTCCGCTCC CGGCACGACG CCGGCGAGTT CGGCGCCGGT
CTCGACGCCT ACACGGGCGA CGTGATCGAC ATGGAGGCCG AGGGCGTCGT GGAGCCGCTC
CGCGTCAAGA CCCAGGCCAT CGAGTCCGCC ACCGAGGCGG CAGTCATGAT CCTCCGCATC
GACGACGTCA TCGCGGCCGG CGACCTCAAG GGCGGCGGCT CCGACGACGG CGGCGACGAG
GGCGGCCCCG GCGGCGCGCC CGGCGGCATG GGCGGCATGG GCGGCATGGG CGGCATGGGC
GGTGCGATGT AA
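
The gene sequence is printed in blocks of ten bases, so it needs to be joined before any analysis. The sketch below shows one way to strip the spacing and compute the GC fraction for comparison with the GC content field; the helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as printed above; paste all lines
# to evaluate the complete gene.
raw = "ATGGTTGTAA TGTCACAGCG CCAGCGGATG GGCAATCAGC CCATGATCGT ACTTTCCGAG"
seq = clean_sequence(raw)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Applied to the full sequence, `len(seq)` should match the Gene Length field.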
 
Protein sequence
MVVMSQRQRM GNQPMIVLSE ESQRTSGKDA QNMNITAGKA VAESVRTTLG PKGMDKMLVD 
SGGSVVVTND GVTILKEMDI DHPAANMIVE VSETQEEEVG DGTTSAVVVA GELLDQAEEL
LDQDIHATTL AQGYRQAAEK AKDILEEEAI EVSEDDRDTL VQIAETAMTG KGAENSKDLL
AELVVDSVLA VQDDDSIDTD NVSVEKVVGS SIDKSELVEG VIVDKERVDE NMPFAVEDAD
VALFDGAIEV KETEIDAEVN VTDPDQLQQF LDQEEEQLRE MVDHLVDIGA DVVFVGDGID
DMAQHYLAQE GILAVRRAKS DDLKRLARAT GGRVVSNLDD IETDDLGFAG SVAQKDIGGD
ERIFVEDVEE AKSVTLILRG GTEHVVDEVE RAIEDSLGVV RTTLLDGKVL PGGGAPEAEL
ALQLRDFADS VGGREQLAVE AFADALEVIP RTLAENAGLD PIDSLVDLRS RHDAGEFGAG
LDAYTGDVID MEAEGVVEPL RVKTQAIESA TEAAVMILRI DDVIAAGDLK GGGSDDGGDE
GGPGGAPGGM GGMGGMGGMG GAM
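
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch that builds the codon table by hand; it relies on translation table 11 sharing its amino acid assignments with the standard genetic code (the two differ in permitted start codons), and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG order for first, second and third codon position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence, trimmed to whole codons.
print(translate("ATGGTTGTAA TGTCACAG"))  # MVVMSQ
```

The output matches the first six residues of the protein sequence above; passing the complete gene sequence should yield the full protein.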