Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_0416 |
Symbol | |
ID | 7401033 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 436055 |
End bp | 437701 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643707480 |
Product | thermosome |
Protein accession | YP_002565089 |
Protein GI | 222478852 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02339] thermosome, various subunits, archaeal |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.213711 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCAGG GTCAGCCGAT GATCATCATG GGCGACGACG CCCAGCGCGT CAAGGACAAG GACGCACAGG AGTACAACAT CTCCGCGGCG CGCGGCGTCG CCGAATCCGT ACGTTCGACG CTCGGGCCGA AAGGGATGGA CAAGATGCTC GTCGACTCGA CGGGCGGCGT CACCATCACG AACGATGGCG TCACCATCCT GCAGACGATG GACATCGACA ACCCGACCGC CGAGATGATC GTCGAAGTCG CCCAGACTCA GGAGGACGAG GCCGGCGACG GCACGACGAG CGCGGTCGCG ATCGCGGGCG AGCTCCTGAA GAACGCCGAG GACCTTCTCG AACAGGATAT TCACCCGACG GCCGTCATCA AGGGCTTCAA CCTCGCGAGC GAGTACGCCC GCGAGCAGGT CGACGAGGTC GCCACACAGG TCGACCCCGA CGACACCGAG ACCCTGAAAA GCGTCGCCGA GACGTCGATG ACCGGCAAGG GCGCGGAGCT CGATAAGGAC ACCCTCGCTG ACCTCGTCGT CCGCGCGATT CAGGGCGTCA CCGTCGAGGC CGACGACGGC TCGCACGTCG TCGATCTGGC GAACCTCAAC ATCGAGACGC GCACCGGCCG CGCGGCCGGA CAGTCCCGCC TGCTCTCGGG TGCCGTCATC GACAAGGACC CGGTCCACGA GGACATGCCG ACCGACTTCG AGGACGCGAA CGTCCTCCTC CTCAACGACC CGATCGAGGT CGAGGAGGCC GACGTTGACA CCGCCGTCAA CGTCGAGTCG CCCGACCAGC TCCAGCGCTT CCTCGATCAG GAAGAAGAGC AGCTCCGCGA CAAAGTCGAC AAAATCGTCG AATCTGGCGC CGACGTGGTC TTCTGTCAGA AGGGGATCGA CGACCTCGCC CAGCACTACC TCGCGAAGGA GGGCATCCTC GCCGTCCGCC GGACGAAGAA GTCCGACCTG ACCTTCCTCA AGAACGTCCT CGGCGCGCCC ATCGTCAGCG ACCTCGATTC GCTCACCGCC GACGACCTCG CGATCGGTAC GGTCAGCCGT GACACCGAGG AGGGGCTGTT CTACGTCGAG GGCGAGGACG CCCACGGCGT CACCCTCCTG CTGTACGGCA CCACCGAGCA CGTCGTCGAC GAGCTCGAAC GCGGCATCCA GGACGCCATC GACGTGGTCT CGACGACCGT CTCCGACGGG CGGACGCTCC CCGGCGGCGG CGCCATCGAG GTCGAGCTCG CGAGCCGTCT GCGCGACTAC GCCGACACCG TCTCGGGCCG CGAGCAGCTC GCCGTCGAGG CGTTCGCCGA CTCGCTCGAA CTGATCCCCC GCGTGCTCGC CGAGAACGCC GGGCTCGACG CCATCGACCT GCTCGTCGAC CTCCGCGCGG CCCACGAGGC CGGCGACACC GAGGCCGGAC TGAACGTCTT CTCCGGCGAA GTCGAGAACA CGACCGAGGC CGGCGTCGTC GAGACGGCTC ACGCGAAAGA GCAGGCGATC GCCTCCGCCG CCGAGGCCGC GAACCTCGTG CTGAAAATCG ACGACATCAT CTCTGCGGGC GACCTGTCGA CCGGCGGCGA CGGCGACGAG GGCGGCGCCC CCGCCGGCGG CATGGGCGGC ATGGGCGGTA TGGGCGGTGC GATGTAA
|
Protein sequence | MQQGQPMIIM GDDAQRVKDK DAQEYNISAA RGVAESVRST LGPKGMDKML VDSTGGVTIT NDGVTILQTM DIDNPTAEMI VEVAQTQEDE AGDGTTSAVA IAGELLKNAE DLLEQDIHPT AVIKGFNLAS EYAREQVDEV ATQVDPDDTE TLKSVAETSM TGKGAELDKD TLADLVVRAI QGVTVEADDG SHVVDLANLN IETRTGRAAG QSRLLSGAVI DKDPVHEDMP TDFEDANVLL LNDPIEVEEA DVDTAVNVES PDQLQRFLDQ EEEQLRDKVD KIVESGADVV FCQKGIDDLA QHYLAKEGIL AVRRTKKSDL TFLKNVLGAP IVSDLDSLTA DDLAIGTVSR DTEEGLFYVE GEDAHGVTLL LYGTTEHVVD ELERGIQDAI DVVSTTVSDG RTLPGGGAIE VELASRLRDY ADTVSGREQL AVEAFADSLE LIPRVLAENA GLDAIDLLVD LRAAHEAGDT EAGLNVFSGE VENTTEAGVV ETAHAKEQAI ASAAEAANLV LKIDDIISAG DLSTGGDGDE GGAPAGGMGG MGGMGGAM
|
| |