Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_0599 |
Symbol | |
ID | 7401735 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 618998 |
End bp | 619963 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643707665 |
Product | AN1-type Zinc finger protein |
Protein accession | YP_002565271 |
Protein GI | 222479034 |
COG category | [R] General function prediction only |
COG ID | [COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.715069 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.614872 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGACGT GCGACGTGTG TGGGGAGTAC GAGAACCTCC CGTACCAGTG TAACCGGTGC GGCAAGACGT TCTGTGCCAA CCACCGACTG CCCGAGAATC ACAACTGTCC GGGCCTCGCC GAGTGGGACG ACCCCGGCGG CGTCTTCGAC AGCGGCTTCG ACGGGAGCGT CGAGAGCGGC GGCGCGGGCG GGTCAGGCGA CGGTGCGTCC GCAGGCGTCA CGGACCGCGT CAAACAGCGA ATCGACCGCG AGACGAGCAC TGGCGGGATT GTGAGCTATT TCCGCGGGAA CGCGACATAT GCCCTGCTGG CGGCGATGTG GATCACGTTC CTCGCGCAGT GGGCCGTAAC CCTCCTCTTC GGCGAGGCCG CCCACAGCCA GATCTTCGTC CTCCGATCGG ACGCGATCGG CAACGTCTGG ACGTGGGTGA CCTCCGTGCT CTCGCACTCG CGGTTCGGAC TGTTCCACAT CATCGGCAAC AGCATCGTGA TCTTGTTCTT CGGCCCACTC GTCGAGCGCG CGGTCGGCTC CCGCCGCTTC GTCGGGTTCT TCTTCGCGTC GGGGATCCTC GCCGGCCTGG GCCACGTCCT GTTCGCGATC GCGACGGGCG CCCCGACGAC GGGCGTGCTC GGTGCCAGCG GTGCCGGCTT CGCGATCTTA GGCGTGCTCA CCGTGTGGCG GCCGAACATG CAGGTGCTCC TCTTCTTCGT CATCCCGATG AAGATCAAGT ACCTCACGTG GGGGATCGCG CTCATCTCGG CGGTGCTCGT CGTCCAAAGC GGCACGGGCG GCGTCGGCGG CATCGCGCAC CTCGCCCACC TGATCGGCTT CGCGATCGGA CTCGCGTTCG GCAAGCGAAA CGAGAGCCTC GCGCGGTCCG CGGGCGGTCC CGGCGGGATG CAGATGGGCG GCGCGAGAGG GCCGGGCGGT CCCCGAGGAC CGGGCGGGCC CGGCGGGCGG TTCTGA
|
Protein sequence | MATCDVCGEY ENLPYQCNRC GKTFCANHRL PENHNCPGLA EWDDPGGVFD SGFDGSVESG GAGGSGDGAS AGVTDRVKQR IDRETSTGGI VSYFRGNATY ALLAAMWITF LAQWAVTLLF GEAAHSQIFV LRSDAIGNVW TWVTSVLSHS RFGLFHIIGN SIVILFFGPL VERAVGSRRF VGFFFASGIL AGLGHVLFAI ATGAPTTGVL GASGAGFAIL GVLTVWRPNM QVLLFFVIPM KIKYLTWGIA LISAVLVVQS GTGGVGGIAH LAHLIGFAIG LAFGKRNESL ARSAGGPGGM QMGGARGPGG PRGPGGPGGR F
|
| |