Gene Hlac_0599 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0599 
Symbol 
ID7401735 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp618998 
End bp619963 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content68% 
IMG OID643707665 
ProductAN1-type Zinc finger protein 
Protein accessionYP_002565271 
Protein GI222479034 
COG category[R] General function prediction only 
COG ID[COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.715069 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.614872 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACGT GCGACGTGTG TGGGGAGTAC GAGAACCTCC CGTACCAGTG TAACCGGTGC 
GGCAAGACGT TCTGTGCCAA CCACCGACTG CCCGAGAATC ACAACTGTCC GGGCCTCGCC
GAGTGGGACG ACCCCGGCGG CGTCTTCGAC AGCGGCTTCG ACGGGAGCGT CGAGAGCGGC
GGCGCGGGCG GGTCAGGCGA CGGTGCGTCC GCAGGCGTCA CGGACCGCGT CAAACAGCGA
ATCGACCGCG AGACGAGCAC TGGCGGGATT GTGAGCTATT TCCGCGGGAA CGCGACATAT
GCCCTGCTGG CGGCGATGTG GATCACGTTC CTCGCGCAGT GGGCCGTAAC CCTCCTCTTC
GGCGAGGCCG CCCACAGCCA GATCTTCGTC CTCCGATCGG ACGCGATCGG CAACGTCTGG
ACGTGGGTGA CCTCCGTGCT CTCGCACTCG CGGTTCGGAC TGTTCCACAT CATCGGCAAC
AGCATCGTGA TCTTGTTCTT CGGCCCACTC GTCGAGCGCG CGGTCGGCTC CCGCCGCTTC
GTCGGGTTCT TCTTCGCGTC GGGGATCCTC GCCGGCCTGG GCCACGTCCT GTTCGCGATC
GCGACGGGCG CCCCGACGAC GGGCGTGCTC GGTGCCAGCG GTGCCGGCTT CGCGATCTTA
GGCGTGCTCA CCGTGTGGCG GCCGAACATG CAGGTGCTCC TCTTCTTCGT CATCCCGATG
AAGATCAAGT ACCTCACGTG GGGGATCGCG CTCATCTCGG CGGTGCTCGT CGTCCAAAGC
GGCACGGGCG GCGTCGGCGG CATCGCGCAC CTCGCCCACC TGATCGGCTT CGCGATCGGA
CTCGCGTTCG GCAAGCGAAA CGAGAGCCTC GCGCGGTCCG CGGGCGGTCC CGGCGGGATG
CAGATGGGCG GCGCGAGAGG GCCGGGCGGT CCCCGAGGAC CGGGCGGGCC CGGCGGGCGG
TTCTGA
 
Protein sequence
MATCDVCGEY ENLPYQCNRC GKTFCANHRL PENHNCPGLA EWDDPGGVFD SGFDGSVESG 
GAGGSGDGAS AGVTDRVKQR IDRETSTGGI VSYFRGNATY ALLAAMWITF LAQWAVTLLF
GEAAHSQIFV LRSDAIGNVW TWVTSVLSHS RFGLFHIIGN SIVILFFGPL VERAVGSRRF
VGFFFASGIL AGLGHVLFAI ATGAPTTGVL GASGAGFAIL GVLTVWRPNM QVLLFFVIPM
KIKYLTWGIA LISAVLVVQS GTGGVGGIAH LAHLIGFAIG LAFGKRNESL ARSAGGPGGM
QMGGARGPGG PRGPGGPGGR F