Gene Hlac_0061 details


Gene Information

Locus tag: Hlac_0061
Symbol:
ID: 7401416
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 64231
End bp: 65565
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 72%
IMG OID: 643707122
Product: amidohydrolase
Protein accession: YP_002564737
Protein GI: 222478500
COG category: [F] Nucleotide transport and metabolism
              [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
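
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(64231, 65565)
print(length)                  # 1335
print(protein_length(length))  # 444
```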


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0182363
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATACGC TTCGCGTGAC GGGCGGACGG GTGCTCCGTC CCGACGGCCG CGTGACCGAG 
TCGGACGTGA CGATCGACCG CGACGCCGGA ACGATCGTCG CGGTGGGCGA CGAGACGGTG
AGCGACGCCG AAGCGGGAAG CGACGCCGAA GTCGCGAGCG ACGGGGAGAC CCTCGACGCG
TCCGGCTCGC TCGTGATCCC CGGACTCGTC AACGCGCACA CGCACGTCGC GATGACGCTC
CTTCGGGGAT ACGCCGACGA CAAGCCGCTC GACCCGTGGC TACGGGAGGA CATCTGGCCG
GCCGAGGCCG AACTCACGCC GGACGACATC GAGGCGGGCG CCGAACTCGG CGTCGTCGAG
ATGATCCGGT CGGGGACGAC CGCGTTCGCG GACATGTACT TCGCGATGGA CCGCGTCGCG
GACGTGGTCG ATCGCGCGGG GCTGCGGGCG CGCCTTGGCC ACGGGGTCGT CACGATCGGG
AAGGACGCCG AGGGCGCTCG CGCCGACGTC GAGGAGAGTC TCGCGGTCGC TCGCGAACTC
GACGGCGCCG GAGACGGGCG GATCCGGACC GCCTTCATGC CGCACTCGCT GACGACGGTG
GGCGAGGAGT ACCTCCACGA GGGCGTCGCG GAGGCGCGCG AGGCGGGCGT CCCGATTCAC
CTCCACGCGA ACGAGACGGA AGACGAGGTC GACCCGATCG TCGACGAGCG CGGGGAGCGT
CCGATCGCGT ACGCGCAGGA TCTCGACGCG CTCGGCCCGG ACGACTTCTT CGCGCACGGC
GTCCACCTCG ACGGCTCGGA GATCGACCAG ATCGCCGACG CGGGCACCGC GATCGTCCAC
TGTCCGGCCT CGAACATGAA GCTCGCAAGC GGGATGGCCC CGGTCCAGCG GCTCCGCGAC
GCGGGCGTCA CGGTCGCGCT CGGCACCGAT GGGGCGGCCT CGAACAACGA CCTCGATGTG
TTCGACGAGA TGCGCGACGC CGCCATGCTC GGGAAGCTCG CTGCGGACGA CGCCACCGCG
GTGCCCGCCG AGGCGGTCGT GGAGATGGCG ACGGCCGGCG GTGCAGACGC TCTCGGCCTC
CCCGGCGGTC GGATCGAGCC GGGCGCGGCC GCCGACCTCG CCGTCGTTGA CCTCGACGCC
CCGCACCTGA CGCCAGTCCA CGACCCCGTC TCCCACCTCG CGTACGCGGC GCACGGGAGC
GACGTGCGCC ACACCGTCTG CGACGGCGAG GTGTTGATGC GCGACCGCGA GGTCCTGACG
CTCGACGCTG AGCGCGTACA GGAGCGGGCG GCGACGGCCG CGAGCGACCT CGTCGATCGA
GTCAGCGAAT CGTAA
 
Protein sequence
MNTLRVTGGR VLRPDGRVTE SDVTIDRDAG TIVAVGDETV SDAEAGSDAE VASDGETLDA 
SGSLVIPGLV NAHTHVAMTL LRGYADDKPL DPWLREDIWP AEAELTPDDI EAGAELGVVE
MIRSGTTAFA DMYFAMDRVA DVVDRAGLRA RLGHGVVTIG KDAEGARADV EESLAVAREL
DGAGDGRIRT AFMPHSLTTV GEEYLHEGVA EAREAGVPIH LHANETEDEV DPIVDERGER
PIAYAQDLDA LGPDDFFAHG VHLDGSEIDQ IADAGTAIVH CPASNMKLAS GMAPVQRLRD
AGVTVALGTD GAASNNDLDV FDEMRDAAML GKLAADDATA VPAEAVVEMA TAGGADALGL
PGGRIEPGAA ADLAVVDLDA PHLTPVHDPV SHLAYAAHGS DVRHTVCDGE VLMRDREVLT
LDAERVQERA ATAASDLVDR VSES
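
The sequences above are displayed in space-separated blocks of 10 characters, so they need cleaning before any downstream use. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example uses only the last line of the gene sequence rather than the full record.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# Last line of the gene sequence as displayed in this record.
last_line = "GTCAGCGAAT CGTAA"
tail = clean_sequence(last_line)
print(len(tail))              # 15
print(tail.endswith("TAA"))   # True: the CDS ends with a TAA stop codon
```

Applying `gc_fraction` to the full cleaned gene sequence should give a value comparable to the GC content field reported above, assuming that field was computed over the same 1335 bp.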