Gene Hlac_1568 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1568 
Symbol 
ID7401501 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1586171 
End bp1587235 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content71% 
IMG OID643708635 
Productamidohydrolase 
Protein accessionYP_002566225 
Protein GI222479988 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACCTTG AAGGGACCGT TCTGGTCGGT CCGGAGTTCG AGCCGGTCGA AGGCCGGGTC 
ATCGTCGCCG ACGACGAGAT CGTCCGGATC GAGGAGACGA GCGTCGACTC CGACGACGTG
ATCCTCCCGG CGTTCGTCAA CGCCCACACC CACATCGGTG ACTCCATCGC CAAGGAGGCC
GGCGAGGGCC TCTCGCTCGA CGAGCTGGTC GCACCGCCGG ACGGGCTCAA ACACCGCCTC
CTGCGGGAGG CGAGCCACGA GGCGAAAGTC GCCGCGATGG CGCGGAGCCT CCGGTACATG
GAGTCGACCG GGACCGGCAC GTTCCTGGAG TTCCGCGAGG GCGGCGTCAA GGGCGTGGCC
GCGCTCCGGG ACGCGGTCGC GGGCGAGGGC GTCGACTTCG GCGAGCGCGC GATCGATCCG
GTCGTGTTCG GTCGTGACGA CCCCGACGTG CTCTCGGTCG CTGACGGGTA CGGCGCCTCC
GGTGCCCGCG ACGCCGACTT CGACGCGGTG CGCTCGGAGA CTCGCGAGGC GGGCAAGCTG
TTCGGGATCC ACGCCGGCGA GCGCGACGCC GACGACATCA ACGCCGCGAT GGATCTCGAC
CCCGACTTTC TCGTCCACAT GGTCCACGCC GAGCCGATCC ACCTCGAGCG GCTCGCCGAC
CGCGGGACGC CCGTGGCCGT CTGCCCCCGG TCGAATCTCG TGACGAACGT CGGCGTGCCG
CCGATCCGCG ACCTCGCCGA GCGGACGACG GTCGCGCTCG GCACCGACAA CGTCATGCTC
GACTCGCCGT CGATGTTCCG CGAGATGGAG TTCGCCGCGA AGCTCTCCGA TCTCCCGGCC
CGAGAGATCC TGCGGATGGC GACGGTGAAC GGCGCGGCGA TCGCGGGGCT GAACCGCGGC
GTGATCGAGC TGGGCGCGGA TGCCGATCTG TTGGTGCTCG ACGGCGACTC CGACAACCTT
GCCGGCGCGC ACGACCTCGT TCGCGCGATC GTCAGGCGCG CCGGCGCGGC CGACGTGTCT
CGGGTCGTAA TCGGCGGCGA GCCGGCCGGG AGAGAAACGG TTTAG
 
Protein sequence
MHLEGTVLVG PEFEPVEGRV IVADDEIVRI EETSVDSDDV ILPAFVNAHT HIGDSIAKEA 
GEGLSLDELV APPDGLKHRL LREASHEAKV AAMARSLRYM ESTGTGTFLE FREGGVKGVA
ALRDAVAGEG VDFGERAIDP VVFGRDDPDV LSVADGYGAS GARDADFDAV RSETREAGKL
FGIHAGERDA DDINAAMDLD PDFLVHMVHA EPIHLERLAD RGTPVAVCPR SNLVTNVGVP
PIRDLAERTT VALGTDNVML DSPSMFREME FAAKLSDLPA REILRMATVN GAAIAGLNRG
VIELGADADL LVLDGDSDNL AGAHDLVRAI VRRAGAADVS RVVIGGEPAG RETV