Gene Hlac_2646 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2646 
Symbol 
ID7400851 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2630278 
End bp2631453 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content72% 
IMG OID643709718 
Productamidohydrolase 
Protein accessionYP_002567287 
Protein GI222481050 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACGTGA TTCGAGGCGG ACAAGTGGCT GACGTCGACG GGACTCGCGA GGCCGATGTC 
GCGGTCGCAG ACGGCGAGAT CGTCGCGGTC GGGCCGGACG CGGTCGACGA GATCGGCGGT
GAGGACGCGG TCGACGCCGA GACCGACGCG AGCGGCTCGG TGGTCGCATC GGGGCTGATC
GACGCGCACG TCCACGTCAT GATGGACGGG CGACCAGATG TCGCCACCGC GGTCTCCGAC
AGCGACTACA CCGCGAGCTA CCGGACCGCC GGCAACCTCC GAGACGCCCT CGAAGCGGGG
GTCACGACGG TCCGCGATCT GGGGGGCCGC GGGACGCTCG CGCTCGACGC GGGCGAGGCG
GTCGCCGCCG GCGACATCGA CGGTCCGCGC GTCCTCGCCT GCGGCCGCAA CGTGATCATG
ACCGGCGGCC ACGGCAACTG GTTCGGCCGC GAGGCCGACG GTCCGGCCGA GGTCCGAAAG
GCGGCCCGCG AGCAGCTGAA GGCGGGCGCG GACGTGCTCA AGTGCATGGC GACGGGCGGC
GTCCTTACCG AGGGCGCGGT GACCGGCGCC CCGGAGCTGA CTCCCGAAGA ACTCGCGGCG
TTCACCGATG CCGCCGCTCC GACGAACACT CCTACCGCGG CTCACGCCCA CGGCGAGACA
GGGATCAAGA ACGCGGTCGA GGCCGGGATT TCGAGTATCG AGCACGGCAC CTTCATGGAC
CGCGAGGCCG CCGAGATGAT GGCCGATCGA GGGACCTATT GGGTGCCGAC CGCGAGTGCG
CTCCGCGGAA TCGTTGATCA CGGCGTCGAG TCCGGGATCC CGGAGGACGC CGTCGAAAAG
GCCGAAGACG CCGCCGACCG CTTCGACGAC GCGTGGGGCC ACGCGCTGGA GGCCGACGTG
CCGATCGCAA TGGGCACGGA CGCCGGCACC CCGTTCAACT TCTTCGGGGA CATCCCGCGG
GAGCTTGCGT ACATGGTCGA GCACGGACTC TCGCCGGAGC GGGCGCTCGA GGCCGCCACC
GTCAACGCCG CGGATCTGCT CGGGCTCGAC GACGTGGGCC GAATCGGGGA GGGGTACCGC
GCCGACCTCG TCGTCCTCGA GGCCGACCCC ACCGAGGACG TGGCGGCGTG GCAGGAGCCG
GAGGCAGTGT TCGCCGCCGG CGAGCGGGTC GCGTAA
 
Protein sequence
MHVIRGGQVA DVDGTREADV AVADGEIVAV GPDAVDEIGG EDAVDAETDA SGSVVASGLI 
DAHVHVMMDG RPDVATAVSD SDYTASYRTA GNLRDALEAG VTTVRDLGGR GTLALDAGEA
VAAGDIDGPR VLACGRNVIM TGGHGNWFGR EADGPAEVRK AAREQLKAGA DVLKCMATGG
VLTEGAVTGA PELTPEELAA FTDAAAPTNT PTAAHAHGET GIKNAVEAGI SSIEHGTFMD
REAAEMMADR GTYWVPTASA LRGIVDHGVE SGIPEDAVEK AEDAADRFDD AWGHALEADV
PIAMGTDAGT PFNFFGDIPR ELAYMVEHGL SPERALEAAT VNAADLLGLD DVGRIGEGYR
ADLVVLEADP TEDVAAWQEP EAVFAAGERV A