Gene Hlac_0729 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0729 
Symbol 
ID7400202 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp743444 
End bp744505 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content63% 
IMG OID643707795 
Productamidohydrolase 2 
Protein accessionYP_002565401 
Protein GI222479164 
COG category[R] General function prediction only 
COG ID[COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.455369 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACGAGA AGGACGGCGA GGAGATCTTC GTCATCGACG GCCACGTCCA CCTGTGGGAC 
GCGCGGCAGG AGAACATCAT TCACGAGGGG GGCGAGCAGT TCCTCCAGTG TTTCTACGAT
TACCATACCG GGTTCACCCC GGAGGAGGAA CAGTGGGACA TCGACGAGTA CCGCCACTAC
GGCGCCGACC GCATGACCGA GGACCTGTTC GGGAACGCGG CCGCCGACAT GGCAATCTTC
CAGCCGACGT ACCTCTCCGA CTTCTACGAC GAGGGGTTCA ACACGACCGA ACAGAACGCC
GAACTCGCGG AGGAGTACCC CGAGCGGTTC GTGCTCAACG GGAGCTTCGA CCCGCGTGAC
GGCGAAGAGG GGCTGCGCTA CCTCGAACAC CTCAAAGAGG AGTACGACAT CCCCGGCGTG
AAGCTGTACA CCGCTGAGTG GCGCGACGAC TCGAAGGGGT GGCGGCTCGA CAGCGACGAC
GCCTTCAAGT TCCTCGAGAA GTGTTCGGAG CTCGGCATCG AGAACATCAA CGCCCACAAG
GGACCGACGA TCCGCCCGCT CAACCGCGAC GCGTTCGACG TGAAGGACAT CGACGACGCC
GCCTCGTCGT TCCCGGAGCT CAACTTCATC GTCAACCACG TCGGGCTCCC GCGGCTCGAC
GACTTCTGTT GGATCGCCGC CCAAGAGCCG AACGTGTACG GCGGGCTCGC GGTCGCCTCC
GCGATGTCGA CTCACCGCGA GCGGAAATTC GGCGAGATCA TGGGTGAGCT CCTCTTCTGG
CTCGGCGAAG ACCGGGTCCT GTTCGGCTCC GACTACGCGC TGTGGAACCC CGACTGGCTC
GTCGAACAGG TGATAAACGC GGAACTCACC GACGAGCAGA AAGACGAGTA CGGCGTCGAG
CTCGACGTCG ATACGATGAA GAAGATCATG GGCGAGAACG CCGCGGAGCT GTACGACATC
GATATCGAGG AGAAAAAGCG GCAGTTCCGC GACGACGACA TCACGGAACG GTTCGACCTC
GAGTCCCACT ACGGCGGCGA TGCGGGGGCC AGGGCGGACT GA
 
Protein sequence
MYEKDGEEIF VIDGHVHLWD ARQENIIHEG GEQFLQCFYD YHTGFTPEEE QWDIDEYRHY 
GADRMTEDLF GNAAADMAIF QPTYLSDFYD EGFNTTEQNA ELAEEYPERF VLNGSFDPRD
GEEGLRYLEH LKEEYDIPGV KLYTAEWRDD SKGWRLDSDD AFKFLEKCSE LGIENINAHK
GPTIRPLNRD AFDVKDIDDA ASSFPELNFI VNHVGLPRLD DFCWIAAQEP NVYGGLAVAS
AMSTHRERKF GEIMGELLFW LGEDRVLFGS DYALWNPDWL VEQVINAELT DEQKDEYGVE
LDVDTMKKIM GENAAELYDI DIEEKKRQFR DDDITERFDL ESHYGGDAGA RAD