Gene Hmuk_2436 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_2436 
Symbol 
ID8411980 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp2338786 
End bp2340045 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content67% 
IMG OID645020779 
ProductHaloacid dehalogenase domain protein hydrolase type 3 
Protein accessionYP_003178253 
Protein GI257388480 
COG category[R] General function prediction only 
COG ID[COG0561] Predicted hydrolases of the HAD superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.454359 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.00185111 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAAGCAGT ACGACAGACT CTACCGACTC TACGACGACT TCGATACGGC GTCGTTGCGG 
GCCTATCAGG AGTTCGTCGA CCTCTTTCCA CCGGTCGACT CGCGAGTCGC ACTGGACCAC
TGGCAGGACG CCAGCGACGA ACTGGACGAG CGAAAGCGCG AGATCCGGGA GGCGTACCCG
ACGACCGGCG AGCGCTACGC GGCGATCGCG GCGTACCTCG ATCGCGATCA GGCCTTTACC
GCGCTGGATC TGGCGACCGA GTACGGCCGG CCGATCAACG CGCTCGTACT CGACGTCGAC
GAGACGCTGC GCAGTGCCGG GACGACCGAC AACGAGATCC CCCGAGAGAC GCTCCACCTC
CTCGATCTCC TCCACGAGGA CGGCATTCCG ATCGTCGTCT GTACCGGCCA GACCCTGGAG
AACGTCAAGG GATTCCTCAT CCAGGGGCTC GGCAACGCCT TCGTCCACTC GGGAGACGTG
AGCATCGTCT ACGAGTCGGG CAACGGCGTG TTCACGCCCC GCCACGGCAG CCAGACCAAG
CGGCTGCTGT ACGAATCGCT GGACGAGTCG ATCACGAACG TCTTCGACAC CGTGCGTTCG
CGGATCCTCG CCGACGCCCC CGACGCCATC GCGCGAGGCT GTCACCTCCA GGGCAACGAG
TTCAACGTCA CGCTGAAGCC AAACGCCGAG AACGGCACCC GCAAGGCGGT CGAGATCATC
GACGACGGCC TGCTGTACCT GCTCGATCTC CTCGGCGAGA CGGTCGCGAG CGAGGTCGAT
GCCGACGTGG ACGAGCCCGG CGACTGGGCG CGGGCGTACT ACGCGTCCGA TCCCGAGATC
CGCGCGGTCA TCGAAGACGC GGGGGAGGGA CAGGAAGCCA GCCCCGACGA GATGCCGCCC
GCCCTCCGGT CGCTGTTCGA CCGCATCGAC CTGGGATACT ACGAAGGTGA CGCCGCGGAG
ATCGTCAGCC TCGAACTCGA CAAGCCGGCC GGGGTCCGGG CTGCATTCGA CGTACTCGAC
ATCGACGATC CGTTCGCGCT CGTGATGGGC GACAGCAAGA GCGACCTCCG GGTCATGGAG
TGGCTCGAAC GAGACGACGC CGGCATCGCC GCCGCGCCGG AACACGCCTC GCGGGACGTG
CTCGATCACG TCGTCCGGAC GAACGAACTC CTCTTCGACG CGGGCGACGC CGCTGCGGTG
CTGGCGGTCG TCTACGGCCT GAACCGCCTC GTCACGAGCC GCGACGACGC GGACCGGTGA
 
Protein sequence
MKQYDRLYRL YDDFDTASLR AYQEFVDLFP PVDSRVALDH WQDASDELDE RKREIREAYP 
TTGERYAAIA AYLDRDQAFT ALDLATEYGR PINALVLDVD ETLRSAGTTD NEIPRETLHL
LDLLHEDGIP IVVCTGQTLE NVKGFLIQGL GNAFVHSGDV SIVYESGNGV FTPRHGSQTK
RLLYESLDES ITNVFDTVRS RILADAPDAI ARGCHLQGNE FNVTLKPNAE NGTRKAVEII
DDGLLYLLDL LGETVASEVD ADVDEPGDWA RAYYASDPEI RAVIEDAGEG QEASPDEMPP
ALRSLFDRID LGYYEGDAAE IVSLELDKPA GVRAAFDVLD IDDPFALVMG DSKSDLRVME
WLERDDAGIA AAPEHASRDV LDHVVRTNEL LFDAGDAAAV LAVVYGLNRL VTSRDDADR