Gene Hmuk_2447 details

Gene Information

Locus tag: Hmuk_2447
Symbol:
ID: 8411991
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 2348906
End bp: 2350753
Gene Length: 1848 bp
Protein Length: 615 aa
Translation table: 11
GC content: 67%
IMG OID: 645020788
Product: hemerythrin-like metal-binding protein
Protein accession: YP_003178262
Protein GI: 257388489
COG category: [I] Lipid transport and metabolism
COG ID: [COG1022] Long-chain acyl-CoA synthetases (AMP-forming)
TIGRFAM ID: [TIGR02481] hemerythrin-like metal-binding domain
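
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 2348906
end_bp = 2350753
gene_length_bp = 1848
protein_length_aa = 615


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one for the stop codon (assumes a complete CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(gene_length_bp)

# Both comparisons should hold if the listed fields are mutually consistent.
print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under those assumptions, the listed gene length and protein length agree with the listed coordinates.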


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.148826
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.0273457
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGGAGC CAGCCCAGAG TATGCAACTA GAGTCCGAGG ACGCGTTCGC CCGGTGGGAC 
GACGAGCGCT ACAGCACACA GATCGACCGC TTCGACGAAC AGCACAAGCG GCTGTTCGGC
CTGCTGAACG ATCTCCACAC GGCCATGGAC GAGGGCCACT CACAGGACGA GATCGGTGAC
ATCCTCCGAG AGCTCGAACG GTACACCGAG TACCACTTCG GCGACGAGGA AGAGTTCATG
CAAGACTGTG GCTACGCGAT GGACTGTGCC GACTGTTTCT ACGACCACCG AGAGATGCAC
GAGGAGTTCG CGTCGACGGT CAGCGACTTT CGCGAACGTC ACGAGGACGG CGAGTACGTC
ACGATGGAGG TCCTCACCTT CCTCCGGGAC TGGCTCGACA GCCACATCGC CGCGGGTGAC
GAAGACCAGC GCTACAGCGA GTACTACCAG ACGGACGTCG ACGAGACCTA CGAGTACACG
CCCGGAACGC TCCGGCAGCG TCGCCGCTCG GAGTCACCCC CCGAGACGAC CGACGACCAG
CCGACGACGA CGGTCTCCGT GGAAGAAGCG GTCTTAGACG GCGGCGAGCT GTCCGTTCCC
GCCGGCCCGA TCGCGTCGTG GTTCGAACAG GTGGCGACGA CACACGGAGA CCGTGTGGCT
ACGGTCGAAC ACGGCGGCGA CGAGCGGACG GAACGGAGCT TCGAGTCGCT GTACGAGCGT
GCGACGACGG TCGCTGGCGG GCTACTGGAG ACGGAGCTCA CGCCGGGCGA CCGGCTGGCG
ATCGACCTCG AATCGAACGG CGAGTCGCTG CTCTTCGACC TCGCCAGCCA CCTCGCCGGG
CTCGTCTCTG TTCCGCTGTA TCCCTCGTTC GACGACGAAC AGCTGCGATC GATCGTCACG
ACCGCCGACA TCGACGGGTT CGCGTCACCC GATGACCCGC CTTCGGCCGT CGAGCGGGCA
GTCGACGTGG TCGTCGACAC GGAGCCGCTG CCAGCGTCGC CACAGCGCTC CCTGCCGGGG
TTGAATCGCC GCGGGACCGA TCTCGCGACG ATCGTCTATC AGGTCCCCGC AGACGACGAG
CCGACCGGCG TCGCGTTGAC CCACCGGAAC CTCCGGGCGG CCATCGCGGC ACTGAGCGAC
GCGCTCCCAC TGGACCCCGG TGCGACGGGG ACCGCGCTCC TGCCGGTCGC ACACGTCTAC
CAGCGCGTCG GGGCCTACTA CCTCTGGGAC GCCGGGGCCA CCGTCGCGTA CACCGACCGG
GCCAGCAGTG TCGAGGCACT GCCGGCACTC GGCCCGGAGG TACTGATCGG CGTCCCCAAG
CTGTACCAGC AGCTGTACGG CGAGCTTCAG GACCGGATCG GCACCTTCGG CTGGGCCAAG
CGCAAGGTCG CCGGGAGCGT CACCGGGTAC GGGCGCGACG TGATCGACGG CAGCGGCACG
CCGCTGAAGT ACGCGGCAGC GGAACGACTG GCCTACCGGC CGCTGCGCCA GGAGTTCGGG
CTCGACGATC TGACCTACGC ACTGTCGAGC ACCGGTCGCC TCGACGATCA CCTCCTCGAT
TTCTTCCACG GCCTCGGTGT CCCCCTGTGT GAACTGTCCG GGACCACCGA GACGAGCGCC
GTCGGAACGA TCAACGGCCC CGACGACTTC GAGCGCGACA GTGTCGGGGA GGCACTGCCC
GGCGTTACCG TCGGGCTCTC GGCCGACAGC GACGTCCTGA TCGACGGTCC GACAGTCATG
GACAGGTACT GCAACGATCC CGAGGCGACC GAGCGGGCAG TACACGACGG CTGGTTCCGC
ATCGACGACG GCTCGGTCGA AGGCAGTGAT CTCGGGCTCC AGAAGTGA
 
Protein sequence
MVEPAQSMQL ESEDAFARWD DERYSTQIDR FDEQHKRLFG LLNDLHTAMD EGHSQDEIGD 
ILRELERYTE YHFGDEEEFM QDCGYAMDCA DCFYDHREMH EEFASTVSDF RERHEDGEYV
TMEVLTFLRD WLDSHIAAGD EDQRYSEYYQ TDVDETYEYT PGTLRQRRRS ESPPETTDDQ
PTTTVSVEEA VLDGGELSVP AGPIASWFEQ VATTHGDRVA TVEHGGDERT ERSFESLYER
ATTVAGGLLE TELTPGDRLA IDLESNGESL LFDLASHLAG LVSVPLYPSF DDEQLRSIVT
TADIDGFASP DDPPSAVERA VDVVVDTEPL PASPQRSLPG LNRRGTDLAT IVYQVPADDE
PTGVALTHRN LRAAIAALSD ALPLDPGATG TALLPVAHVY QRVGAYYLWD AGATVAYTDR
ASSVEALPAL GPEVLIGVPK LYQQLYGELQ DRIGTFGWAK RKVAGSVTGY GRDVIDGSGT
PLKYAAAERL AYRPLRQEFG LDDLTYALSS TGRLDDHLLD FFHGLGVPLC ELSGTTETSA
VGTINGPDDF ERDSVGEALP GVTVGLSADS DVLIDGPTVM DRYCNDPEAT ERAVHDGWFR
IDDGSVEGSD LGLQK
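
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such blocks could be joined, translated, and compared against the listed protein. It uses only the first line of each sequence as sample input. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one, whose amino-acid assignments translation table 11 shares; alternative start codons are not handled, which does not matter here because the gene begins with ATG.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Sample input: the first line of the gene sequence and the first two
# blocks of the protein sequence, copied from the listing above.
first_gene_line = "ATGGTGGAGC CAGCCCAGAG TATGCAACTA GAGTCCGAGG ACGCGTTCGC CCGGTGGGAC"
first_protein_blocks = "MVEPAQSMQL ESEDAFARWD"

dna = clean(first_gene_line)
print(translate(dna) == clean(first_protein_blocks))
print(f"GC fraction of the sample line: {gc_fraction(dna):.2f}")
```

Passing the full gene sequence through `clean` and `gc_fraction` in the same way would give a value to compare with the GC content field, and `translate` would give a string to compare with the full protein sequence.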