Gene Information |
Locus tag | Hmuk_2447 |
Symbol | |
ID | 8411991 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 2348906 |
End bp | 2350753 |
Gene Length | 1848 bp |
Protein Length | 615 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645020788 |
Product | hemerythrin-like metal-binding protein |
Protein accession | YP_003178262 |
Protein GI | 257388489 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1022] Long-chain acyl-CoA synthetases (AMP-forming) |
TIGRFAM ID | [TIGR02481] hemerythrin-like metal-binding domain |
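The coordinate and length fields above are mutually consistent, and a short check makes the arithmetic explicit. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table for Hmuk_2447.
length_bp = gene_length(2348906, 2350753)
print(length_bp)                  # 1848
print(protein_length(length_bp))  # 615
```

Both results agree with the Gene Length (1848 bp) and Protein Length (615 aa) rows.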
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.148826 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0273457 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGGAGC CAGCCCAGAG TATGCAACTA GAGTCCGAGG ACGCGTTCGC CCGGTGGGAC GACGAGCGCT ACAGCACACA GATCGACCGC TTCGACGAAC AGCACAAGCG GCTGTTCGGC CTGCTGAACG ATCTCCACAC GGCCATGGAC GAGGGCCACT CACAGGACGA GATCGGTGAC ATCCTCCGAG AGCTCGAACG GTACACCGAG TACCACTTCG GCGACGAGGA AGAGTTCATG CAAGACTGTG GCTACGCGAT GGACTGTGCC GACTGTTTCT ACGACCACCG AGAGATGCAC GAGGAGTTCG CGTCGACGGT CAGCGACTTT CGCGAACGTC ACGAGGACGG CGAGTACGTC ACGATGGAGG TCCTCACCTT CCTCCGGGAC TGGCTCGACA GCCACATCGC CGCGGGTGAC GAAGACCAGC GCTACAGCGA GTACTACCAG ACGGACGTCG ACGAGACCTA CGAGTACACG CCCGGAACGC TCCGGCAGCG TCGCCGCTCG GAGTCACCCC CCGAGACGAC CGACGACCAG CCGACGACGA CGGTCTCCGT GGAAGAAGCG GTCTTAGACG GCGGCGAGCT GTCCGTTCCC GCCGGCCCGA TCGCGTCGTG GTTCGAACAG GTGGCGACGA CACACGGAGA CCGTGTGGCT ACGGTCGAAC ACGGCGGCGA CGAGCGGACG GAACGGAGCT TCGAGTCGCT GTACGAGCGT GCGACGACGG TCGCTGGCGG GCTACTGGAG ACGGAGCTCA CGCCGGGCGA CCGGCTGGCG ATCGACCTCG AATCGAACGG CGAGTCGCTG CTCTTCGACC TCGCCAGCCA CCTCGCCGGG CTCGTCTCTG TTCCGCTGTA TCCCTCGTTC GACGACGAAC AGCTGCGATC GATCGTCACG ACCGCCGACA TCGACGGGTT CGCGTCACCC GATGACCCGC CTTCGGCCGT CGAGCGGGCA GTCGACGTGG TCGTCGACAC GGAGCCGCTG CCAGCGTCGC CACAGCGCTC CCTGCCGGGG TTGAATCGCC GCGGGACCGA TCTCGCGACG ATCGTCTATC AGGTCCCCGC AGACGACGAG CCGACCGGCG TCGCGTTGAC CCACCGGAAC CTCCGGGCGG CCATCGCGGC ACTGAGCGAC GCGCTCCCAC TGGACCCCGG TGCGACGGGG ACCGCGCTCC TGCCGGTCGC ACACGTCTAC CAGCGCGTCG GGGCCTACTA CCTCTGGGAC GCCGGGGCCA CCGTCGCGTA CACCGACCGG GCCAGCAGTG TCGAGGCACT GCCGGCACTC GGCCCGGAGG TACTGATCGG CGTCCCCAAG CTGTACCAGC AGCTGTACGG CGAGCTTCAG GACCGGATCG GCACCTTCGG CTGGGCCAAG CGCAAGGTCG CCGGGAGCGT CACCGGGTAC GGGCGCGACG TGATCGACGG CAGCGGCACG CCGCTGAAGT ACGCGGCAGC GGAACGACTG GCCTACCGGC CGCTGCGCCA GGAGTTCGGG CTCGACGATC TGACCTACGC ACTGTCGAGC ACCGGTCGCC TCGACGATCA CCTCCTCGAT TTCTTCCACG GCCTCGGTGT CCCCCTGTGT GAACTGTCCG GGACCACCGA GACGAGCGCC GTCGGAACGA TCAACGGCCC CGACGACTTC GAGCGCGACA GTGTCGGGGA GGCACTGCCC GGCGTTACCG TCGGGCTCTC GGCCGACAGC GACGTCCTGA TCGACGGTCC GACAGTCATG GACAGGTACT GCAACGATCC CGAGGCGACC GAGCGGGCAG TACACGACGG CTGGTTCCGC 
ATCGACGACG GCTCGGTCGA AGGCAGTGAT CTCGGGCTCC AGAAGTGA
|
Protein sequence | MVEPAQSMQL ESEDAFARWD DERYSTQIDR FDEQHKRLFG LLNDLHTAMD EGHSQDEIGD ILRELERYTE YHFGDEEEFM QDCGYAMDCA DCFYDHREMH EEFASTVSDF RERHEDGEYV TMEVLTFLRD WLDSHIAAGD EDQRYSEYYQ TDVDETYEYT PGTLRQRRRS ESPPETTDDQ PTTTVSVEEA VLDGGELSVP AGPIASWFEQ VATTHGDRVA TVEHGGDERT ERSFESLYER ATTVAGGLLE TELTPGDRLA IDLESNGESL LFDLASHLAG LVSVPLYPSF DDEQLRSIVT TADIDGFASP DDPPSAVERA VDVVVDTEPL PASPQRSLPG LNRRGTDLAT IVYQVPADDE PTGVALTHRN LRAAIAALSD ALPLDPGATG TALLPVAHVY QRVGAYYLWD AGATVAYTDR ASSVEALPAL GPEVLIGVPK LYQQLYGELQ DRIGTFGWAK RKVAGSVTGY GRDVIDGSGT PLKYAAAERL AYRPLRQEFG LDDLTYALSS TGRLDDHLLD FFHGLGVPLC ELSGTTETSA VGTINGPDDF ERDSVGEALP GVTVGLSADS DVLIDGPTVM DRYCNDPEAT ERAVHDGWFR IDDGSVEGSD LGLQK
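The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and computing its GC fraction. The following is a minimal sketch, assuming the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand) and that the spaces between 10-base blocks are display formatting only. The helper names `clean`, `gc_content`, and `translate` are hypothetical. The codon assignments used are those of the standard genetic code, which translation table 11 shares; table 11 differs only in its set of permitted start codons.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace (including the 10-base block spacing) and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence shown above.
head = "ATGGTGGAGC CAGCCCAGAG TATGCAACTA"
tail = "CTCCAGAAGTGA"
print(translate(head))  # MVEPAQSMQL
print(translate(tail))  # LQK
```

The two fragments reproduce the first ten and last three residues of the protein sequence. Passing the full gene sequence to `gc_content` should give a value comparable to the GC content row, although the record does not state how that figure was rounded.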