Gene Information |
Locus tag | Msed_1646 |
Symbol | |
ID | 5104851 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1587658 |
End bp | 1588704 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640507537 |
Product | hypothetical protein |
Protein accession | YP_001191725 |
Protein GI | 146304409 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG3425] 3-hydroxy-3-methylglutaryl CoA synthase |
TIGRFAM ID | [TIGR00748] hydroxymethylglutaryl-CoA synthase, putative |
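The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon. Neither convention is stated in the record, and the variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 1587658
end_bp = 1588704
gene_length_bp = 1047
protein_length_aa = 348

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the reported protein length.
codons = span // 3
expected_protein = codons - 1

print(span, span % 3, expected_protein)  # prints: 1047 0 348
```

Under these assumptions the span equals the listed Gene Length, is a whole number of codons, and yields the listed Protein Length.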
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCATACAG GTATCATTGG ATGGGGCTCT TACGTTCCGA AGTATAGAAT TAAGGTAAGC GATATTGCCT CTGTTTGGGG CAAGGAGGAG GGAGTTGTTA AGGCACTAGG TCTCACGGAG AAGTCAGTTC CAGCAGCTGA TGAGGACTCA ACGACGATGG CAATTGAGGC CTCTAGGGAT GCCCTAACGA GGGCAATGAT TGACCCTAGG GAAGTTGAGA TGGCACTCTT TGGTTCTGAG TCAAAGGTTT ACGCAGTGAA GTCAACCTCA GCGATCCTGA TAGACGCGCT TGGTCTGTCC AAGTTCTCCT TAACGGCAGA CCTAGAGTTC GCCTGCAGGG CTGCTTCGGC AGGACTCAGG ATGGCTTTCT CCATGGTCGA GAGCGGTCAG GTTTCCTACT CCCTAGTGGT TGGATCTGAT ACGGCCCAAT CCAACCCAGG TGACGTCCTC GAGTTAAGCT CTGCCGCAGC TGCAGTTGCC TTCGTTGTCG GAAGAGCGGA GGAGGCCTCA GCTGTGGTCG AGGCGAGTAC ATCCTACGTT ACCGATACCC CGGATTTCTG GAGGAGGGAT GGAATGCCTT ACCCGCTTCA CGGGGAGGCC TTCACAGGAG AACCAGCTTA CTTTGCCCAC ATTTATGAGG CCGTGAATAG GTTGCTTCAG GACACCGGGC TCAAGGTTTC TGACTTTGAC TACTTTGTGT TTCACCAACC CAACGGAAAG TTCCCGTTCC AGATGGCCAA GAAACTTGGG GTACCACTTG AAAAGGTGAA ACAGGGGATG GTCTCAACCC TGATTGGGAA TCCCTACAAT GCCTCGGCTC TCCTCGGGTT CGCGAGGGTA CTAGATGTGG CCAAGCCTGG CCAGAGGGTT CTCGTTGCTC CCTTCGGGAG CGGTGCTGGA AGTGACGCAT ACAGCTTCGT GATAACTGAT AAGATCCTTG AAAGACAGAA GTTAGCCCAC ACCACGGACT ACTACATCCA AAGAAAGAAG CTCGTGGATT ACGCGAGTTA CGCAAAGACA ACCCACAAGT TCAAGGTTTA CGACTAG
Protein sequence | MHTGIIGWGS YVPKYRIKVS DIASVWGKEE GVVKALGLTE KSVPAADEDS TTMAIEASRD ALTRAMIDPR EVEMALFGSE SKVYAVKSTS AILIDALGLS KFSLTADLEF ACRAASAGLR MAFSMVESGQ VSYSLVVGSD TAQSNPGDVL ELSSAAAAVA FVVGRAEEAS AVVEASTSYV TDTPDFWRRD GMPYPLHGEA FTGEPAYFAH IYEAVNRLLQ DTGLKVSDFD YFVFHQPNGK FPFQMAKKLG VPLEKVKQGM VSTLIGNPYN ASALLGFARV LDVAKPGQRV LVAPFGSGAG SDAYSFVITD KILERQKLAH TTDYYIQRKK LVDYASYAKT THKFKVYD
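The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. This is a minimal example, not the database's own method: it builds the codon-to-amino-acid mapping shared by NCBI translation tables 1 and 11 (the two tables differ only in permitted start codons), strips the spaces between the 10-base blocks, and stops at the first stop codon. The helper names `clean` and `translate` are hypothetical, and only the first 30 bases of the gene are used here.

```python
from itertools import product

# Amino acids for all 64 codons in TCAG order (NCBI tables 1 and 11 share these).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the whitespace separating 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence in frame 0, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than T, C, A or G.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence and first block of the protein above.
gene_start = "ATGCATACAG GTATCATTGG ATGGGGCTCT"
protein_start = "MHTGIIGWGS"

print(translate(gene_start))  # prints: MHTGIIGWGS
```

Passing the full Gene sequence value to `translate` and comparing the result with `clean` applied to the Protein sequence value would extend the same check to the whole record.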