Gene Msed_1646 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1646 
Symbol 
ID5104851 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1587658 
End bp1588704 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content53% 
IMG OID640507537 
Producthypothetical protein 
Protein accessionYP_001191725 
Protein GI146304409 
COG category[I] Lipid transport and metabolism 
COG ID[COG3425] 3-hydroxy-3-methylglutaryl CoA synthase 
TIGRFAM ID[TIGR00748] hydroxymethylglutaryl-CoA synthase, putative 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATACAG GTATCATTGG ATGGGGCTCT TACGTTCCGA AGTATAGAAT TAAGGTAAGC 
GATATTGCCT CTGTTTGGGG CAAGGAGGAG GGAGTTGTTA AGGCACTAGG TCTCACGGAG
AAGTCAGTTC CAGCAGCTGA TGAGGACTCA ACGACGATGG CAATTGAGGC CTCTAGGGAT
GCCCTAACGA GGGCAATGAT TGACCCTAGG GAAGTTGAGA TGGCACTCTT TGGTTCTGAG
TCAAAGGTTT ACGCAGTGAA GTCAACCTCA GCGATCCTGA TAGACGCGCT TGGTCTGTCC
AAGTTCTCCT TAACGGCAGA CCTAGAGTTC GCCTGCAGGG CTGCTTCGGC AGGACTCAGG
ATGGCTTTCT CCATGGTCGA GAGCGGTCAG GTTTCCTACT CCCTAGTGGT TGGATCTGAT
ACGGCCCAAT CCAACCCAGG TGACGTCCTC GAGTTAAGCT CTGCCGCAGC TGCAGTTGCC
TTCGTTGTCG GAAGAGCGGA GGAGGCCTCA GCTGTGGTCG AGGCGAGTAC ATCCTACGTT
ACCGATACCC CGGATTTCTG GAGGAGGGAT GGAATGCCTT ACCCGCTTCA CGGGGAGGCC
TTCACAGGAG AACCAGCTTA CTTTGCCCAC ATTTATGAGG CCGTGAATAG GTTGCTTCAG
GACACCGGGC TCAAGGTTTC TGACTTTGAC TACTTTGTGT TTCACCAACC CAACGGAAAG
TTCCCGTTCC AGATGGCCAA GAAACTTGGG GTACCACTTG AAAAGGTGAA ACAGGGGATG
GTCTCAACCC TGATTGGGAA TCCCTACAAT GCCTCGGCTC TCCTCGGGTT CGCGAGGGTA
CTAGATGTGG CCAAGCCTGG CCAGAGGGTT CTCGTTGCTC CCTTCGGGAG CGGTGCTGGA
AGTGACGCAT ACAGCTTCGT GATAACTGAT AAGATCCTTG AAAGACAGAA GTTAGCCCAC
ACCACGGACT ACTACATCCA AAGAAAGAAG CTCGTGGATT ACGCGAGTTA CGCAAAGACA
ACCCACAAGT TCAAGGTTTA CGACTAG
 
Protein sequence
MHTGIIGWGS YVPKYRIKVS DIASVWGKEE GVVKALGLTE KSVPAADEDS TTMAIEASRD 
ALTRAMIDPR EVEMALFGSE SKVYAVKSTS AILIDALGLS KFSLTADLEF ACRAASAGLR
MAFSMVESGQ VSYSLVVGSD TAQSNPGDVL ELSSAAAAVA FVVGRAEEAS AVVEASTSYV
TDTPDFWRRD GMPYPLHGEA FTGEPAYFAH IYEAVNRLLQ DTGLKVSDFD YFVFHQPNGK
FPFQMAKKLG VPLEKVKQGM VSTLIGNPYN ASALLGFARV LDVAKPGQRV LVAPFGSGAG
SDAYSFVITD KILERQKLAH TTDYYIQRKK LVDYASYAKT THKFKVYD