Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1929 |
Symbol | |
ID | 5103316 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1875011 |
End bp | 1876729 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640507817 |
Product | acetolactate synthase catalytic subunit |
Protein accession | YP_001191993 |
Protein GI | 146304677 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | [TIGR00118] acetolactate synthase, large subunit, biosynthetic type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCTCAG GTGCTAGATT AGTAGTTGAT TCTCTCAAAA GAGAGGGAGT CAAGGTAATT TTCGGAATAC CTGGTCTAAC CAATATGCCA ATATATGACG CCTTTCTCGA GGACCTTCAG AATGGGGAGC TCAGGCACGT GTTAATGAGA CACGAACAAG CTGCAGTTCA CGCTGCGGAC GGATTTGCTA GGGCCTCAGG AAGACCTGGA GTGTGCACAG CAACTTCAGG CCCTGGTGCC ACAAATCTAG TAACGGGAAT GGTTACAGCG TACTGGGATA GCTCTCCAGT AGTGGCTCTC ACAGGTCAGG TGGTTAGACC AGTTATTGGA AAGATGGCCT TCCAGGAGGC AGACACTCCT GGGATTTTTG CCAACGCCGC CAAGTATGTG GTACAACTCA AGAATATCTA CGAAATACCC ATCTGGATCA AGAACGCCTT TTACATAGCT TCTACGGGAA GACCAGGTCC GGTAGTAGTA GATATTCCTA GGGACATTCA ACTGGAAAAG ATTGAGGACG TAAAGTGGCC AGAGAGGCCA GAGGTCAAGG GTTATAGACC TTTCAGAACA ATAATAGATC CAGTAAAGAT TAAAAGGGCA GCGGAGATTT TAGTGGAGGC CGAGAAGCCA ATTATCCTAG CTGGAACTGG GGCAGTGTGG TCCAACGCTA CTCCCGAGAT CCTAGAACTT TCTGAGCTCT TGGCCATTCC CATGGTTTCA ACACTACCAG GTAAGTCTGC AATTCCTCAC GATCATCCTC TGTTCCTGGG GGCTATGGGA TACTATGGAA GGGCAGAGGC CTCCATGGCT GCCCTCGAGT CCGATGCCAT GCTAGTGGTG GGAGCTAGGT TAAGCGATAG AACGTTTACC TCTTACGATG AGATGATTGA GACAAGGAAG AAGTTCATCA TGATTAACAT AGATCCTACG GACTCGGAGA GAGCCTTCAA GATAGACGTT CCCATGTATG GGGACGCCAA GGTCCTCTTG AGGGAAATAA TCAAGGCTGT GAGGGAATTG GGAAGAAAGA GGGACAACTC CGCATGGGTT AAGAGAGTCA AGGAATTGAG GGACTACTAC GCCCAGTTCT ACTATCACGA GGAGGACGGA AAGCTAAAAC CCTGGAAGAT CCTTAAGACC ATAAGGAACT CAATTCCTAG GGATTCCATA GTGACCACGG GTGTGGGACA GCATCAGATG TGGTCAGAGG TTTTCTGGGA GGTCCTAGAG CCCAGGACCT TCCTTTCGTC CACAGGAATG GGGACGATGG GCTTTGGACT ACCAGCAGCC ATGGGTGCCA AGATGGCTAG GCCTGATAAG GTAGTCGTGG ACCTTGACGG AGATGGATCC TTCCTCATGA CTGCCAATAA CCTCGCCACT GCTGTCGATG AACATATCCC CATCATATCA GTGATATTCG ATAACAGAAC CTTAGGCCTA GTCAGGCAGG TTCAAGATCT CTTCCAGAGC AGGAGGGTAG TGGGAGTGGA TTACGGACCT TCCCCCGATT TCGTGAAGTT CGCTGAGGCC TTTGGAGCCC TAGGATTTAA TGCCACAAGC TATGATGAAA TAGAGAGATC CATTAAGACG GCCATAAAGG AGAATATCCC CGCGGTGATT AGGGTTCCAA TAGATAAGGA AGAGTTGGCT TTGCCTACCC TACCTCCAGG AGGAAAACTT AAACAGGTGA TAGTGCGTGA CCCAAGGAAG GCTACTTAG
|
Protein sequence | MPSGARLVVD SLKREGVKVI FGIPGLTNMP IYDAFLEDLQ NGELRHVLMR HEQAAVHAAD GFARASGRPG VCTATSGPGA TNLVTGMVTA YWDSSPVVAL TGQVVRPVIG KMAFQEADTP GIFANAAKYV VQLKNIYEIP IWIKNAFYIA STGRPGPVVV DIPRDIQLEK IEDVKWPERP EVKGYRPFRT IIDPVKIKRA AEILVEAEKP IILAGTGAVW SNATPEILEL SELLAIPMVS TLPGKSAIPH DHPLFLGAMG YYGRAEASMA ALESDAMLVV GARLSDRTFT SYDEMIETRK KFIMINIDPT DSERAFKIDV PMYGDAKVLL REIIKAVREL GRKRDNSAWV KRVKELRDYY AQFYYHEEDG KLKPWKILKT IRNSIPRDSI VTTGVGQHQM WSEVFWEVLE PRTFLSSTGM GTMGFGLPAA MGAKMARPDK VVVDLDGDGS FLMTANNLAT AVDEHIPIIS VIFDNRTLGL VRQVQDLFQS RRVVGVDYGP SPDFVKFAEA FGALGFNATS YDEIERSIKT AIKENIPAVI RVPIDKEELA LPTLPPGGKL KQVIVRDPRK AT
|
| |