Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1037 |
Symbol | |
ID | 5104337 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 961839 |
End bp | 963494 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640506933 |
Product | acetolactate synthase, large subunit |
Protein accession | YP_001191126 |
Protein GI | 146303810 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0844069 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGGTT CAACTCTTCT TCTAGAACTA CTGAAGGATT ACGACGTGGA TAGGGTTTTT GGACTTCCTG GAGAGACATC TATCCCATAC TACCCCGAAT TCGCAGAGCT TCAGGTGATA ACTAGGGATG AGAGGAACGC CGTCTACATG GCTGACGCCT ATGCCAGGGT TAGTTTCAAG CCGGGAGTGG TTGAGGGACC GAGCGTTGGC TCGCCCTACA TGTTACCAGG TGTGATAGAG GCATACAAGT CCTCCTCTCC CGTGATAGTC ATCACCACGG ATACTGACCT CTACGGAGAG AGGATGAACA TGTTGACTTC CCTGGATCAG ACAGCCCTCT TCAAACCCTA CACCAAGGAG TCCATCACCG TGACGAAGGC AGACGACCTG TCCCACGCCG TGAGGAGGGC CTTCAGGTTA GCAACCGGAG GGAGACCTGG ACCAGTTCAC CTAAGGATAC CCCATCATGT ACTCGAGGAG GAGGGATCCA TCTACCTCCC ACCGCAGAGG GAGTTCTCCA GGTATCCAGC TCAAAGGCCC GTCGCAGACC GGGATGCGGT GAGGCTCGCG GTCTCAGCCC TCCTGGACAG TTCCAACCCG GTCATTATCT GCGGTCAAGG AGCACTGTAC TCCAGGGCGT GGGATGAGGT CGTGGAGTTG GCTGAGCTCA TGGGAATTCC AGTGGGTACC ACCATCACGG GGAAGGGATG CATCTCGGAG CTTCACCCCC TCTCCATAGG GGTAGTGGGA GGAAGAGGAG GGACTAGCTT TTCAAATTCC TTTCTGGAGG AGGCCGATCT AATCTTCCTT GTGGGATCAA ACACGGACTC AGCCAACACA GATAGGTGGA GATATCCTCC CAGGACGAAG ACCGTGATCC ATCTAGATGT GAGTGAGGCT GAAGTGGGGA ACAACTACAA CTCCATAAAC CTGATAGGGG ACGCTAAGGC AACGCTTAGG GAGATAATCA GGGAGGTAAG ATCCCGGGGA GTGAAGAGAA GGGAAGTGAA AGTGAATAGG GACGAGTTTG AGGCCAGGGT GAGGGAGATT GCCTCCATGT CCGGGGAAAG GGTCAACCCG GTCAGGTTTG TGAAGGAGTT GGAGAGGAGG GTTAGGGATC AGGTAATAGT AGCAGACCCT GGGGTAGGTG CAATTTACGT CTCCGCCCTT TTCAGAACTG GGAAGGCTGG AAGAAACTTC GTGTTCAACT ACGGCCTTGG GGGACTGGGT TACGCGATAC CTGCCTCAGT TGGGGCCCAA CTGGGATCTG GTAGACAGGT TCTTGCCATG ACCGGGGATG GTAGCTTTGG GTTCTCTGCG GGAGAGCTGG AGACAATTGC TAGGTTGAAA AGTGACGTGG TCTTGTTTGT GTTCAATAAC TCCAGTTTCG GCTGGATAAG GGCAGAAATG AGGATTCAGG GTAGGGATGT GAGGGGGACC GACTTCTCGT CGCTGGATTA CGTTAAGATA GCTGAGGGAT TCGGGCTCAG GGGTTACAGG ATCTCCACGG ACCAAGAGAT AGGCGATGTG TTGGACGAGG CCATGGAGAG CACTCCCAGT CTGGTTGAGG TAGTCGTGGA TCCTGAGGAT AAGTTCTACC CACCTGTGGC ACACTGGGCT AGGGCACTAC TCCACGACGT GAAACATGTA TATTGA
|
Protein sequence | MKGSTLLLEL LKDYDVDRVF GLPGETSIPY YPEFAELQVI TRDERNAVYM ADAYARVSFK PGVVEGPSVG SPYMLPGVIE AYKSSSPVIV ITTDTDLYGE RMNMLTSLDQ TALFKPYTKE SITVTKADDL SHAVRRAFRL ATGGRPGPVH LRIPHHVLEE EGSIYLPPQR EFSRYPAQRP VADRDAVRLA VSALLDSSNP VIICGQGALY SRAWDEVVEL AELMGIPVGT TITGKGCISE LHPLSIGVVG GRGGTSFSNS FLEEADLIFL VGSNTDSANT DRWRYPPRTK TVIHLDVSEA EVGNNYNSIN LIGDAKATLR EIIREVRSRG VKRREVKVNR DEFEARVREI ASMSGERVNP VRFVKELERR VRDQVIVADP GVGAIYVSAL FRTGKAGRNF VFNYGLGGLG YAIPASVGAQ LGSGRQVLAM TGDGSFGFSA GELETIARLK SDVVLFVFNN SSFGWIRAEM RIQGRDVRGT DFSSLDYVKI AEGFGLRGYR ISTDQEIGDV LDEAMESTPS LVEVVVDPED KFYPPVAHWA RALLHDVKHV Y
|
| |