Gene Information |
Locus tag | Msed_2095 |
Symbol | |
ID | 5104389 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 2016169 |
End bp | 2017740 |
Gene Length | 1572 bp |
Protein Length | 523 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640507985 |
Product | putative alpha-isopropylmalate/homocitrate synthase family transferase |
Protein accession | YP_001192159 |
Protein GI | 146304843 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0119] Isopropylmalate/homocitrate/citramalate synthases |
TIGRFAM ID | [TIGR00977] 2-isopropylmalate synthase/homocitrate synthase family protein |
|
|
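The length fields above agree with the coordinates if Start bp and End bp are read as 1-based inclusive positions and the final codon is a stop codon that is not counted in the protein. Both readings are assumptions here, not something the record states. A minimal sketch of that arithmetic in Python (the function names are illustrative and not part of any database API):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record for Msed_2095.
length_bp = gene_length(2016169, 2017740)
print(length_bp)                  # 1572
print(protein_length(length_bp))  # 523
```

Both results match the Gene Length (1572 bp) and Protein Length (523 aa) fields of the record.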
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.729554 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCCAAGA AATCCATAGA AATACTTGAC ACGACCCTTA GGGATGGTTC TCAAGCAGCA ACTATTTCCT TCACCCTAAG GGATAAGATA AAGATAGCCC TTCTCCTAGA TGAGCTAGGA GTGGACTATA TAGAAGGTGG ATGGCCTGGA TCTAACCCTA AGGATTACGA GTTCTTCAGG GAGATTAAGA ACTACCACCT TAAGCATTCA AAGATTGCAG CTTTCGGGAG CACTAGAAGG AAGGATTTAA GGCCAGAGAA CGACCCGAGC CTATCTTCAA TTCTAGAGGC AGACGTCAAG GTTGCGGTGC TCTTCGGTAA AACATGGACG CTTCACGTCA AGGAAATATT GAGGACCACC TTGGAAGATA ATCTCTCCAT AATAGCTGAT AGCATCCAAT TCCTGAGGGA TCATGGACTG GAGGTTATAT TCGATGCTGA ACATTTCTAT CAAGGCTTCA AGGAGGATCC AGATTATGCA ATAAGGGTAG TTAATACTGC AGGTGAGGCT GGGGCAAGAG TTGTCGCCCT AGCGGATACC AACGGGGGCA CGCTACCCCA TGAGGTACTA GAAATAACCA GACGAGTGGC CGAAAACACT AGGGTGAAGC TTGGGGTTCA CATGCACAAT GATTCGGGTA CAGCGGTGGC CAACTCATTG ATGGGGGTAG TAGGAGGGGC TAGACACGTT CAGGGAACTA TTAACGGAAT TGGTGAGAGA ACTGGAAATG CTGACCTAAT TCAGATAATA CCCAGCCTCG TTCTAAAAAT GGGGTATAGG GTACTCAAGA ATGAGGATGG TCTTAAGAAG CTAAAACAGG TGTCCTCCAC ACTCTATGAA TTGGCAGGAT TACATCCCAA CCCCTTCCAG CCCTATGTGG GAGATAACGC ATTTACCCAT AAGGCAGGAG TTCACGCCGA TGCGGTAATG AAGAACACCA GAGCATATGA ACATATAGAC CCGTCCCTAC TGGGCAATCA GAGAAAGGTA GTAATCTCAG AGCTTTCTGG GACGTCAAAC CTAGTGAATT ACCTTGAGAA ACTGGGAATC AAGGTAGAAA AGAAGGAGGA GAAATTGAAG AAGGCATTAA AGGCCATCAA GGAGATGGAG GCCCGCGGAT ACAGCTTTGA CCTGGCACCT GAATCGGCTA TGTTAGTGGC CTTCAAGGAA ATGGGAATAT ATGAAAAGTT CTTCGACGTG CAGTACTGGA AGGTTATAAA CGAGAACGGA TTAGCCCTTG CCGTAGTCAA GGTTAACTCT AGAGTTGAAG CTGCTGAGGG TACGGGACCA GTTCACGCAG TGGACGTGGC GCTGAGACGG TGCCTCTCAA AGGATTTTCC TGAAATAGAG AGAGTTAAAC TCACGGATTA CAGGGTCGTT CTACCAGGCG AGGTAAAGAA CACGGAGAGC GTAGTACGTG TGACCATCGA GTTTAACGAC GGAGAAAGGT CTTGGAGGAC TGAAGGTGTC TCAACTAGCG TTATAGAGGC ATCGATAATG GCACTTGTGG ATGGGCTGGA CTATTATCTT CAACTTAATA AGAAATTAAA GCCTCTAGTT GCAAAAGAGT AG
|
Protein sequence | MAKKSIEILD TTLRDGSQAA TISFTLRDKI KIALLLDELG VDYIEGGWPG SNPKDYEFFR EIKNYHLKHS KIAAFGSTRR KDLRPENDPS LSSILEADVK VAVLFGKTWT LHVKEILRTT LEDNLSIIAD SIQFLRDHGL EVIFDAEHFY QGFKEDPDYA IRVVNTAGEA GARVVALADT NGGTLPHEVL EITRRVAENT RVKLGVHMHN DSGTAVANSL MGVVGGARHV QGTINGIGER TGNADLIQII PSLVLKMGYR VLKNEDGLKK LKQVSSTLYE LAGLHPNPFQ PYVGDNAFTH KAGVHADAVM KNTRAYEHID PSLLGNQRKV VISELSGTSN LVNYLEKLGI KVEKKEEKLK KALKAIKEME ARGYSFDLAP ESAMLVAFKE MGIYEKFFDV QYWKVINENG LALAVVKVNS RVEAAEGTGP VHAVDVALRR CLSKDFPEIE RVKLTDYRVV LPGEVKNTES VVRVTIEFND GERSWRTEGV STSVIEASIM ALVDGLDYYL QLNKKLKPLV AKE
|
| |
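The gene sequence begins with GTG while the protein begins with M, which is the expected behavior under translation table 11, where GTG can act as an initiation codon and is then read as methionine. The sketch below illustrates this on the first 30 and last 12 nucleotides of the gene sequence. It is a simplified illustration, not a full implementation of table 11: the `START_CODONS` set is a deliberately reduced subset of the permitted initiators, and the names `CODON_TABLE`, `START_CODONS`, and `translate` are hypothetical.

```python
# Codon assignments in the usual TCAG ordering; table 11 shares these
# amino acid assignments with the standard code and differs in its start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)

# Assumption: a reduced subset of the initiation codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str, initiator: bool = True) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    If `initiator` is true and the first codon is a recognized start codon,
    it is emitted as 'M' regardless of its usual assignment.
    """
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and initiator and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-nt groups of the gene sequence, copied from the record.
head = "GTGGCCAAGA AATCCATAGA AATACTTGAC"
print(translate(head))  # MAKKSIEILD

# Last 12 nt of the gene sequence; not an initiation context.
tail = "GCAAAAGAGT AG"
print(translate(tail, initiator=False))  # AKE
```

The two outputs correspond to the first ten residues (MAKKSIEILD) and the last three residues (AKE) of the protein sequence in the record.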