Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0088 |
Symbol | |
ID | 5104666 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 78253 |
End bp | 79263 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640505987 |
Product | 3-isopropylmalate dehydrogenase |
Protein accession | YP_001190189 |
Protein GI | 146302873 |
COG category | [C] Energy production and conversion [E] Amino acid transport and metabolism |
COG ID | [COG0473] Isocitrate/isopropylmalate dehydrogenase |
TIGRFAM ID | [TIGR02088] isopropylmalate/isohomocitrate dehydrogenases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00287551 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.00000627763 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTTAGGG TTTCGGTAAT TCCGGGCGAT GGAGTAGGAC CAGAAATATT TTTTGCAAGT AAGAAAATCT TAGCGAAACT TGTGGAAACG TACTCTCTCG GAATAGAGTT TATTGAGGTG GAGGCTGGGG ATTCGGCTCA AGCCAAGTAC GGGGAGGCAT TGCCAAAGAA TACTCTCAAG GTAATAGAAT CGTCTGACAT GATACTTAAG GGCCCAGTAG GCGAGTCAGC CATGGACGTA GTGGTAAAGT TAAGGCAGAT GTATGATATG TACGCCAATC TGAGGCCTGC TAAATCACTT CCAGGAGTTC CCAACAAATA CGGAAACGTG GATATCCTAA TAGTTAGGGA GAACACTGAG GATCTCTATA AGGGGTTCGA GCATGAAATC TCAGAAGGAG TCGCTGTTGG ACTTAAGGTG ATATCAGCTA TGGCTTCCAC CAGGATAGCC AACGTTGCGC TGGATTATGC AAAGAGGAGA AGAAACAAGG TTACCTGCGT TCACAAGGCT AACGTGATGA GGATAACAGA TGGGTTGTTT GCAAGGTCCT GTCGTTCCGT GCTAAAGGGG AAAGTAGAAT ATAACGAGAT GTACGTAGAT GCGGCAGCGG CTAATCTAGT GAAGGACCCA AACATGTTTG ACGTCATTAT CACCACGAAC ATGTACGGAG ATATACTGAG CGACGAGGCT TCACAAATAG CTGGAAGCTT AGGCTTAGCT CCCTCGGCAA ACATTGGGGA AAGGAAGTCG CTATTTGAAC CCGTTCATGG AGCTGCCTTC GACATAGCTG GGAAGGGAAT TGTGAACCCA ACAGCTTTTC TGCTCTCCGT GAGTATGATG CTGGAACGCA TGTATCAACT AAGTAAGGAT CAGAGATATT TACAGGCCTC ACAATCACTT ACTAACTCGA TTTACAAGGT TTATAGCGAG GGTAAAAATC TCACGCCCGA TGTGGGTGGA AGCTCCAAGT TAAGTGACAT AATTGACGCG ATATATTCGA AGCTAACATA G
|
Protein sequence | MFRVSVIPGD GVGPEIFFAS KKILAKLVET YSLGIEFIEV EAGDSAQAKY GEALPKNTLK VIESSDMILK GPVGESAMDV VVKLRQMYDM YANLRPAKSL PGVPNKYGNV DILIVRENTE DLYKGFEHEI SEGVAVGLKV ISAMASTRIA NVALDYAKRR RNKVTCVHKA NVMRITDGLF ARSCRSVLKG KVEYNEMYVD AAAANLVKDP NMFDVIITTN MYGDILSDEA SQIAGSLGLA PSANIGERKS LFEPVHGAAF DIAGKGIVNP TAFLLSVSMM LERMYQLSKD QRYLQASQSL TNSIYKVYSE GKNLTPDVGG SSKLSDIIDA IYSKLT
|
| |