Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0031 |
Symbol | |
ID | 5105170 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 27200 |
End bp | 28486 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640505925 |
Product | malate dehydrogenase |
Protein accession | YP_001190132 |
Protein GI | 146302816 |
COG category | [C] Energy production and conversion |
COG ID | [COG0281] Malic enzyme |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.399579 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.119769 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCGTAA CTGATTTCGA GAGTATGGCT TTGGACGTTG CAGTGAGATA CAAAGGGAAA ATCCAGGTAA TGCCTAAGGT TCCTGTGAAC TCCCTGAACG ATTTCTCTAT ACTTTATACT CCCGGGGTGG CGGCCGTTTC TCAGGCTATA CACAAGAATA GAGAGCTATC ATTTCATTAT ACATATAGGT GGAATGCTAT CGCAGTGGTC ACAGATGGAT CCAGGGTACT GGGTTTAGGG GATATTGGTC CTGAGGCTGC CATGCCAGTC ATGGAAGGTA AAGCACTCAT CTTCAAGTTT CTAGGTGGGG TAGATGCTAT ACCCCTTCCC TTAGGGACAA AGGACGCAGA CAAGATAGTG GAGACAGTTA AGTTATTGGA ACCTGCATTC GGAGGAATCA ACCTTGAGGA CATAGAATCC CCTAAGTGCT TCTACGTTCT TGAGAGACTC AAGAGCATTA TGGAGATCCC TGTTTGGCAT GACGATCAGC AAGGTACTGC AGGCGCTACC TTGGCGGGAT TGATTACAGC CCTTGAAATA ACTGGGAAGA ACCCTAAGAA CATCAAGATT GTTCTCTTTG GTACAGGTGC AGCTAACATA GCTACCGCGC GATTGCTTGG AAAATTTGGA ATTCCTCTAA AAAACATAGT TCTTGTAGAT TCCGCAGGAG TCATATACAG GGGGAGACAG GACGAGGAAA GAATGAAAAC AGAAAATCCC TGGAAGTACG AGTTACTTAG GGAAACCAAT GGAGAAAATG TGACCACTAT AGAGGATGCC TTCAAGGGAG CTGACGTTGT GATAGCGGCC TCAAAGCAGG GACCAGATGT GATTAAGAAG AGTTGGATAA AGCTCATGAA TACTGACCCC ATAGTATTTG CTCTAGCTAA TCCAACACCT GAAATATGGC CAGAGGAGGC AAAGGAAGCC GGAGCTAAAA TTGTGGCGAC TGGAAGAAGT GATTTCCCGA ACCAAGTCAA TAACTCCCTC ATTTTCCCAG GTGTTTTCAG GGGTTCCCTA GATGTTAGGG CCAAGGCGAT AACGGACGAG ATGGTGATTG ATGCAGCAAG GGAGCTGGCC AGTCACGTAA GGGAGAAAGG AGCAACGCCT GACTATATAA TTCCCAAGAT GACGGAGTGG GAAATATATC CTCGTGTAGC GGCAGCCGTG GGTGTAAGGG CCATACAGCA GAACGTCGCT AGAGTGTCTA GAAATTATAA TGAACTATTT GACAACGCTA AGACCTTGAT CGAGAAGGCA AGAACCCAGT TAAGATCTAT AGCTTAA
|
Protein sequence | MVVTDFESMA LDVAVRYKGK IQVMPKVPVN SLNDFSILYT PGVAAVSQAI HKNRELSFHY TYRWNAIAVV TDGSRVLGLG DIGPEAAMPV MEGKALIFKF LGGVDAIPLP LGTKDADKIV ETVKLLEPAF GGINLEDIES PKCFYVLERL KSIMEIPVWH DDQQGTAGAT LAGLITALEI TGKNPKNIKI VLFGTGAANI ATARLLGKFG IPLKNIVLVD SAGVIYRGRQ DEERMKTENP WKYELLRETN GENVTTIEDA FKGADVVIAA SKQGPDVIKK SWIKLMNTDP IVFALANPTP EIWPEEAKEA GAKIVATGRS DFPNQVNNSL IFPGVFRGSL DVRAKAITDE MVIDAARELA SHVREKGATP DYIIPKMTEW EIYPRVAAAV GVRAIQQNVA RVSRNYNELF DNAKTLIEKA RTQLRSIA
|
| |