Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1054 |
Symbol | |
ID | 5104436 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 982416 |
End bp | 983684 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640506950 |
Product | malate dehydrogenase |
Protein accession | YP_001191143 |
Protein GI | 146303827 |
COG category | [C] Energy production and conversion |
COG ID | [COG0281] Malic enzyme |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.767504 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACCTG TAGAGATATC CCTCAAGTAC CAGGGAAAGA TTGAGGTGAT GCCCAAGGTT CCCATCTCCA GTTACGACGA CTTCTCGGTG ATATATACCC CTGGAGTTGC TGAGGTAGTC AAGGAGATTT CTAAGGACAA GGACAAGAGT TTCCAATTAA CAAGTAGGTG GAATAACGTA GCTATTATCA CGGATGGTAC CAGGGTTTTG GGTCTAGGTA ACGTTGGCCC AGAGGCATCT CTTCCCGTGA TGGAGGGAAA GGCACTTCTC TTCAAGTACC TGGGAGGCGT AGACGCTATT CCCTTGCCTC TGGCTGTGAG GGATCCTGAC ACCATCATAA ACGTAGTTTC TGCTCTAGAA CCGTCTTTCG GTGGAATAAA CCTGGAGGAC GTAGAGAGTC CCAAGTGTTT CTACATCCTT GAAAAACTTC AGGAAAGGAT GAACATCCCA GTGTGGCATG ACGATCAACA GGGAACCGCT GGAGCTGTAC TTGCTGCCCT CATAAACGCC ATGAAAGTCG CAGGAAAGGG ACTGGACAGC AAGATAGTGA TATTTGGAGC CGGTGCAGCT AACATTGCTA CCGTGAGACT CCTCAAGGCA TATGGTTTCG ACCCCAAGAG GATGATAGTG GTTGATAGGG AAGGCGTGCT TCACGCTGAG AGGAGGGATC TGGACGCAAT GATGTTCAGT CATAAATGGA AGTATGAAAT TGCGGTCACG ACTAACGGGT TCAACATTAC CACGTTGGAT GAGGCATTCA AGGGAGCAGA TATTCTAATT GCTGCCTCTA TGCCAGGTCC GAACACCATT CCTAAGAGAT GGATTAGCCT CATGAAGGAC CCCATTGTCT TTGCCTTGGC CAATCCCGTC CCTGAGATCT ATCCTTCGGA CGCGATAGAC GCCGGAGCCA AGGTGGTGGC CACGGGCAGA AGCGACTTCC CTAACCAGGT GAACAACTCG CTCATTTTCC CTGGGGTTTT CAGGGGAGTC CTGGACAGTA GGTCCTCCAA GGTTGATGAT GCCATGGTCA TAGCGGGAGC GGAGGCCTTA GCGAGATTCG CCGAGAGGAA GGGTATATCG CCCACCTACA TTATCCCCAG GATGGATGAA TGGGACGCCT ATTACGAGTT GGCCTCTGCA GTTGCGGAAA AGGCGGTGGA AAGGGGATAC GCCAGGGTAA GGCTAAGTAG GGAGGAGTTT AGACTGATGG CCAAAACTAA GATAGAGCAG ACCAGGAATA AGATAAGGGC GATCCAGAAT GTTGATTAG
|
Protein sequence | MKPVEISLKY QGKIEVMPKV PISSYDDFSV IYTPGVAEVV KEISKDKDKS FQLTSRWNNV AIITDGTRVL GLGNVGPEAS LPVMEGKALL FKYLGGVDAI PLPLAVRDPD TIINVVSALE PSFGGINLED VESPKCFYIL EKLQERMNIP VWHDDQQGTA GAVLAALINA MKVAGKGLDS KIVIFGAGAA NIATVRLLKA YGFDPKRMIV VDREGVLHAE RRDLDAMMFS HKWKYEIAVT TNGFNITTLD EAFKGADILI AASMPGPNTI PKRWISLMKD PIVFALANPV PEIYPSDAID AGAKVVATGR SDFPNQVNNS LIFPGVFRGV LDSRSSKVDD AMVIAGAEAL ARFAERKGIS PTYIIPRMDE WDAYYELASA VAEKAVERGY ARVRLSREEF RLMAKTKIEQ TRNKIRAIQN VD
|
| |