Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1122 |
Symbol | |
ID | 5103594 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1052270 |
End bp | 1053256 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640507015 |
Product | L-sulfolactate dehydrogenase / malate dehydrogenase (NAD) |
Protein accession | YP_001191208 |
Protein GI | 146303892 |
COG category | [C] Energy production and conversion |
COG ID | [COG2055] Malate/L-lactate dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.658322 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGTCT TAAACAGGAT GAACGTCAGC GCAGAAGAAC TGAAATCCAT TATCGTGGAG ATATTGAACG AAAGGGAAGT CGAGGGAAGC GAGGTTATAG CGGACCACAT GGTTGAGGCA GAACTTAGGG GACACTCCTC TCACGGAGTT CAGAGATTGA TACCCCTTGT GAAGGGGGTA GAACTAGGAA CGATCTCAAG GAGATTACAA TACGACGTCC TAAAAAAGGA GAGGGGATCC ATGTGGATCG ATGGAAAACA TAGCGTCGGA ATAGTCCTGT GGAACCAATT AATCCAACAC GAGTTCGACG AACCCTCCAG TGTCATAGCA GTGAAGAATG CTTCCCACAT TGGTTTCCTA GGTTATTACA CGGATAAACT AGCCACACGA GGTTATGTGG GGATCATGTT TGGAAATGCC GAGCCCGCGG TAGTTCTCCC CGGTACCTCA AGGAAACTCC TGTCCACAAC GCCCCTCTCA ATTGGTATTC CCCATGCCCC TCCCATTGTC TTGGATATGG CCCTGTCATC CACGTCAAGG GGGAAAATAT TGGAGGCCCA GAGAAAGGGC GAACCCATAC CCGAGGGATG GGCAGTGGAC AGCGAGGGAA AACCGACCAC CGATCCTGAG CTTGCCCTAA GGGGTGGGAT ACTCCCAATC GGGGGAATAA GGGGTTTCTA CCTCATGCTA GTCCTTGAAA TACTGACGTC ATTCATTTCG GGATCGGCAA TGGGTCCTAA CGTGAAGGGG GTACTTAACA CTGAAAATCC TCCTAATAAG GGAGAAATAT TGATTGTAAT AAATCCATTC TACTTTGCGT CTTACACTGA ACAGATCGAT GAAATCAGGA AACTTCTGGG AATGGAGTTC CCTGGGGAGC ACGGCCTTCA ATTGAAGGCC AGGAGGCTGA CCGAGGGCAT ACCCATTGAT GCACAGCTTT GGAGTACTCT GGTCAAGATG AAGGAAAAGA TTCCCTTCTT TATCTAG
|
Protein sequence | MSVLNRMNVS AEELKSIIVE ILNEREVEGS EVIADHMVEA ELRGHSSHGV QRLIPLVKGV ELGTISRRLQ YDVLKKERGS MWIDGKHSVG IVLWNQLIQH EFDEPSSVIA VKNASHIGFL GYYTDKLATR GYVGIMFGNA EPAVVLPGTS RKLLSTTPLS IGIPHAPPIV LDMALSSTSR GKILEAQRKG EPIPEGWAVD SEGKPTTDPE LALRGGILPI GGIRGFYLML VLEILTSFIS GSAMGPNVKG VLNTENPPNK GEILIVINPF YFASYTEQID EIRKLLGMEF PGEHGLQLKA RRLTEGIPID AQLWSTLVKM KEKIPFFI
|
| |