Gene Msed_1122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1122 
Symbol 
ID5103594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1052270 
End bp1053256 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content49% 
IMG OID640507015 
ProductL-sulfolactate dehydrogenase / malate dehydrogenase (NAD) 
Protein accessionYP_001191208 
Protein GI146303892 
COG category[C] Energy production and conversion 
COG ID[COG2055] Malate/L-lactate dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.658322 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGTCT TAAACAGGAT GAACGTCAGC GCAGAAGAAC TGAAATCCAT TATCGTGGAG 
ATATTGAACG AAAGGGAAGT CGAGGGAAGC GAGGTTATAG CGGACCACAT GGTTGAGGCA
GAACTTAGGG GACACTCCTC TCACGGAGTT CAGAGATTGA TACCCCTTGT GAAGGGGGTA
GAACTAGGAA CGATCTCAAG GAGATTACAA TACGACGTCC TAAAAAAGGA GAGGGGATCC
ATGTGGATCG ATGGAAAACA TAGCGTCGGA ATAGTCCTGT GGAACCAATT AATCCAACAC
GAGTTCGACG AACCCTCCAG TGTCATAGCA GTGAAGAATG CTTCCCACAT TGGTTTCCTA
GGTTATTACA CGGATAAACT AGCCACACGA GGTTATGTGG GGATCATGTT TGGAAATGCC
GAGCCCGCGG TAGTTCTCCC CGGTACCTCA AGGAAACTCC TGTCCACAAC GCCCCTCTCA
ATTGGTATTC CCCATGCCCC TCCCATTGTC TTGGATATGG CCCTGTCATC CACGTCAAGG
GGGAAAATAT TGGAGGCCCA GAGAAAGGGC GAACCCATAC CCGAGGGATG GGCAGTGGAC
AGCGAGGGAA AACCGACCAC CGATCCTGAG CTTGCCCTAA GGGGTGGGAT ACTCCCAATC
GGGGGAATAA GGGGTTTCTA CCTCATGCTA GTCCTTGAAA TACTGACGTC ATTCATTTCG
GGATCGGCAA TGGGTCCTAA CGTGAAGGGG GTACTTAACA CTGAAAATCC TCCTAATAAG
GGAGAAATAT TGATTGTAAT AAATCCATTC TACTTTGCGT CTTACACTGA ACAGATCGAT
GAAATCAGGA AACTTCTGGG AATGGAGTTC CCTGGGGAGC ACGGCCTTCA ATTGAAGGCC
AGGAGGCTGA CCGAGGGCAT ACCCATTGAT GCACAGCTTT GGAGTACTCT GGTCAAGATG
AAGGAAAAGA TTCCCTTCTT TATCTAG
 
Protein sequence
MSVLNRMNVS AEELKSIIVE ILNEREVEGS EVIADHMVEA ELRGHSSHGV QRLIPLVKGV 
ELGTISRRLQ YDVLKKERGS MWIDGKHSVG IVLWNQLIQH EFDEPSSVIA VKNASHIGFL
GYYTDKLATR GYVGIMFGNA EPAVVLPGTS RKLLSTTPLS IGIPHAPPIV LDMALSSTSR
GKILEAQRKG EPIPEGWAVD SEGKPTTDPE LALRGGILPI GGIRGFYLML VLEILTSFIS
GSAMGPNVKG VLNTENPPNK GEILIVINPF YFASYTEQID EIRKLLGMEF PGEHGLQLKA
RRLTEGIPID AQLWSTLVKM KEKIPFFI