Gene Msed_1118 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1118 
Symbol 
ID5103591 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1045882 
End bp1047240 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content42% 
IMG OID640507012 
ProductD-lactate dehydrogenase (cytochrome) 
Protein accessionYP_001191205 
Protein GI146303889 
COG category[C] Energy production and conversion 
COG ID[COG0277] FAD/FMN-containing dehydrogenases 
TIGRFAM ID[TIGR00387] glycolate oxidase, subunit GlcD 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.410973 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGGATTT CTTTGTCCGA AAACATTCTA AGGGAATTGG ATGGAATTAA GTGTTCGTTG 
GAAGAAAGGA CAGATTTCCT CAATAATAGA GTAAGACCAG TCGTGGTGAC TTACCCATCT
AGAACTGAGG AAGTAGCGAA AATAGTAAAT ATTGCAAGAG AACATGGTTT ACCCATCGTA
GTCTGGGGTG GAGGTACCAG TTTAGCCGGG CACTTGGTAT GTGATGGTTG TATTTTAATT
GACATGAAGT TCATGGACAA GATAGTCGAG ATAAATGATA CGGAATGGTA CGTGAGAGTA
CAACCGGGCT TGATCCTTTC AAAATTAAAT GATGAGCTTA AAAAAATCGG CTTCTTCATA
CCTCCCGAGC CTGCTAGTTC CTTTGCATGT TCCGTGGGAG GAGTAGTCAA TAACGCCTCA
GGAGGTATGC GAAGTGTAAG GTACGGCACT TTCAGAGACT GGGTGCTTGC CTTAGAGGTG
GTTTTACCAT CTGGAAAGGT GATAAGAGTT GGTGAGCCGT TCGTCAAGAA TAGAGCCGGG
TACGACTTAG TTCACCTCTT CGTGGGTAGT GAGGGAACGC TTGGCATAGT GACGGAGATT
TGGTTCAAGA TCATTCCTGT CCCTGAAGAG GTAAAATACT CGATCATGAT GGAACTGTCT
GACTTCAGAC AAGGTACCGA GATAATCAGG GAACTTAGAA AGAATCGCGT CGTTATAGAT
GTGGCAGAAT ATATGGATGG ATTAGTAGCT AAAACAATAA ATAAACATTT TAATACTAAT
ATACCGGAGA GCGTCGGTGG GACGATTACA CTATCCTCTT CTTCGACTTA TCGAGAAAAA
ATTGAAAAAG TGTTAAGACA GCACTCTATT ACATTCACAG AGGTGGATGA GGATAAAACT
CTATCGGAAA GAGCCTTGGC AGGACTGGCC CTAAAGGCTG AGTGGAACGA AAGAGTTTCG
GAGGACATTG TTGTGCCCCT ATCTAAACTT GATGAAGCTT TTATGAAAAT TAAGGAACTT
GAGGAAAAGA GCGGCGTTAA GATAGCCATT TTGGGGCACA TAGCTGACGG AAATTTACAC
CCAAATATTC TGATCTCGAG TAGAGACGAT CCTCGACTTA CGAAAATCTA TGACGAGATA
GGAAGGATAG CAATAGTACT AGGAGGATCA ATTTCGGGTG AACATGGAAT AGGCTACATG
AAAGCTGATT TAATGAAGGA ACAGTTAACA GCTCATAACG GCATTGAGGT TCTTAAAATC
ATGAATGACA TTAAAGGTTG TATCGATCCG CACCACTTTA TGAATCCTGG CAAGTTCGTT
GAGCTAGCCT GGAGTCGTTA CCTAATTAAT AAGGATTAA
 
Protein sequence
MWISLSENIL RELDGIKCSL EERTDFLNNR VRPVVVTYPS RTEEVAKIVN IAREHGLPIV 
VWGGGTSLAG HLVCDGCILI DMKFMDKIVE INDTEWYVRV QPGLILSKLN DELKKIGFFI
PPEPASSFAC SVGGVVNNAS GGMRSVRYGT FRDWVLALEV VLPSGKVIRV GEPFVKNRAG
YDLVHLFVGS EGTLGIVTEI WFKIIPVPEE VKYSIMMELS DFRQGTEIIR ELRKNRVVID
VAEYMDGLVA KTINKHFNTN IPESVGGTIT LSSSSTYREK IEKVLRQHSI TFTEVDEDKT
LSERALAGLA LKAEWNERVS EDIVVPLSKL DEAFMKIKEL EEKSGVKIAI LGHIADGNLH
PNILISSRDD PRLTKIYDEI GRIAIVLGGS ISGEHGIGYM KADLMKEQLT AHNGIEVLKI
MNDIKGCIDP HHFMNPGKFV ELAWSRYLIN KD