Gene Msed_0031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0031 
Symbol 
ID5105170 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp27200 
End bp28486 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content46% 
IMG OID640505925 
Productmalate dehydrogenase 
Protein accessionYP_001190132 
Protein GI146302816 
COG category[C] Energy production and conversion 
COG ID[COG0281] Malic enzyme 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.399579 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.119769 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCGTAA CTGATTTCGA GAGTATGGCT TTGGACGTTG CAGTGAGATA CAAAGGGAAA 
ATCCAGGTAA TGCCTAAGGT TCCTGTGAAC TCCCTGAACG ATTTCTCTAT ACTTTATACT
CCCGGGGTGG CGGCCGTTTC TCAGGCTATA CACAAGAATA GAGAGCTATC ATTTCATTAT
ACATATAGGT GGAATGCTAT CGCAGTGGTC ACAGATGGAT CCAGGGTACT GGGTTTAGGG
GATATTGGTC CTGAGGCTGC CATGCCAGTC ATGGAAGGTA AAGCACTCAT CTTCAAGTTT
CTAGGTGGGG TAGATGCTAT ACCCCTTCCC TTAGGGACAA AGGACGCAGA CAAGATAGTG
GAGACAGTTA AGTTATTGGA ACCTGCATTC GGAGGAATCA ACCTTGAGGA CATAGAATCC
CCTAAGTGCT TCTACGTTCT TGAGAGACTC AAGAGCATTA TGGAGATCCC TGTTTGGCAT
GACGATCAGC AAGGTACTGC AGGCGCTACC TTGGCGGGAT TGATTACAGC CCTTGAAATA
ACTGGGAAGA ACCCTAAGAA CATCAAGATT GTTCTCTTTG GTACAGGTGC AGCTAACATA
GCTACCGCGC GATTGCTTGG AAAATTTGGA ATTCCTCTAA AAAACATAGT TCTTGTAGAT
TCCGCAGGAG TCATATACAG GGGGAGACAG GACGAGGAAA GAATGAAAAC AGAAAATCCC
TGGAAGTACG AGTTACTTAG GGAAACCAAT GGAGAAAATG TGACCACTAT AGAGGATGCC
TTCAAGGGAG CTGACGTTGT GATAGCGGCC TCAAAGCAGG GACCAGATGT GATTAAGAAG
AGTTGGATAA AGCTCATGAA TACTGACCCC ATAGTATTTG CTCTAGCTAA TCCAACACCT
GAAATATGGC CAGAGGAGGC AAAGGAAGCC GGAGCTAAAA TTGTGGCGAC TGGAAGAAGT
GATTTCCCGA ACCAAGTCAA TAACTCCCTC ATTTTCCCAG GTGTTTTCAG GGGTTCCCTA
GATGTTAGGG CCAAGGCGAT AACGGACGAG ATGGTGATTG ATGCAGCAAG GGAGCTGGCC
AGTCACGTAA GGGAGAAAGG AGCAACGCCT GACTATATAA TTCCCAAGAT GACGGAGTGG
GAAATATATC CTCGTGTAGC GGCAGCCGTG GGTGTAAGGG CCATACAGCA GAACGTCGCT
AGAGTGTCTA GAAATTATAA TGAACTATTT GACAACGCTA AGACCTTGAT CGAGAAGGCA
AGAACCCAGT TAAGATCTAT AGCTTAA
 
Protein sequence
MVVTDFESMA LDVAVRYKGK IQVMPKVPVN SLNDFSILYT PGVAAVSQAI HKNRELSFHY 
TYRWNAIAVV TDGSRVLGLG DIGPEAAMPV MEGKALIFKF LGGVDAIPLP LGTKDADKIV
ETVKLLEPAF GGINLEDIES PKCFYVLERL KSIMEIPVWH DDQQGTAGAT LAGLITALEI
TGKNPKNIKI VLFGTGAANI ATARLLGKFG IPLKNIVLVD SAGVIYRGRQ DEERMKTENP
WKYELLRETN GENVTTIEDA FKGADVVIAA SKQGPDVIKK SWIKLMNTDP IVFALANPTP
EIWPEEAKEA GAKIVATGRS DFPNQVNNSL IFPGVFRGSL DVRAKAITDE MVIDAARELA
SHVREKGATP DYIIPKMTEW EIYPRVAAAV GVRAIQQNVA RVSRNYNELF DNAKTLIEKA
RTQLRSIA