Gene Msed_1971 details


Gene Information

Locus tag: Msed_1971
Symbol:
ID: 5103358
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1907040
End bp: 1908389
Gene Length: 1350 bp
Protein Length: 449 aa
Translation table: 11
GC content: 50%
IMG OID: 640507859
Product: dihydrolipoamide dehydrogenase
Protein accession: YP_001192035
Protein GI: 146304719
COG category: [C] Energy production and conversion
COG ID: [COG1249] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide dehydrogenase (E3) component, and related enzymes
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACTTTG ATGTAATAGT AATAGGCGGA GGAGTGGCAG GTGTATCTGC TGCTCTAAGG 
GCTTCAGAAC TCGGCAAGTC AGTGGCTTTA GTTGAGAGGG ATCAGGTGGG AGGGGAATGC
ATAAATAGGG CATGCATACC ATCTAAAACC CTCATCGACG CAGTGAAAAC GGTAAACAGG
GTCTCCTCCT CTCCGTGGAT AGTGTCCTCG GCAACCTTGG ACTATGCTAA ACTCAACGAG
AACAAGGCAA GGATAATAAC TGCAATTAAG GATAGAATGG AGCACAACCT TAACGCCAGG
AATGTGAAGG TGATCAAGGG CAACGCCAAG ATAAAGGCCC AGGGAGAGGT GGAGGTAGAC
GGAAGGACTA TAACAGGTGA CCATCTTGTT TTGTCTACGG GGTCGGTTCC TCTTTCCTTA
CCCGATTTTC CCCTCAATGG GAGGAACGTT TTGGATCCAT GGACTGCTAT GAACTTGAAG
GAGATAAAGA ACAGGATAGT GATCGTAGGT GGAGGGGTTG CAGGGGTTGA GCTTGCGACC
TTGTTCAGGG CCCTGAACAA GGAGGTAACA ATCCTAGAGT TGATGCCCCA GTTACTCCCT
GGATTTGATA GGGATCTGGC CTCAGCTACT AAGAAGAGGT TAGAGGAAAA GGGTATCAGG
ATATACCTCA ACGCCAAGTC GAAGATTGTG AACTCTGAGG GTACAGTGAA GTTCTCCGTT
AACTTGCCCA ATGCCTCTGA GGAGGTGGAG GGTGACCTAG CAGTTGTCAC CATAGGAAGG
AAAGCCAGCA CTGAGAACCT GAACCTGAAG GAGATTGGCG TTGAGACGGA TCAGAGGGGG
TACGTGAAGG TTGATGAAAG GGGAAGGACC ACCAATCCCA AGGTCTTTGC TGCAGGTGAC
GTGGCTGGTG TTCCCCTATC CGCTACTAAG GCCTGGAGGC AGGGAATTGT TGCGGGGGAT
AACATAGGAG GGAAGGAAAG CAAGATGCCG AAGTATATCC CGTCTTCAAT TTTCGCAGAC
ATGGAAATAG GAACAGTGGG AAAGACCCTC GACGACCTGA AGAAGGCGGG AATAGAGGCC
AGGGAAATAA TGGTCGAGAT GAGAGACATA CCTAGGGCCT GGACCCTCAA TGAGACCGAT
GGGTTCCTTA AGCTGGTAGT TGCAGGAAAC AAGATTGAGG GAGCGCACAT GATAGGAGAG
GGAGCTACTG AGGTCATCAA CACTATGGCG TTGGCCATGG AACTTGGGAT CACCACCACC
CAATTGTACT CGGTTACCTT CTCCCATCCC ACTGTGAGCG AAGTGATTGG CGAAGCAATA
CAGAGACTAA CTCATGGAGA GATATATTAA
 
Protein sequence
MDFDVIVIGG GVAGVSAALR ASELGKSVAL VERDQVGGEC INRACIPSKT LIDAVKTVNR 
VSSSPWIVSS ATLDYAKLNE NKARIITAIK DRMEHNLNAR NVKVIKGNAK IKAQGEVEVD
GRTITGDHLV LSTGSVPLSL PDFPLNGRNV LDPWTAMNLK EIKNRIVIVG GGVAGVELAT
LFRALNKEVT ILELMPQLLP GFDRDLASAT KKRLEEKGIR IYLNAKSKIV NSEGTVKFSV
NLPNASEEVE GDLAVVTIGR KASTENLNLK EIGVETDQRG YVKVDERGRT TNPKVFAAGD
VAGVPLSATK AWRQGIVAGD NIGGKESKMP KYIPSSIFAD MEIGTVGKTL DDLKKAGIEA
REIMVEMRDI PRAWTLNETD GFLKLVVAGN KIEGAHMIGE GATEVINTMA LAMELGITTT
QLYSVTFSHP TVSEVIGEAI QRLTHGEIY
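
The length fields in this record can be cross-checked against the coordinates and the sequences. The Python sketch below is illustrative only: the helper names (`clean_sequence`, `gc_fraction`, `span_length`, `expected_cds_length`) are hypothetical and not part of any database tool. It assumes inclusive start and end coordinates and an unspliced CDS that ends in a single stop codon, and it uses only the first line of the gene sequence as an excerpt.

```python
def clean_sequence(block: str) -> str:
    """Strip the spaces and line breaks that group residues in blocks of ten."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming both coordinates are inclusive."""
    return end_bp - start_bp + 1


def expected_cds_length(protein_length: int) -> int:
    """CDS length in bp, assuming no splicing: 3 bases per residue plus one stop codon."""
    return 3 * (protein_length + 1)


# Excerpt: first line of the gene sequence shown in this record.
excerpt = "ATGGACTTTG ATGTAATAGT AATAGGCGGA GGAGTGGCAG GTGTATCTGC TGCTCTAAGG"
dna = clean_sequence(excerpt)

print(len(dna))                          # 60 bases in the excerpt
print(gc_fraction(dna))                  # GC fraction of the excerpt only
print(span_length(1907040, 1908389))     # 1350, matches the Gene Length field
print(expected_cds_length(449))          # 1350, matches the Gene Length field
```

Running `clean_sequence` on the full gene sequence block and passing the result to `gc_fraction` would give a value to compare with the GC content field. The excerpt alone is not expected to match that whole-gene figure.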