Gene Msed_0632 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0632 
Symbol 
ID5103792 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp577891 
End bp578862 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content47% 
IMG OID640506536 
Productthioredoxin reductase (NADPH) 
Protein accessionYP_001190731 
Protein GI146303415 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0492] Thioredoxin reductase 
TIGRFAM ID[TIGR01292] thioredoxin-disulfide reductase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.00998267 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.636604 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCTCA TTCCACGCTC AACGAATATT AACCCAAACG AGAAATTTGA TACCATCATT 
ATTGGTCTTG GTCCAGCAGC CTACAGTGCT GCACTATACG CAGCTAGGTA CATGCTAAAG
ACGCTTGTAA TAGGGGAGAC GCCAGGAGGT CAGCTAACTG AGGCAGGAGA AGTGGATGAC
TACCTGGGCC TCATTGGCGT TCAAGCGTCG GAAATGATAA AGTTATTCAA CGCTCATGTA
GAGAAATACA AGGTACCTGT TCTCATGGAC AGGGTAGAGT CCTTTAAAAG AGAGGGCGAG
GAATATGTGG TCAAGACCAA GAGAAAGGGA GAGTTTAGGG CTTCTACACT AATAGTAGCA
GTGGGAACCA AGAGGAGGAA ACTTAATGTT CCAGGTGAGA ACGAGTTTAT AGGTAGGGGT
GTCTCCTACT GCTCCGTGTG TGACGCCCCA CTCTTCAAGA ATAGGCCAGT AGTTGTAGTG
GGGGGAGGAA ACTCGGCGTT GGACGGGGCC GAGCTACTTA GCAGGTACGC GACCAAGGTT
TACCTTGTGC ATAGAAGGGA AGAGTTTAGG GCTCAACCAA TAATAGTGAA ACTGGTTAAG
GAGAAACCAA ATGTGGAGTT GATCCTGAAC TCCGTGGTCA AGGAAATTAA GGGAGATAAG
CTTGTCAGGA AAGTTGTGGT ACAGAATATG AAAACCGGCG AGGTAAGGGA GATAGATGCC
AATGGAATAT TCGTGGAAAT AGGATTTGAA CCCCCCACTG AGTTCGCTAA GATTAACGGA
CTAGAGGTGG ACGAACAGGG TTACATAAAG GTAGACGACT GGACAAGGAC TAACCTACCA
GGGGTTTTCG CTGCAGGGGA CTGCACCAAC AAGTGGATTG GATTCAGACA GGTTGCAACG
TCAACAGCAA TGGGCGCGGT TGCGGCACAC TCAGCTTATA ACTATTTGAA CGAGAGAAAA
GGTAAAACAT GA
 
Protein sequence
MSLIPRSTNI NPNEKFDTII IGLGPAAYSA ALYAARYMLK TLVIGETPGG QLTEAGEVDD 
YLGLIGVQAS EMIKLFNAHV EKYKVPVLMD RVESFKREGE EYVVKTKRKG EFRASTLIVA
VGTKRRKLNV PGENEFIGRG VSYCSVCDAP LFKNRPVVVV GGGNSALDGA ELLSRYATKV
YLVHRREEFR AQPIIVKLVK EKPNVELILN SVVKEIKGDK LVRKVVVQNM KTGEVREIDA
NGIFVEIGFE PPTEFAKING LEVDEQGYIK VDDWTRTNLP GVFAAGDCTN KWIGFRQVAT
STAMGAVAAH SAYNYLNERK GKT