Gene Msed_1546 details


Gene Information

Locus tag: Msed_1546
Symbol:
ID: 5103991
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1505672
End bp: 1507006
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 49%
IMG OID: 640507432
Product: hypothetical protein
Protein accession: YP_001191625
Protein GI: 146304309
COG category: [C] Energy production and conversion
COG ID: [COG2048] Heterodisulfide reductase, subunit B
TIGRFAM ID:
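
The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and both inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Neither assumption is stated in the record, but both agree with the listed values. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1505672
END_BP = 1507006


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature with inclusive start and end coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, excluding one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1335
print(protein_length(length_bp))  # 444
```

Both results match the Gene Length (1335 bp) and Protein Length (444 aa) fields.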


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.018059
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGCTACAC AGGAAATGGA TAAGAAAATG GAGGAAGAGC TCAAGGAAGC CTTCCCCATG 
GCTGACAACG TCGACTGGAA CGAGGTATAT CAGAGGATTA TATACAGGTA CAGCACCCCT
CACGGCCTAC AACACGTAAA GGAAGAGCTT TACAAGCTAG AGGACAAGGG CGAAATAATA
GTACATCACA TAAAGCCCTA TAACAACCCC GTGAAGATGC AGACCCTCAA CGGCACCCCT
AAGGTCATTC CAACCACCAA GCTATGGCAA CACAAGAGCT GTGGTCAGTG TGGTCACATC
CCAGGTTATC CAACTTCTGT GTTCTGGATG ATGAATAAGA TGGAAATAGA CTACATGGAC
GAGCCACACC AAACCTCATG TACTGGATGG AACTATCACG CTTCCGGTGC TTCCAACCCA
GTAGCGCTGG CTGGAGTCTA TGTTAGGAAC ATGTGGAGGG CTTACGAAAT AGATTACTTC
CCACTCATTC ACTGTGGAAC TTCATTCGGT CACTATAAGG AGATCAGGAA CATGCTAGTC
CTTCACAAGG AGATCAGGGA CAAGTTAAGG CCCATAATGA GGAAAATGGA CATGGACATA
GTGATTCCAG AGGAGGTAGT TCACTACTCA GAATGGTTAT ACACCATGAG CAAGAAGGCA
GCCCAGCAGA AGAAGTATGA CCTTAGCGGT ATAAGGGCTG CAGTTCACAC TCCATGCCAC
GTTTACAAGT TGGTGCCCGA GGACACAATA TATGACCCCG AGGTATTCCA GGGTAGGAGG
CCGGCAGCTC CCAGCGGTAC TGCCCAGAAC TTTGGTGCCA AGCTAGTCGA TTACTCCACA
TGGTGGGACT GCTGCGGCTT CGGTTTCAGG CACATCCTGA CAGAGAGGGA GTTCTCGAGA
AGCTTTGCGT TATTCAAGAA GGTTATTCCT GCAGTTGAGG AAGGAAAGGC TGACATATTC
GTGACCTCAG ACACTGGATG TGTGACAACC CTAGACAAGA GCCAGTGGGC GGGAAAGGCT
CACGGTTTCA ACTATAACCT ACCAGTATTG GCAGATGCTC AGTTCGCGGC AATTGCAATG
GGCGCTGATC CCTACACAAT TGCCCAAGTT CACTGGCATG CCACTGACGT AGAAGGATTC
ATGAGGAAGA TAGGTGTGAA CGTGGACGAT TACAAGGAGA AGTTCATTCA GTACTTAGCC
GATCTAAGAG AAGGTAAAGC CGAGCCCGAG TATCTCTACA AGCCCCACAG GAAGATTGAC
TTCTATCTCT CAGTCCCAGA GAGGGTCAAG TGGTACAAGG GCGATAAGGC CCAGGTGCCA
AACACTTCTA AGTAA
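
The gene sequence above is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before measuring length or GC content. The following is a minimal sketch of that step; the helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence; uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied as printed.
first_line = "TTGGCTACAC AGGAAATGGA TAAGAAAATG GAGGAAGAGC TCAAGGAAGC CTTCCCCATG"
seq = clean_sequence(first_line)
print(len(seq))         # 60
print(gc_percent(seq))  # GC percentage of this 60-base fragment only
```

Passing the complete sequence text through the same two functions should give a value comparable to the GC content field of the record.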
 
Protein sequence
MATQEMDKKM EEELKEAFPM ADNVDWNEVY QRIIYRYSTP HGLQHVKEEL YKLEDKGEII 
VHHIKPYNNP VKMQTLNGTP KVIPTTKLWQ HKSCGQCGHI PGYPTSVFWM MNKMEIDYMD
EPHQTSCTGW NYHASGASNP VALAGVYVRN MWRAYEIDYF PLIHCGTSFG HYKEIRNMLV
LHKEIRDKLR PIMRKMDMDI VIPEEVVHYS EWLYTMSKKA AQQKKYDLSG IRAAVHTPCH
VYKLVPEDTI YDPEVFQGRR PAAPSGTAQN FGAKLVDYST WWDCCGFGFR HILTEREFSR
SFALFKKVIP AVEEGKADIF VTSDTGCVTT LDKSQWAGKA HGFNYNLPVL ADAQFAAIAM
GADPYTIAQV HWHATDVEGF MRKIGVNVDD YKEKFIQYLA DLREGKAEPE YLYKPHRKID
FYLSVPERVK WYKGDKAQVP NTSK
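
The protein sequence can be related back to the gene sequence by translating it with translation table 11, as listed in the record. The sketch below is an assumption-laden illustration, not the database's own procedure: it builds the standard codon assignments (which table 11 shares), reads an alternative table 11 initiation codon such as the leading TTG as methionine, and stops at the first stop codon. The function name `translate_cds` is hypothetical, and codons containing characters other than A, C, G, T are not handled.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base, then second, then third, each over TCAG).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons permitted by translation table 11.
TABLE_11_STARTS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq_text: str) -> str:
    """Translate a block-formatted CDS, stopping at the first stop codon."""
    seq = "".join(seq_text.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    residues = []
    for pos in range(0, usable, 3):
        codon = seq[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if pos == 0 and codon in TABLE_11_STARTS:
            aa = "M"  # an initiation codon is read as methionine
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence, copied as printed.
first_line = "TTGGCTACAC AGGAAATGGA TAAGAAAATG GAGGAAGAGC TCAAGGAAGC CTTCCCCATG"
print(translate_cds(first_line))  # MATQEMDKKMEEELKEAFPM
```

The output matches the first 20 residues of the protein sequence, including the initial M that comes from the TTG start codon.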