Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1546 |
Symbol | |
ID | 5103991 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1505672 |
End bp | 1507006 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640507432 |
Product | hypothetical protein |
Protein accession | YP_001191625 |
Protein GI | 146304309 |
COG category | [C] Energy production and conversion |
COG ID | [COG2048] Heterodisulfide reductase, subunit B |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.018059 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGCTACAC AGGAAATGGA TAAGAAAATG GAGGAAGAGC TCAAGGAAGC CTTCCCCATG GCTGACAACG TCGACTGGAA CGAGGTATAT CAGAGGATTA TATACAGGTA CAGCACCCCT CACGGCCTAC AACACGTAAA GGAAGAGCTT TACAAGCTAG AGGACAAGGG CGAAATAATA GTACATCACA TAAAGCCCTA TAACAACCCC GTGAAGATGC AGACCCTCAA CGGCACCCCT AAGGTCATTC CAACCACCAA GCTATGGCAA CACAAGAGCT GTGGTCAGTG TGGTCACATC CCAGGTTATC CAACTTCTGT GTTCTGGATG ATGAATAAGA TGGAAATAGA CTACATGGAC GAGCCACACC AAACCTCATG TACTGGATGG AACTATCACG CTTCCGGTGC TTCCAACCCA GTAGCGCTGG CTGGAGTCTA TGTTAGGAAC ATGTGGAGGG CTTACGAAAT AGATTACTTC CCACTCATTC ACTGTGGAAC TTCATTCGGT CACTATAAGG AGATCAGGAA CATGCTAGTC CTTCACAAGG AGATCAGGGA CAAGTTAAGG CCCATAATGA GGAAAATGGA CATGGACATA GTGATTCCAG AGGAGGTAGT TCACTACTCA GAATGGTTAT ACACCATGAG CAAGAAGGCA GCCCAGCAGA AGAAGTATGA CCTTAGCGGT ATAAGGGCTG CAGTTCACAC TCCATGCCAC GTTTACAAGT TGGTGCCCGA GGACACAATA TATGACCCCG AGGTATTCCA GGGTAGGAGG CCGGCAGCTC CCAGCGGTAC TGCCCAGAAC TTTGGTGCCA AGCTAGTCGA TTACTCCACA TGGTGGGACT GCTGCGGCTT CGGTTTCAGG CACATCCTGA CAGAGAGGGA GTTCTCGAGA AGCTTTGCGT TATTCAAGAA GGTTATTCCT GCAGTTGAGG AAGGAAAGGC TGACATATTC GTGACCTCAG ACACTGGATG TGTGACAACC CTAGACAAGA GCCAGTGGGC GGGAAAGGCT CACGGTTTCA ACTATAACCT ACCAGTATTG GCAGATGCTC AGTTCGCGGC AATTGCAATG GGCGCTGATC CCTACACAAT TGCCCAAGTT CACTGGCATG CCACTGACGT AGAAGGATTC ATGAGGAAGA TAGGTGTGAA CGTGGACGAT TACAAGGAGA AGTTCATTCA GTACTTAGCC GATCTAAGAG AAGGTAAAGC CGAGCCCGAG TATCTCTACA AGCCCCACAG GAAGATTGAC TTCTATCTCT CAGTCCCAGA GAGGGTCAAG TGGTACAAGG GCGATAAGGC CCAGGTGCCA AACACTTCTA AGTAA
|
Protein sequence | MATQEMDKKM EEELKEAFPM ADNVDWNEVY QRIIYRYSTP HGLQHVKEEL YKLEDKGEII VHHIKPYNNP VKMQTLNGTP KVIPTTKLWQ HKSCGQCGHI PGYPTSVFWM MNKMEIDYMD EPHQTSCTGW NYHASGASNP VALAGVYVRN MWRAYEIDYF PLIHCGTSFG HYKEIRNMLV LHKEIRDKLR PIMRKMDMDI VIPEEVVHYS EWLYTMSKKA AQQKKYDLSG IRAAVHTPCH VYKLVPEDTI YDPEVFQGRR PAAPSGTAQN FGAKLVDYST WWDCCGFGFR HILTEREFSR SFALFKKVIP AVEEGKADIF VTSDTGCVTT LDKSQWAGKA HGFNYNLPVL ADAQFAAIAM GADPYTIAQV HWHATDVEGF MRKIGVNVDD YKEKFIQYLA DLREGKAEPE YLYKPHRKID FYLSVPERVK WYKGDKAQVP NTSK
|
| |