Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0961 |
Symbol | |
ID | 5104513 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 887395 |
End bp | 889194 |
Gene Length | 1800 bp |
Protein Length | 599 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640506863 |
Product | sulfite reductase (NADPH) beta subunit |
Protein accession | YP_001191056 |
Protein GI | 146303740 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0155] Sulfite reductase, beta subunit (hemoprotein) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000332395 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.245939 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATATTAA GGGATAGGCT AATAGAGCCT GAGTTTAGGA CTTCCCCGGA TAGGTGGAGT GAGGAAGAGA AAATCAAGTA TAAAACAAGG GGATTTACCG AGATTCCTGA CTCGGTCGTC CACGAATTGA AGGACGAGAA GGACGAGTTG AGCTATGAAA CCTCGATTAT AGCCAAGTCA TGTGGAATAT ACCTTGAGTT CAACAGGGAT AAGTTCAGGG AGACCAGAGA GAAGGACTAC ATCTACATGA TAAGGATAGT AATCCCGGGC GGAGGACCAA TAACGGCGAA ACAGTGGGAA ATCCTAGATG AGATAAGCAA CAGATATACG GTGTCTGACG CATACACAAA GGACCCTAAA CCCTCATTGA GGCTCACCAC GAGGCAGGAT ATTCAACTCC ATCATGTTAA GAAGAGGGAC CTGCTCAACG CAATTCAAGA TATAGTTAGA TCTGGATTTT TCACCCTAAA CGGTTGCGGG GATAACGTTA GAAACGTTGT GGCCTGTCCC TTATCCTTCT TTTCCTCAAT ATTTAACTCC AATTCTCTAG CAAAGGAAAT AGCCAACTAC TTTAGGTTGC CCACTGGACC ATATATCCAG GTCTTTGAGC TAGACTCAGT TTCGCATCTA GATACCTCAA TTACGCATGA GGGGAGATTC CGTTACTCGG CAAGTCTTCT CAATAGGAAG TTCAAGATTG CGATCTCTGC CCTCCATTTG ATGGAGGGAA AGTTGGTGAG GGACAACTGC GTAGAGGCTA CCACCAATGA CATAGGTATT GTTCCGGTGG ATGGAAAGCT CTTTCAGTTA TACGTAGGAG GGGGACAGGG GGAGAACCAG GGATTCTCCA CTTTTTCCAC TCTCGGAAAG CCTCTGGGGG TATTTAACCG AGAGGAATTG GTGAGGGCCC TGGACGTTCT AGTGAACATA CAACAGGAGT GGGGTGATAG GAAAAACAGG CATTGGGCGA GAATGAAGTA CCTAGTCTAC AAAATGGGGA TAGAGTGGTT GAGGGAAAAG ATTAGGGAAA CAGGTCTAGA ACCCGAGCCT CCCTTGGATC TAGACGTGGG GGATAGAATG CTTCACCTGG GGCCTATTAG AACAGAGGTA GAGGCTTACG GAATATTTGT GGAAAACGGT AGAGTAATAG ATAGGGAAGA CTCCAAGTAT AAGACTGGGC TCCTAGAGAT GGTGAGGAGC TACCCAGATG CCAAGATTTT CATCACGCCA AACCAGCATC TTATTGTGAC TGGTATAGAT GACCTTAGGG AATTTGAGGT CTTTATGTCA AGATTTCTGA AAAAGCCGAC CAACCTAAGG ATGCATGCTA CTGCGTGTGT TGGCTTCCCC ACCTGTAAGC TATCGTATAC TGACAGCGAG AGATTTCTCC CAAGGTTAAT TGGGGAGCTC GAGAGGAGAT GGGGAGACCT GAAGGAAACC ATAGGTATTT CAGGATGTCT AGCTCAGTGT TCCAGGCCCG GGACCAAGAC CTTAGGATGG GTAGGGACTG GATATAATCT CTACATGCTC AAGGTTGGAG GCGATTCGTC GGGGAGATTC CAGGGGAATC CCCTAATTGA TCCAGATACA GGGGATGTAT ACCTTACCCA TGTACCTGGA AACAGGTTGG CGGATGTAAC TGACGCGCTA TTCGAGCTTT ACTCGACCCA TGGCAATGGG GAGGGAATGG GTCAGTTCCT CAGGAAGTTG GGAAATAAAA GAATCATAGA GTATCTGAAG AACAATCCCA AAACATCTGA CCTTATGACT CCATCAAAAC TAAGGGCGAC TTTGGATTGA
|
Protein sequence | MILRDRLIEP EFRTSPDRWS EEEKIKYKTR GFTEIPDSVV HELKDEKDEL SYETSIIAKS CGIYLEFNRD KFRETREKDY IYMIRIVIPG GGPITAKQWE ILDEISNRYT VSDAYTKDPK PSLRLTTRQD IQLHHVKKRD LLNAIQDIVR SGFFTLNGCG DNVRNVVACP LSFFSSIFNS NSLAKEIANY FRLPTGPYIQ VFELDSVSHL DTSITHEGRF RYSASLLNRK FKIAISALHL MEGKLVRDNC VEATTNDIGI VPVDGKLFQL YVGGGQGENQ GFSTFSTLGK PLGVFNREEL VRALDVLVNI QQEWGDRKNR HWARMKYLVY KMGIEWLREK IRETGLEPEP PLDLDVGDRM LHLGPIRTEV EAYGIFVENG RVIDREDSKY KTGLLEMVRS YPDAKIFITP NQHLIVTGID DLREFEVFMS RFLKKPTNLR MHATACVGFP TCKLSYTDSE RFLPRLIGEL ERRWGDLKET IGISGCLAQC SRPGTKTLGW VGTGYNLYML KVGGDSSGRF QGNPLIDPDT GDVYLTHVPG NRLADVTDAL FELYSTHGNG EGMGQFLRKL GNKRIIEYLK NNPKTSDLMT PSKLRATLD
|
| |