Gene Information |
Locus tag | Msed_2080 |
Symbol | |
ID | 5105060 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1998023 |
End bp | 1999327 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640507970 |
Product | NADPH:sulfur oxidoreductase |
Protein accession | YP_001192144 |
Protein GI | 146304828 |
COG category | [R] General function prediction only |
COG ID | [COG0446] Uncharacterized NAD(FAD)-dependent dehydrogenases |
TIGRFAM ID | |
|
|
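The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon; the function names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information table above.
START_BP = 1998023
END_BP = 1999327


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1305 434
```

Under these assumptions the computed values agree with the Gene Length (1305 bp) and Protein Length (434 aa) fields.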
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.216525 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGAGA GTCTTGTGGT ACTAGGTGGG GGAGCGGCCG GGATGAGCGC CGCATCCAGG GCGAGGAGAC TTAGGCCTGA CCTGAGAATT ACGGTTATTG AGTCCACGAA AATGGTGAGT CATGCACCCT GTGGGATACC ATACTTCGTG GAGGGTTTGT TCGACGACGA GAACCTCTTC ATGACCTATA CTCCCTCATA TTTTGAGAGG GAGAGGAAGA TAGAGGTGTT AACCAACACC GTGGCCAAGG AAGTTGATCT TGACTCTAGG GTAGTAAGAA CGGATAACGG AAAGGCGCTT GAATATGACT GGCTAGTCGT GTCTCTTGGA GCTCTGCCCA AGACGCTACC CAACGTTAAG GGAAACAGGG TTTATTACGT TCATCATCCA GCTCACGCGG TTGAGATCAG AGAGCGGTTG TGGTCCATGA ACAAGATAGC CGTGATAGGT GGAGGGATAC TTGGGGTGGA AATGGCTGAG GCGTTGAGCG CTGTGGGGAA GAAGGTTGTC CTAATCCACA GGGGGCCTTA CATCCTCAAC AAAATGCTGG ATCAAGACCT CGGAGACGTG GTCACCAAAC TTGTTCAGGG CAAGGTCGAG TTACACCTCA ACGAGAGCAC CGAGGAGATT GGAGAGAATT ACGTTAAAAC GGACAAGGGG AAGTACCAAG TTGACGGCGT AGTCCTTGCC CTAGGGGTCA CGCCCAACGT GGCCATGTTT AAGGAAAAGC TTCAGCTAGG CACCACTGGC GCAATAAAGA CCAATTCACG CATGGAGACT TCCGTGAAGG GCGTGTATGC AGCTGGCGAT GTGGCAGAGA CCACTCACGT GGTATCAAGG AGGGAAGTCT GGATGCCCTT TGCCCCTGTC GCGAACAAGA TGGGGTTTGT GGCTGGTAGT AACATAGGGG GCAAGGTCAC GGAGTTCCCA GGTACAGTTG GAAACATGAT TACTAAATTC CAGGACATGT TCATAGGAAA GGTAGGATTA AACGAGATTG AAGCCAAGGA AGTAGGGTTC AGGCCGATCT CTGCAACCAT AAAATCAAAA ACTAGGGCCA GGTATTATCC AGGTGCTAAG GACATTTACG TGAAGCTCGT GGCAGATGAG GATAGCAAGA GAATTTTGGG TGGACAAATT GTGGGAGGGG AGGAAGTACT AGGCAGATTG GATTCTGTTT CTGTGGCCCT CATGAAGCAG TTGACCGTGG AGGAAATGTT CTTTGCGGAG ATGGGTTACC TACCAGCGAT TTCCCAAGTA TGGGATCCGC TAACCGTTGC TGCTAGGCAA CTTCTTAAGG CATAA
|
Protein sequence | MPESLVVLGG GAAGMSAASR ARRLRPDLRI TVIESTKMVS HAPCGIPYFV EGLFDDENLF MTYTPSYFER ERKIEVLTNT VAKEVDLDSR VVRTDNGKAL EYDWLVVSLG ALPKTLPNVK GNRVYYVHHP AHAVEIRERL WSMNKIAVIG GGILGVEMAE ALSAVGKKVV LIHRGPYILN KMLDQDLGDV VTKLVQGKVE LHLNESTEEI GENYVKTDKG KYQVDGVVLA LGVTPNVAMF KEKLQLGTTG AIKTNSRMET SVKGVYAAGD VAETTHVVSR REVWMPFAPV ANKMGFVAGS NIGGKVTEFP GTVGNMITKF QDMFIGKVGL NEIEAKEVGF RPISATIKSK TRARYYPGAK DIYVKLVADE DSKRILGGQI VGGEEVLGRL DSVSVALMKQ LTVEEMFFAE MGYLPAISQV WDPLTVAARQ LLKA
|
| |
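The sequences above are displayed in space-separated blocks of 10 characters, so they need to be cleaned before any computation. The following is a minimal sketch of how that might be done, together with a GC percentage and a basic CDS sanity check. All helper names are hypothetical, the stop codons listed are those of translation table 11, and the start-codon check only accepts ATG (which is how this record begins), a simplification since table 11 allows other start codons.

```python
# Stop codons in NCBI translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace used for display blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_cds(seq: str) -> bool:
    """True if seq is a multiple of 3, starts with ATG, and ends with a stop codon."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# Toy input for illustration only; paste the full Gene sequence field to check the record.
toy = clean_sequence("ATGAAAGGGC CCTAA")
print(len(toy), looks_like_cds(toy))
```

Pasting the full Gene sequence field into `clean_sequence` and passing the result to `gc_percent` gives a value that can be compared with the GC content field.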