Gene Information |
Locus tag | Msed_1750 |
Symbol | |
ID | 5104750 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1684358 |
End bp | 1687153 |
Gene Length | 2796 bp |
Protein Length | 931 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640507645 |
Product | DEAD/DEAH box helicase domain-containing protein |
Protein accession | YP_001191829 |
Protein GI | 146304513 |
COG category | [R] General function prediction only |
COG ID | [COG1201] Lhr-like helicases |
TIGRFAM ID | |
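The coordinates and lengths listed above agree with each other, which a short sketch can confirm. It assumes that Start bp and End bp are inclusive, 1-based positions and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(1684358, 1687153)
print(length)                  # 2796
print(protein_length(length))  # 931
```

Both results match the Gene Length and Protein Length fields of this record.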
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00374519 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.037221 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGTGACT CGCTCAGCAA GAAGATATCG AGCCTAATGA AGGAGAGAAA CTGGACCAAA ATGACCCAGA TCCAGGAAAT GGCCATGGAA CCCATTTTGA GGGGAAATAA CACGCTAATC ATAGCCCCTA CCGGATTTGG GAAAACTGAG GCTGCACTTC TCCCTATTCT TAGCCTGATG TCAGAGGGAG AGCAAAAACC TGTGTCCCTG ATTTACGTTA CTCCACTTAA GGCACTGATA AATGACATCA CGATCAGGAT AGATTGGTGG GCCTCGAAGC TGGGGTTTGT GGTGAGCAGA AAGCATGGTG AGGTGCCCCA GAAGGAGAAA AACATGAGAC TGAGGAAGGC TCCCCACATC CTGGTTACTA CCCCAGAGGG CCTTGAAATA GACATGGATT GGGCCTCGAA GTTTAGGGAT AACTACAGGA ACGTGAAGTG GGTGATAATT GATGAAATAC ACGAGCTCAT GGGGTCCAAG AGGGGAGCTC AGCTTTTCGT TCTCCTGGAA AGGTTGAAGG ATTTCTCAGG AAAGGACTTC CAAAGAATTG GTTTGTCAGC TACTATCAGT AACGAGGAGC TTGTGGCAAA TACCCTCTTC GGATCTTCCA GTAGGCCCAA GACCATCGTA AAAAGTGAGG CGGGGAAGGA GTTCAGACTG AAGATAAGAT CCATTAAGGA TTCCGGGGAT GTTTGGGTAG CGTCGGCCAA GATCATCAAG GAGTCCTTGG AGCCACCTAC CCTGGTATTT ACTAATTCCA GGTTCCTCAC GGAAAGATTA CATGAGGAAC TGGAAAGGCT CGGAGAGAAG GGGATTTTCG TTCACCACTC CTCAATCTCC AGGGATTCAA AGAGCAATGC TGAGGAGAAC CTCAGGAGTG GTAAGGCTGG GGCGGTCCTT TGTACCAAAA CCCTGGAGCT TGGTATAGAT GTGGGAAAGG TTAAGAAAAT AATAATGTAT AGGCCTCCCC CATCAGTGGC GTCATTCCTT CAGCGATTGG GGAGGAGCGG TCATTGCGTT GGTGGCATAC CCTACGGTGA GATCATATGT GTGCAGGACT TTGATTGCCT CGAAGCGTTG GCAATATATT CCCTTTCGAG GAAGGGGAAG CTCGAACCTC CTAGAAGGGT TCGACCGTTA GACGTGGTAG CCAGGGAGAT ACTTGGAATG CTACTCCAGT ACTCCTCTAT AAAGTTGGAA CGCGTCTTCT CGATTATTAC TGCCTCTCAG GTATATAGGG ACTTAACGAG AGAGGAATTC CAGAACCTCA TCTCCTACCT CCAGAGGAAT AACCTAGTGG TGGTAGAGGG AGACGAACTT AAGCTAGGGA AATCGTTCTT CAAGATCTGG ACCTTTAATA GGTCCAATAA CTTCGTATGG GCCAAGAACT TCTCGGAGTT CTTCTCCCTG ATAAGTAATG ACGACGCCTT CACCCTGAGG AGCGGGGAGA AGATAATAGG TGAGATAGAC GCAATCTACG TGTATAAACA TATTCGTCCC GGTGACCTGA TTAGGATTAG CGGTAAACTA TGGAAGGTTG CCAGAATACA CAACGGAATG ATGATGGTAG ACCTAACGCC AGCAGACCGT GGAGAGGGAG AAATCCCCAT CTGGAGGGGA GAGGGAGTAC CAAAGTCGCA ACTAATTCCT AGGGAAATCC AAGAACTCTT CAAGATTGGT GATAAAATAC TAGAAAGCGA GATTTTGGAT GATAGTGCTA AGGCAAGCCT AAAGGCTCTG ATGGAGAAAT ACGTAAGGAG TAAGTTACCT TTACCTTCCT CAAGTACAAT TTACATGACA 
GTGACAGATA AGGAAACTGT TTATTCTACA CTGATTGACG AAAGGGTAGC TAACACCATT GCTCACATGT TGATGTACTT GGCGAGTTCC AAGTACACCC TAAACGTGTA TACTAGGGCT TCTATATATG GATTCTCCAT TAACATAACT GACAGGGACC TTCTCAGGGA ATTGGTCCAA ATGAAGGAGG AGAGAATAAG GAAAATACTT TTCAGATCTA TTCTGAGATC CCCGCTCTTC ATGTCCGTGG AAAAGGAGAT TCAGGCCAGC TTTGGTAAGA TAGGTAAGAT AAACCCGAAG GAGGATAAAC TGATAATCAA GGAAGGTCTA AGGCAGACCG TAAAGCGTTA CTTTAACATA AAGGGAACCC TCACCACGCT GAAAAGGATA AGGGAGGGGA AGATAAGGAT AGTTAGATCG GAGCTGACGC CCTTGGGAGA GGCAGTACTT TCTCACGCTC CCATCAAACC TTGGATTTCT GGGATCAATA TCCTGATTTA CGATGCCTTG AAGGGTGGAG CCTATACTGT TCACGAAATC TCCGAAATGT TATCTATTCC TCCAAGAAGC CTAGAGGTGA AACTTAAGCA GATGAGAAAA ACGAGCACAA AGTATAGGGT AACCAGCTTT GTGGACGTGG ATAGTAAGGA GATAAGGTGG TGTACAGTGG ATGAACTTAA ACAACTTGTG AACTCAGACG AGTTCTATAC CTCCTTCGCT CCCATTAACG AGGATGAGAC TTTCATTGCA GAGATGAGGT CCATTGAAGG CTCTAGCAAC ACGGAGTTAA TATTTAAGCC GAAACAGATA TTAGAAAATC CAGAGGAGTT CGCCAAGAGG ATCCCCATGG ATGAAGTTGG GGAGCTTAGG ATCCACGATC CCGTGGATCC CATGATATGT AACATGTCCC CAAGGTACTA TTTCGTGAGG AGGGATATAG TTCCCTATCT ACTCTTAAAC GCGTCGGCGT ACATTCAAAA CCTCAAATAT ACGTGA
Protein sequence | MSDSLSKKIS SLMKERNWTK MTQIQEMAME PILRGNNTLI IAPTGFGKTE AALLPILSLM SEGEQKPVSL IYVTPLKALI NDITIRIDWW ASKLGFVVSR KHGEVPQKEK NMRLRKAPHI LVTTPEGLEI DMDWASKFRD NYRNVKWVII DEIHELMGSK RGAQLFVLLE RLKDFSGKDF QRIGLSATIS NEELVANTLF GSSSRPKTIV KSEAGKEFRL KIRSIKDSGD VWVASAKIIK ESLEPPTLVF TNSRFLTERL HEELERLGEK GIFVHHSSIS RDSKSNAEEN LRSGKAGAVL CTKTLELGID VGKVKKIIMY RPPPSVASFL QRLGRSGHCV GGIPYGEIIC VQDFDCLEAL AIYSLSRKGK LEPPRRVRPL DVVAREILGM LLQYSSIKLE RVFSIITASQ VYRDLTREEF QNLISYLQRN NLVVVEGDEL KLGKSFFKIW TFNRSNNFVW AKNFSEFFSL ISNDDAFTLR SGEKIIGEID AIYVYKHIRP GDLIRISGKL WKVARIHNGM MMVDLTPADR GEGEIPIWRG EGVPKSQLIP REIQELFKIG DKILESEILD DSAKASLKAL MEKYVRSKLP LPSSSTIYMT VTDKETVYST LIDERVANTI AHMLMYLASS KYTLNVYTRA SIYGFSINIT DRDLLRELVQ MKEERIRKIL FRSILRSPLF MSVEKEIQAS FGKIGKINPK EDKLIIKEGL RQTVKRYFNI KGTLTTLKRI REGKIRIVRS ELTPLGEAVL SHAPIKPWIS GINILIYDAL KGGAYTVHEI SEMLSIPPRS LEVKLKQMRK TSTKYRVTSF VDVDSKEIRW CTVDELKQLV NSDEFYTSFA PINEDETFIA EMRSIEGSSN TELIFKPKQI LENPEEFAKR IPMDEVGELR IHDPVDPMIC NMSPRYYFVR RDIVPYLLLN ASAYIQNLKY T