Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0814 |
Symbol | |
ID | 5105137 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 743206 |
End bp | 747510 |
Gene Length | 4305 bp |
Protein Length | 1434 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640506719 |
Product | anaerobic dehydrogenase |
Protein accession | YP_001190913 |
Protein GI | 146303597 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0197792 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCACT TGAAATTGTC CAGAAGGGAC TTTCTTAAGG TAAGCGGTGC GGCTGCTCTA GGAACGGCTC TCATCCTAGG CGGAAATTCG GTGGCTAAGA AAATATTTGA TACTTTCTCT GAGACAAATT ATACCCTTAA TTATCCCAGC GACGAGATAG TCTACTCCAA CTGCTTCCAG TGTCTCGGAA GATGCGCCCT AGAGATAGTG AGAACTCCAA CGGGATTCCC GAGGTTCATC ACTGGTACCA TAGGGTGGCA CATAAATGAC GGTGGCGTAT GCCCTAGAGG GGCATCAGAC GTCTATTACT ATTTCGCTCC AGCTAGGTTA AGGTATCCTC TCCTTAGGGC TGGGGATAGG GGTTCAGGTA AGTGGATGGC CATAGATTAC GACACTGCTT TCGACATACT GGTTAATGGT GCCTCAGCCA AGAGCTGGAG CAACTTAGGA ATTACCCCAC AGGAACTAGG GGTTGCTAAT TTCCAAGGTC TAATGCAGAT TAGGGAGACA AATCCGCACT CCCTAGTCTT CATGCAGGGA AGGGATCAGT TAATCCCAGG AATTACCGCA GGCTTCTTTG CAGGCAATTA TGGTACAGCT AACGCCGGGG CACACGGTGG ATTCTGCTCC ATGAACGTCT ACACTGCAGG AGTTTACGTT ACTGGTGCTC CAGTGTGGGA GTACGCTGGA CCTGACGAGG AGAGGGCTCA GTACTTCATT CTAGCAGGTC TAGCTGGAGA TCACTTCCCC AACTGGATGA GAAGGATCAT TGCGAGGATA AGGGAGAATG GAGGTAAAGT GGTTACAATT GCCCCTGAAA GGTTCGGTTT CTACTCTGTT TCTGATGAAC ACCTCTTCAT CAATCCAGCC ATGGACGGAG CATTGGCCAT GGGATGGATA AGGGTTTTGG TTGACTTCCA CTATTACGTG TATAAGGCTT ATCTTGCTTC CACGGGACAA GGTCCAACGG TCCTTAACCC GATCACCTTG ACGCCGGTGC AACCTGCATA CGACACCTCC TCTGGCCAAC TGGTGATGCA GACGGTTACC CCATCTGGTC AAGTGCAGGC TATACCCAGT TTAGGTGATA TTCCCAGCAC CGCCGTATTC CCGCATATTG ACGAGGAGTT CTTGAGATAC TACACCAACA TGTCCTGGCT CGTCATAGTG AACCCTAACC CGCAGAACGG TGACGCCCTA GATCCCACAG ATCCGACTGC AGGGAATAAC GTGGGGCTAC ACCTGAGGAT GCCCGTTAAC AACTCAGAGT ACAGTGGCGC TCATCCATGG CTAGAGGCCA TTATGGGGAA TGACGGCAAC GTTTACTCCT ATGTGGACAC TCCATGGCAG AAAAACGTGA TGCCAATCCT AACCATGGAT GAGCTTCCCT CAAGCATGCA GTCAAGCATT GTGCAGGTTC CCTACAAGCT AAAGAATGGA ACGTCAGTGA AAGTACCAGC TATCCAGGTC ACAGTTCCCA AGGCCTTAGG TTCCTCTGAG ACCATAACGC TCACCGTTAC GACTGCGTTC GAGCTATTTA GGGCCGAGCT TCTCAACTAC GATCCATATA CACCTTCATC TTCCTCACCA CAATACGGAG CCCTTAACGT TGCTGAGCTC GCTGGTATTC CCCATAACAC GATAGTTAGA ATTGCCAATG AGCTTGCGAC TGTCGCATTC CAGAAGTCGA TCTACGAACC AGCTCAATGG GTTGACTACC TAGGTAGGTA TCACGATCAT GTTGTGGGAA GACCTGTCTC GATTTACTTC ATGAGAGGTC TAGCTGCTCA TGCCAATGGG TTCATGAGCT CAGCTGCCTA CTATTACCTT GTGCTGGCCC TGGGTGCCTG GGACAACCCT GGTGCCGATC TTTATAAGTA TCCTTATCCG CACTACTTCC CTGGTCATGC TGTACCTCCG CATCCCTTAG CTAACTCTCC ACAGATTGAC CCCAACATAG CTTCAGCTAT AAGTGGGATG AACGTTACGA TACCTAAGGG TAGATCTGGC TTCCTCAAGA CGGAGTACAT CTATAATGAT GGCACCGTTG ACATAAAGAA AGTCACTGTC GTGGGTGCAG GTCCCTACGG ATATCCTGAT GGACCAGATG ATCTGGTACT ATTCAGTACT GGAAGACCCT TACTCATCGA CAGAGGTTAC AGCTGGGAGA TGCCTCTTTC CACGCAGAGG TCCATTTCAG CAGTTGCTTA CACAACATAT TTCGCTAACA AGAATCCCAA CCAGGTTATT CCGTATACCG TTAACGCAAT GATGTGGTAC ATTACTGCTC CCTACTGGAA CAATGCCTAC ACTTTGACCG ACCTATTACA GAAGGTCACA GAGAAGGGTA GCGATGGTAA TTATGTAATT CCATTTACCA TAAGCTTCGA TCTCTTCATG CAGGAGACTA ACAACGTTGT AGACCTGGTG ATGCCAGACC TATCCTTCCT CGAGATATAT GGTTTCCACA GCACCTTTGA TAGACCCACC AGTCTACCAC AAGGTCCCTC CGATTCCTTG AACTGGCCCG CACTACCATC CATGTATCCG GTCCTATCTA CAGGGGACAC GTTACTGACC CTTCTATGGT TACTTAGGGC ATATCCAGGG CAGAAATCCA TAACACCTAC CGATAATCCT CACTACACTA CCCAAGATCC AATAACAGGT GGAAATCTGG TAAGCCCGGT CTCCGTTACG GACCCAATAA CAGGAACTCA AATACTGACT CAAGGTCAAC CTGGTCTAAT CTCTTCTGCA ATGTATATTC TGAAGTCCGG AACGTTACTA GCGGGATATG GAAGTAACTT CGAGTATATC CTTGCAGACT CAGAGGGTAA ACAGGTACCA AACCCACAGC AACTAAAGCT CTACGCCTCC TTCGTCCCAG CCTCCTCTAA TCAGCAGAGT GTTATTAATC AGATACTAAG TAGTGTGCCA TCTGGTCAGA CCTTCTCCAC AGCGCCTACA TACAAGATTG GGGCAAGCGA CAGGTCCTCA GGTACAGAAC ACTACTTCAG GATTCCGGTC AAGTTCCAGC CGGGCCTTCA AACAAAATTA CAGAATATCA TCAGCAAGTA CGCTCCTAAC ATGTCCCTTA GTGACATCAA TCCTGGATAC GTCAAGGTAG GAAAGGGAGG CGCAGCGTAC GTTCTTCCTC CTTCCATAAG GTACATGAGA AACGTAAACA TGTTCTACTT CCTGGGATGG GGCGCTTATA TGCCAGGAGG ATATGGACCC ATTGGACTGC CGTATGTCCA CAGGATTTAC TTGGAATACC TCCAGAAGTT CAGGTTAGCT GCAGCCGGAA AGTGGACAGG ATACAATGCA GCTTACTATT ACTACTACCA GCTGACTGGA AATTCCAACT TCAAGGTGCA GTCCGTGTTG CCGAATGACA GCTATGGTCA AGCTCTGGCA AAGAACATAC TGGACTATCA CAGACCCTTC GGAGGGTATT ATCCACCACC AGCTTGGGCT TCCAACATAA CATCAGACGG GATCAATGAG AATGAGTATC CCTTAACGTT CTTTGTGAGA AGGCACGACA GGACATATCA CACTTGGTCA TTCAATGTTC CATGGTTAAC TGCTATCATG CCCTATACTC CAGTCATGCT AAGTTCCGCA GACCCCTACG TGCAGAGCAT GGGGATACAG AGCGGTGACA TGGTCCAGTT TGAGGCAATA AACGAACCTT GGAGCGGTGG TCTCAGAACC ACATTGACAG CTGTGGCATT CCTGGATAAT GCCACCAGGC CCGGTGCAGC TTGGGTTGTA GTATCTGCCC AGGCTCTACC TGGATTCAGG GGACAGACTG CTGAGTCTCC GCAGGTGAAG TATAGCGTTC TAAATAACTG GGCCAATATC TCCTACATGC CACCATCTAG GGGAGGACAA CTCTCTCCGA CAACGGACAA GGCGTTCCCA ATCATGTACC TAGATCCGAT CACTGGTCAA ACTTCATGGC ACGATAGCAG GATAAAGATT GTGGGTAAGT CCTCTGCGAC ACAGGTACAG GTCACAGCTA ACAAGTTCGT GTACATGGGC CAGGACTTCT CAAACCAGGA TATTCTATCC ACTATACTTA ACCAAATGGG AGGTCAGATA TCTGTCAGCA ACTCCCCATC TACCCCGACC TTTGCACAAC CTGTGAATGT GCCTCACCTG AGATTTGATG CCAACAACAT GACCTCCACA AGTATATGGA GCTACGTATC GCCAGGATCC CTGGCACCAG GTTACACCAT AGGCAACTAC AAGGTGAGGT TCGGTTTTGC GACCGATCCT TCAAGTCAAG GGTGA
|
Protein sequence | MSHLKLSRRD FLKVSGAAAL GTALILGGNS VAKKIFDTFS ETNYTLNYPS DEIVYSNCFQ CLGRCALEIV RTPTGFPRFI TGTIGWHIND GGVCPRGASD VYYYFAPARL RYPLLRAGDR GSGKWMAIDY DTAFDILVNG ASAKSWSNLG ITPQELGVAN FQGLMQIRET NPHSLVFMQG RDQLIPGITA GFFAGNYGTA NAGAHGGFCS MNVYTAGVYV TGAPVWEYAG PDEERAQYFI LAGLAGDHFP NWMRRIIARI RENGGKVVTI APERFGFYSV SDEHLFINPA MDGALAMGWI RVLVDFHYYV YKAYLASTGQ GPTVLNPITL TPVQPAYDTS SGQLVMQTVT PSGQVQAIPS LGDIPSTAVF PHIDEEFLRY YTNMSWLVIV NPNPQNGDAL DPTDPTAGNN VGLHLRMPVN NSEYSGAHPW LEAIMGNDGN VYSYVDTPWQ KNVMPILTMD ELPSSMQSSI VQVPYKLKNG TSVKVPAIQV TVPKALGSSE TITLTVTTAF ELFRAELLNY DPYTPSSSSP QYGALNVAEL AGIPHNTIVR IANELATVAF QKSIYEPAQW VDYLGRYHDH VVGRPVSIYF MRGLAAHANG FMSSAAYYYL VLALGAWDNP GADLYKYPYP HYFPGHAVPP HPLANSPQID PNIASAISGM NVTIPKGRSG FLKTEYIYND GTVDIKKVTV VGAGPYGYPD GPDDLVLFST GRPLLIDRGY SWEMPLSTQR SISAVAYTTY FANKNPNQVI PYTVNAMMWY ITAPYWNNAY TLTDLLQKVT EKGSDGNYVI PFTISFDLFM QETNNVVDLV MPDLSFLEIY GFHSTFDRPT SLPQGPSDSL NWPALPSMYP VLSTGDTLLT LLWLLRAYPG QKSITPTDNP HYTTQDPITG GNLVSPVSVT DPITGTQILT QGQPGLISSA MYILKSGTLL AGYGSNFEYI LADSEGKQVP NPQQLKLYAS FVPASSNQQS VINQILSSVP SGQTFSTAPT YKIGASDRSS GTEHYFRIPV KFQPGLQTKL QNIISKYAPN MSLSDINPGY VKVGKGGAAY VLPPSIRYMR NVNMFYFLGW GAYMPGGYGP IGLPYVHRIY LEYLQKFRLA AAGKWTGYNA AYYYYYQLTG NSNFKVQSVL PNDSYGQALA KNILDYHRPF GGYYPPPAWA SNITSDGINE NEYPLTFFVR RHDRTYHTWS FNVPWLTAIM PYTPVMLSSA DPYVQSMGIQ SGDMVQFEAI NEPWSGGLRT TLTAVAFLDN ATRPGAAWVV VSAQALPGFR GQTAESPQVK YSVLNNWANI SYMPPSRGGQ LSPTTDKAFP IMYLDPITGQ TSWHDSRIKI VGKSSATQVQ VTANKFVYMG QDFSNQDILS TILNQMGGQI SVSNSPSTPT FAQPVNVPHL RFDANNMTST SIWSYVSPGS LAPGYTIGNY KVRFGFATDP SSQG
|
| |