Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1318 |
Symbol | |
ID | 5104569 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1294134 |
End bp | 1297055 |
Gene Length | 2922 bp |
Protein Length | 973 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640507207 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_001191400 |
Protein GI | 146304084 |
COG category | [R] General function prediction only |
COG ID | [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTCTCA CGGTGAGGAT AAATGATAAA GTATATTCCG CTAACCCTGG GGAGACTATC ATTGACGTTC TGAAAAGGAA CAACATTTAC GTACCCCACG TCTGCCTCAA CGAGGGTCTA GTGCCCATAG AGAGCTGTGA CACTTGTCTG GTCAGGGTAA ATGGTAAGTT AATGAGGGCC TGTTCCACCA GGGTTGAGGA CGGAATGACA ATAACCACTA ACGACGATAA GTCCAAAGGG GCTAGAAAGG AGGCCATATC TAGGATACTT AGATATCACA AGCTATACTG CACAGTTTGC GAGAACAATA ACGGAGATTG TGTCCTTCAT GAGGCCGTGA TTAAGGAGAA GGTCTTCCAT CAAAGGTACG TTGAGAAACC CTACTCCCTA GATAATAGTG GGCCCTTTTA CGTTTATGAC CCTGCACAAT GCATCCTTTG CGGGAGATGC GTCGAGGCAT GCCAAGACTT CGCCGTAAAT GAGGTAATAT GGATCGACTG GAACCTGAAT CCGCCAAGGG TGGTGTGGGA TCAGGGTAAT CCCATTGGAA ATTCTTCCTG CGTGAACTGT GGGACCTGCG TCACAGTCTG TCCCGTTAAT GCCCTTATGG AGAAGGGAAT GTTGGGGGAG GCTGGACTCT TTACGTGGAT AAACCCAGAG CTCAAGAAGA AGACAATAGA GGCAGTTGGG AAAGTTGAGG ACAACTTTAG CTTGCTCATG ACAGTGAGCG AACTAGAATC CAAGGCGAGA CAATCACAGA TTAAGAAGAC CAAGACTGTG TGCACTTACT GCGGAGTTGG TTGCTCCTTT GAGGTGTGGA CTAAGGGAAG GAAGATCCTG AAGGTAGAGC CCAAACCTGA GTCACCAGCG AACGGGATCC TGACTTGCGT CAAGGGTAAG TTTGGGTGGG ATTTCGTCAA TAGCCCAGAT AGGATAACTA AACCACTAAT TAGAGAGGGA GATCACTTCA GGGAAGCCAG CTGGGATGAG GCAATCCAGC TTGTTGCGAG GAAGTTTAAG GAAATCAAGG AGAGGTATGG CCCAGATTCC CTGGGCTTCA TTGCCTCAGA TAAAATGACC AATGAGGAGG CTTACCTCCT TCAGAAGCTA GCTAGGGCAG TTGTGGGCAC GAACAACGTA GATAACTCCT CTAGATATTG CCAATCTCCG GCTACAGTGG GTCTTTGGAG GACCGTTGGA ATAGGAGGTG ATTCCGGGAC CATAAGGGAC ATTGAGAACG CTGATCTAAT CCTCATAGTG GGGCATAACA CCACGGAAAG TCACCCCGTT GTGGGGAGTA AGGTTAAGAG GGCCCAAAAG ATCAGGGGGG CCAAGATTGC CGTTATTGAC GTCAGGAAAC ACGAGATGGC AGAGAGAGCT AACCTCTTCA TCAGGCCCAA GCCAGGTACA GATGCAGCGG TCTTGGCAGG GGTAGCGAAG TATATCGTGG ACCAGGATTG GGTGGATCAC GAGTTCCTCA AGAGAGTTAA TGGTTTTGAG GAGTTTAAGG AGACCATCAA GGGCTTCACG CTGGACTACG TTGAGAGCGT GTCTGGAGTG CCCAGGGAAC AGATAATCAA ACTAGCTGAA ATGATACATC AAGCAAAGGG CGTAGCAGTG TTGTGGGGGA TGGGTGTAAC TCAGCACTTA GGTGGGGCTG ACACTTCCAC CATCATATCC GACCTTCTCC TATTAACCGG AAATTACGGA CGCCCAGGAA CCGGGGCTTT TCCCATGAGG GGGCACAACA ACGTTCAAGG GGTCAGCGAC TTCGGTTGTC TACCAAACTA CATGGTAGGA TACCAAAAGA TGGAGGAGAG CGTTATGTCC AAGTTCGAGG ACTCGTGGAG AACGACCCTT AACAGAAAAC CTGGCCTGCA GATACCACAA ATGATAGAAG GGGTCCTCGA GGGAAAGATC CATGCCCTTT ACGTCGTGGG AGAGGATACC GTGATGGTTG ACTGTGGAAC TCCTCTTACT AGAAAGGCGT TGGAGAACGT GGACTTTCTT GTGGTTCAGG ACATGTTCAT GACAGAGACA GCGAAGTTAG CTGACGTCAT ACTTCCAGCT GCGGCGAGTC TTGAAAAGGA TGGGACTTTC GTCAACACAG AGAGGAGGAT ACAGCGGATC TACAAGGCCA TGGATCCCTT AGGTGAATCA AAGCCTGACT GGGAAATAAT CCAAATGATA GCTAACGCCA TGGGAGCTAA CTGGAACTAT CATCATCCCT CGGAGATCAT GGACGAGATC GCTAGGCTGA CCCCAATATT TGCTGGCGTT TCCTATTCGA GGCTCGAGGG TTTCAATAGC CTGGTCTGGC CAGTGAACGA AGATGGAACT GACACCCCTC TGCTTTACGT GAATTCCTTC GCCACTCCTG ATGGAAAGGC AATCCTCTAC CCGCTGGAGT GGAAACCAAG ACCGCTGAAG GATGAGGTAC ACTCCATAGT GGTAAACACT GGAAGGGTAC TTGAGCATTT TCACGGTGGT ACAATGACTG GAAGGGTTGA AGGTCTTAGG AGAAAGTTCC CAGAAACGTT CGTGGAAATA TCCAAGGAAC TAGCTGAGAG GTACTCCATC AAAAACGGAG ATCTAGTACT CGTTAAGTCA AAGTTTGGAG GGGAGATCAA GGCAAGGGCG CTGGTCAGCG AAAGAGTGTC AGGAGAGGAA GTGTTTATTC CCCTCTTTGC CTCAGAACCC TCAAAGGGTG TGAACAACTT GACAGGCCAA GAGTTTGATA AGGCTTCCGG AACTCCAGGC TATAAGGATA CGCCCGTCTT GATCGAGAAG ATATCCAGCG GTGAGGGAAC TCCGTTACCA AGGGATAACT GGAGGTTTCA CGTACAGGAG AGGAAGAGAC AGATCGGGAT TGAGGTGACC AAGAAATGGA TGAGAGAGGA GTTCAAACCC TTGACAGAGT AA
|
Protein sequence | MSLTVRINDK VYSANPGETI IDVLKRNNIY VPHVCLNEGL VPIESCDTCL VRVNGKLMRA CSTRVEDGMT ITTNDDKSKG ARKEAISRIL RYHKLYCTVC ENNNGDCVLH EAVIKEKVFH QRYVEKPYSL DNSGPFYVYD PAQCILCGRC VEACQDFAVN EVIWIDWNLN PPRVVWDQGN PIGNSSCVNC GTCVTVCPVN ALMEKGMLGE AGLFTWINPE LKKKTIEAVG KVEDNFSLLM TVSELESKAR QSQIKKTKTV CTYCGVGCSF EVWTKGRKIL KVEPKPESPA NGILTCVKGK FGWDFVNSPD RITKPLIREG DHFREASWDE AIQLVARKFK EIKERYGPDS LGFIASDKMT NEEAYLLQKL ARAVVGTNNV DNSSRYCQSP ATVGLWRTVG IGGDSGTIRD IENADLILIV GHNTTESHPV VGSKVKRAQK IRGAKIAVID VRKHEMAERA NLFIRPKPGT DAAVLAGVAK YIVDQDWVDH EFLKRVNGFE EFKETIKGFT LDYVESVSGV PREQIIKLAE MIHQAKGVAV LWGMGVTQHL GGADTSTIIS DLLLLTGNYG RPGTGAFPMR GHNNVQGVSD FGCLPNYMVG YQKMEESVMS KFEDSWRTTL NRKPGLQIPQ MIEGVLEGKI HALYVVGEDT VMVDCGTPLT RKALENVDFL VVQDMFMTET AKLADVILPA AASLEKDGTF VNTERRIQRI YKAMDPLGES KPDWEIIQMI ANAMGANWNY HHPSEIMDEI ARLTPIFAGV SYSRLEGFNS LVWPVNEDGT DTPLLYVNSF ATPDGKAILY PLEWKPRPLK DEVHSIVVNT GRVLEHFHGG TMTGRVEGLR RKFPETFVEI SKELAERYSI KNGDLVLVKS KFGGEIKARA LVSERVSGEE VFIPLFASEP SKGVNNLTGQ EFDKASGTPG YKDTPVLIEK ISSGEGTPLP RDNWRFHVQE RKRQIGIEVT KKWMREEFKP LTE
|
| |