Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1457 |
Symbol | |
ID | 5104827 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1426534 |
End bp | 1428507 |
Gene Length | 1974 bp |
Protein Length | 657 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640507345 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_001191538 |
Protein GI | 146304222 |
COG category | [R] General function prediction only |
COG ID | [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTGAAA CGAAAAGTAT CTGCCCATAC TGTGGAGTAG GGTGCGGTCT TATACTTGAG GGTGAGAATA ACGTAGTTGC GAGAGTGTAT CCAGACCGCG ATCACGTGGT TAGCAAGGGT CACATATGCG GTAAAGGGAG TACAGCCCAC GAACCTGGAA ATAGTTGGGA CAGGCTTCTG TATCCCCTGA AAAGAGAAAA GGACATCCTA GTCAGGATTT CGTGGGACGA AGCTATCCGA GAGATCGCGT CAAAACTCTC GGAGATAAGG AGCAAGTACG GTCCTAGTGC CATAGGATTC TACGGAGGTT GCCAGAACAC CCTTGAGGAG GGATATACCA TGATGAAGCT AGCAAGGGCC TTAGGAACCA ATAATGTGGA TTCATGTGCG AGAGTTTGTC ACGATCCCTC AGCTACAGCC CTAAAGGAGA TGGTAGGCCT CGGTGCAACC TCAACCTCAG TTACGGAGAT ACCCAAAAGC AAGGTCTTGG TTATAGTTGG AGAATCACTA ACCGAGAGCC ATCCCGTCCT AGTTCAGTAT CTCTCGATGT TGAAGAAAAA TAACGGCAAG GTAGTAGTGA TAGACCCTAG GGTAACAGGA ACTGCGAGGT TGGCTGACCT TCACCTCAGG GTTAGACCTG GTACAGACAT TTACCTGTTT AATGCCGTTG CCAACTACTT GATCTCCAAC AACATCTACG ACAAGAAGTT CGTTGAGGAA AGGGTGGAAG GATTCGTTGA GTTCTCTAGG CTTGTTAAGT CCTACACAAT CCAAGGAGCA GAGGAAATAA CGGGGATAGA TCAGTCCGCT ATCCTCGAGT TTGCCAAACT AATATCGCAG AAACCTGTCA TCTTCTCCTG GGGTCTGGGG CTTACCCAGA CTGGAGGGCC TAAGGCAGTC CGTAGCCTAA TTAACCTCGC CCTGCTTACA GGCAATGTGG GTTTCGAGGG AGCGGGCCTC CTAGTATACA GGGGACAGAC CAATGTACAA GGATCAGGAG ATATGATTAA GCCCAACGTG TTTCCCAATG GTCCCATGAC GCTGGAAACG GCGAGGGAGC TGGAGAAGCT ATGGGGTTTC TTGCCTCCCA CATGGGAAGG TAAAACTGTA ACTGAAGCCC TCCTTGAGTC GGACATGAAG GCCGTGGTAC TCATGAACTT CAACCCTGCA GTGAGTTTCC CAAACAGACA GAAGGTTGAG AATTTCTTGA AGTCCCTAGA GCTTCTGGTA GTTATGGATC CCTTCATGAC AGAGACCGCA AGGTTTGCAC ACTACGTCCT GCCGTCGGCT ATGTGGACCG AGAAGGAGGG TTCCGTCACC AGCCTTGATA GAGTTGTGAA ATGGAGGTTT AGGGCAGTAT CTCCTCCAGG AGAGGCGAAG GAAGAGCTCG AGATCCTGTC CCTCCTCGCA GATAGACTGG GATTCAAGGG ATTTTCCAGG GATCCAAAGG AGGTATTCAA GGAATTGAGG AGCGTGGTCA AGATCTACTC TAACTTAACT TTGGATCAGG TCATGGACTA CTCATCCCCC TCAAGATACC CAGAGAACGA CCCAGTTCTC TACAGAACAA GGTTCTATAC TGCAAGTGGG AAGGCTAAGT TGAAGTTTGA GGAACAACCA GAACCCAAGA AAGGTCTCAT CTTGATAACG GGCAGAGCGG TAACTAGGTA CAACACAGAC GAGATGATAA GCAGAACACC TGGATTCGGC CAACTTACAC CCGTGATTTA CCTTAATCCA AGGGACGCAC AAAACCTGGG TATCAAGGAT AATGACCTGG TAAAGGTATC CTCAAGATGT GGTATGGCAA TCCTAAGCGC CAAAATCTCC CCCGACGTGT TAGAGGGAAC AACTTTCGCG TATATGCACG TCCACAGTAT CAATAATGTA GTCTGTGATG AGCTGGATCC AGAAACTAAA ACTCCGAGAT ATAAGTACAC TGAGATAACT ATAACAAAAA TTGAATGGGT CTAG
|
Protein sequence | MLETKSICPY CGVGCGLILE GENNVVARVY PDRDHVVSKG HICGKGSTAH EPGNSWDRLL YPLKREKDIL VRISWDEAIR EIASKLSEIR SKYGPSAIGF YGGCQNTLEE GYTMMKLARA LGTNNVDSCA RVCHDPSATA LKEMVGLGAT STSVTEIPKS KVLVIVGESL TESHPVLVQY LSMLKKNNGK VVVIDPRVTG TARLADLHLR VRPGTDIYLF NAVANYLISN NIYDKKFVEE RVEGFVEFSR LVKSYTIQGA EEITGIDQSA ILEFAKLISQ KPVIFSWGLG LTQTGGPKAV RSLINLALLT GNVGFEGAGL LVYRGQTNVQ GSGDMIKPNV FPNGPMTLET ARELEKLWGF LPPTWEGKTV TEALLESDMK AVVLMNFNPA VSFPNRQKVE NFLKSLELLV VMDPFMTETA RFAHYVLPSA MWTEKEGSVT SLDRVVKWRF RAVSPPGEAK EELEILSLLA DRLGFKGFSR DPKEVFKELR SVVKIYSNLT LDQVMDYSSP SRYPENDPVL YRTRFYTASG KAKLKFEEQP EPKKGLILIT GRAVTRYNTD EMISRTPGFG QLTPVIYLNP RDAQNLGIKD NDLVKVSSRC GMAILSAKIS PDVLEGTTFA YMHVHSINNV VCDELDPETK TPRYKYTEIT ITKIEWV
|
| |