Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_0100 |
Symbol | |
ID | 6316315 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | + |
Start bp | 115206 |
End bp | 117887 |
Gene Length | 2682 bp |
Protein Length | 893 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 642642473 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_001916287 |
Protein GI | 188584742 |
COG category | [R] General function prediction only |
COG ID | [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000102152 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.000000000000206048 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGTCTTG TAACTGTAAC CATTGATGGT CAAAGCATAC AAGTTCCCAA GACCTCTACT GTGCTGGATG CCTGTGAAAA ATTAGGAAAA GAAATTCCTA CCCTTTGTCA CCAAAAAGAT CTGACTACAG TGGGTGCTTG CCGTTTGTGC GTTGTGGAAG TAGAAGGCGC TGGAAACTTT CCAACGGCAT GTACTCTTCC TGTAAAAGAT GGCATGGTGA TCAATACAAA CACTGATGAA GTAAGGCATG CACGAAAGAT GGTTTTGGAA TTATTATGGG CAAATCATCC AAATGATTGT TTAACTTGTG AATCTAACGG AAACTGTAAG TTTCAAGATT ACTGTTATGA ATACGGGGTT ACTGAAAGCA GATTTCGGGG AGAAGTTATC GAACATGAAA TAGATGAAAC TAGTCGCTTT GTTGAAAGAG ACCTGGATAA ATGTATTTTG TGCGGGAAGT GTATTAGAGT TTGCCACGAG ATTCAAGGAA GTGAAGCTAT TGATTTCATG GATAGAGGTT TTGAAACTAA AGTGGCAACA TTTTATGATA AAGGACTTAG CGACTCCCCC TGTGTAGATT GCGGTAATTG TATAAATGTA TGTCCAGTAG GTGCTTTGAT TCCAAAACCC TTGAAAGGTA AGGGAAGGGA TTACGATTTC GAAAAAGTCA AGACAACCTG CCCATACTGT GGTGTCGGGT GTACTTTTAA TCTAAATGTC AAAGATGGAG AGGTTGTTGG TGTCAGTCCT GATGAAGAGG CTGAAGTAAA CGATGGATAC CTTTGCGTTA AAGGCCGCTT TGGAACAGGA TTTATCCATA ATGATGATAG ACTAACCCAA CCTCTTGTTA GAAAAAATGG TGAACTTGTA GAAACTGATT GGGAAGAAGC CTTAAATACT GTTGCGGAAA ACTTTAAAAA ACTAAAAGAA AAACACGGTG GCGACGCCTT TGGATTTTTA GCTTCTGCAA AATGCACTAA TGAAGATAAT TACCTGTTTC AAAAATTTGC GAGAGCGGTA GTAGGTACAA ATAATATTGA TCATTGTGCC CGCCTCTGAC ATGCTCCTAC AGTGGCCGGT CTGGCCACAC AGTTTGGTAG TGGAGCAATG ACTAACAGTA TCGGAGAAAT AGAAGATACC AGTACGATAT TTGCTATTGG AACTAATACT ACAGAAGCTC ATCCAATTAT CGCTCAAAAA GTATTTAAAG CTCAAAATAA AGGGGCAAAA TTGATTGTCG CTGATCCAAG GAAGATAGAG ATAGCTGAAA AAGCGGATAT ATGGTTGAGA CCTTTACCAG GAACCAATGT AGCTTTACTG AATGGAATTA TGAAAGTTAT TTTGGAGAAA GATTTGGTCG ATAAAGAATT TATCCGAAAC AATACTGAAG GTTTTGAAGA AGTTAAAAAA CAACTTGAAC AGGTGTCATT GGATGAAATT GAACAGATAA CCCAAGTACC CAAGGACAAA ATTGAACAAG CTGCTATCAT GTATGGAGAA AGCGATAAAG CTTCTACTCT GTACACTATG GGAATCACTC AGCACACCAC TGGAACCGAT GCAGTTTCTT CAATTGCGAA CCTGGCATTG ATGACAGGCA ATGTCGGTCG AGAAAGTACT GGTGTTAATC CATTGAGAGG TCAAAATAAT GTCCAGGGAG CCTGTGACTT GGGTGGATTA CCCAATGTAT TAACTGGTTA TCAAAAAGTA GCAGATCCTG AAACAGTCTC CAAATTTTCT CAAGAATGGG GACAGGAGCT AAATGATCAG CCGGGTATGG CTGTTACTGA AATGCTAAAA GCAACTGGTG AAGATTTAAA AGCTATGTAT ATTATGGGTG AAAATCCTAT GGTCACAGAT GCTAACTTAG GTCATGTGGA AGAAGCCCTT GATAGCTTGG ACTTTTTAGT TGTTCAAGAT ATCTTCTTAA CAGAGACTGC AGAAAAAGCA GATGTAGTGT TACCAGCTAG TTCTTTTGCG GAAAAGGATG GAACATTCAC TAATACTGAA CGACGTGTTC AAAGAGTTCG AAAGGCCATA GAATCAGTGG GAGATAGTAA GCCAGACTGG CAAATCATAG CTGACTTATC TCAACAGATG GGTTATGAAA TGAATTACAG CAATCCTCAA GAAATCATGG ATGAAATTAG GAAATTAACA CCAAGCTATA GTGGTATTTC TTATGACAGA ATTGAAGATC AAGGGATTCA ATGGCCCTGT CCATCTGAAG ACCATCCTGG TACTAAATAT CTTCATAAAG AAGGTAACTT TGCAATAGGC AAAGGACAAT TTAAGGCCGT GGATTATCGA GAACCAGCTG AAACAGCAGA TGATGAATAT CCATTTGTAC TGACTACAGG GCGAATGCTG TACCACTATC ATGCTACTAT GACCCGAAAG GTCAGAGAAC TTAATGAAGA AGTCCCTGAA GGTGATATTG AAATAAACAC GCAAGATGCT GAAAAATTAG GTATTGAAAA TGGTGACCAA GTAAAAGTCT CCTCCCGTAG AGGTGAAGTT GTGACTGTTG CAGAGGTTAC TGATAGAGTG GCTCCAGGAG TTGTTTATAT GGATTTCCAC TACAAAGAAG CAGCAGCTAA TAGGCTAACT AATGACGCCC TAGATCCTGC GGCTAAAACC CCAGAACTAA AAGTCAGCGC CGTCAAAGTT GAAAAATCTT AA
|
Protein sequence | MSLVTVTIDG QSIQVPKTST VLDACEKLGK EIPTLCHQKD LTTVGACRLC VVEVEGAGNF PTACTLPVKD GMVINTNTDE VRHARKMVLE LLWANHPNDC LTCESNGNCK FQDYCYEYGV TESRFRGEVI EHEIDETSRF VERDLDKCIL CGKCIRVCHE IQGSEAIDFM DRGFETKVAT FYDKGLSDSP CVDCGNCINV CPVGALIPKP LKGKGRDYDF EKVKTTCPYC GVGCTFNLNV KDGEVVGVSP DEEAEVNDGY LCVKGRFGTG FIHNDDRLTQ PLVRKNGELV ETDWEEALNT VAENFKKLKE KHGGDAFGFL ASAKCTNEDN YLFQKFARAV VGTNNIDHCA RLUHAPTVAG LATQFGSGAM TNSIGEIEDT STIFAIGTNT TEAHPIIAQK VFKAQNKGAK LIVADPRKIE IAEKADIWLR PLPGTNVALL NGIMKVILEK DLVDKEFIRN NTEGFEEVKK QLEQVSLDEI EQITQVPKDK IEQAAIMYGE SDKASTLYTM GITQHTTGTD AVSSIANLAL MTGNVGREST GVNPLRGQNN VQGACDLGGL PNVLTGYQKV ADPETVSKFS QEWGQELNDQ PGMAVTEMLK ATGEDLKAMY IMGENPMVTD ANLGHVEEAL DSLDFLVVQD IFLTETAEKA DVVLPASSFA EKDGTFTNTE RRVQRVRKAI ESVGDSKPDW QIIADLSQQM GYEMNYSNPQ EIMDEIRKLT PSYSGISYDR IEDQGIQWPC PSEDHPGTKY LHKEGNFAIG KGQFKAVDYR EPAETADDEY PFVLTTGRML YHYHATMTRK VRELNEEVPE GDIEINTQDA EKLGIENGDQ VKVSSRRGEV VTVAEVTDRV APGVVYMDFH YKEAAANRLT NDALDPAAKT PELKVSAVKV EKS
|
| |