Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_0704 |
Symbol | |
ID | 6316418 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | + |
Start bp | 731308 |
End bp | 734010 |
Gene Length | 2703 bp |
Protein Length | 900 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 642643084 |
Product | formate dehydrogenase, alpha subunit |
Protein accession | YP_001916884 |
Protein GI | 188585339 |
COG category | [R] General function prediction only |
COG ID | [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 72 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATGACTG ACAGTCAAAA CCAAAACACA ATTACAATCA TGGCAGATGG TAATTCATAT CAAACAGAAC CAGGAGAAAG TATTCTCAAT GCCTTGGAAA GGCACGGATT TCACATTCCT ACTCTTTGTT ACGAAGAGGG ACTGCCAATT ATTGGCGCCT GCAGGTTATG TGTGGTAGAA ATTTCGGGTG CCAGAAAACT ACACCCAGCA TGTAGTACAC CGGTGGCAGA TGGTATGAAT ATTTTTACTG AATCAGAACA GGTTGTTGAG GCCAGAAGAG AGATTTTAAA GCTACTGCTT GCAAACCATG ATTTGAGATG TCTTACATGT GAGAGAAATG CCGATTGTAG GCTGCAGGAT TACTGTTACC GCTATAAAGT GGAAGATACC CCTTATGTGG GAGATACTAA AAACTATCCT ATAGATAGGT CAAACCCATT TTTCTTTCGA GATTACGGAA AATGTATCAT GTGCGGAAAA TGTGTTAGTA TCTGCGCAGA AATAAATGGG GCCTATGTCT ACGATTTTAT GGATAGGGGC TTTGATTCGA AGGTTACCAG TGCTTTTGAC GATGAATTAC AGAATACTAC TTGTACTTTT TGTGGTATGT GTGTCAACGT TTGCCCCGTA GGAGCTTTGG TAGAGCGTTC TGTCGAGTGG CGTTCTAGAC CCTGGGAGGT AGATAAAGTA CTGACTACCT GTCCTTACTG CGGAGTGGGC TGTCAAATGT ATTTAAAAGT CCAGGACGGA GAAATAGTGG GTGTATCAAG AAAAAAAGAT TCATTTACTA AAGGGCATCT ATGTGGAAAG GGTCAATTCG GTTGGGATTT CGTTCATTCA AAAGATAGAT TGACTAAGCC CATGATTAGA AAAGATGGCC AGTTAACCGA GGTTAGTTGG GATGTTGCCA TGGAGTACAT AAGTGAGAAT TTATCAAGAC TTTATAATGA AAGTGGCAGT GATGCCATAG CAGGACTTAG TTCGGCCAAG GTGACTAATG AAGAAAATTA TCTGTTTCAA AAATTATTGA GAACAGCTTT CAAGACTAAT CATATTGACC ACTGTGCCAG GCTATGTCAC AGCTCCACTG TTTGGGGGCT TGCCACTAGC TTCGGAAGCG GTGCTATGAC TAATTCAATC AATGAGCTTT TGGAGACAGA TTGTATTGTA GTGATAGGTT CAAATACTAC TGAAGCTCAT CCCGTTACAG GTTACAGAAT AAAGCAGGCC AATAAAATGG GAAAAAATTT GATAGTGATT GATCCCAGAA AAATAGAATT AACTAATTAT GCCGATGAAT GGCTTAAACT GAGACCCGCT ACTAATGTAG CTCTACTGAA TGGTATTGCC AGAGTAATAT ATAAAGAAGG CTTATGGGAC CAAGAATTTG TAGAAGCTAG AAGCGAAGGA TTTCAGGATT GGATAGAATC TATTGAAGGA TATACACCTG AAAGAGTCCA AGAAATAACG GGTGTTCCAA AGGATAAACT GATTAGAACC GCACGACTAA TAGGTAATAG CAATAAAGTG ACTTTTGTTT ATGCTATGGG AGTTACCCAG TATACTCAGG GAACTCAGAA CGTTTTATCT ATAGCTAACT TGGCAATTTT AACAGGTAAT GTAGGACGAA GAGGTACAGG TGTAAATCCC TTGAGGGGAC AAAACAATGT TCAGGGTGCC TGTGATATGG GAGCTTTACC AAACTACTTC CCTTCATATC AACCTATCGA GGATCCAGTG GCTCGGGATA AATTTAAAAA AGTTTGGGGT ACTGTTCCAG AACCCAAACC CGGGTTGACA GTAACGGAAA TTATTGAAAA AGCAGGTACC GGTGATATTA GAGGACTATA TATTATGGGG GAGAATCCGG CAGTTAGTGA TCCCGATGCT GATGAGGTTG AAAGATCATT AGAGGAAACA GAATTATTGA TTGTTCAAGA TATTTTCATG ACTGAAACGG CAAAACTTGC CGATGTAGTT CTACCTGCCA CTTCCTTTGC TGAAACGGAA GGTACTTTTA CCAATACTGA GAGAAGAGTA CAGCGCGTTA GAAAGGCAGT TGAACCTCCG GGTCAAGCTA AAACAGACCG GGAAATTTTA ACTCTGTTGG CTAATAGACT AGAACTTAAC TGGAATTACA ATAATGATGA AGAAGTGATG GATGAAATTA ATAAATTGGC ACCCTTTTAT GGCGGGATCA CTTTTGCTAG GTTAGAAAAC GAGGGCTTAC AGTGGCCTTG CTACCACACC TCACATCCTG GGACTGAATT TTTACACAAA GATCAGTTTT CAAGGGGAAA AGGTAAATTT CATCCTGTAA AATACATTCC CCCTTCGGAG GAGCCTTCAA AAAAGTATCC TTTAATCTTA AATACAGGGA GATGGCTTTA TCACTTCCAT ACTGGCACCA TGTCTCTAAG ATCTCAGAGG CTTAAATGGC TTCGTGATGA AGAACTAGCT ATGGTTAATC CCCAGGTTGC CAACGAATTG GATATAGAAG ACGGGGATAT AGTTAAGATT GCTTCCAGGC GCGGTAAAGT GAAATCAAAA GTTCAGGTTA CAGATCATGT GCCCCAAGAT ATGGTGTTTA TGACATTTCA TTTCCCAGAA ACCTTGACAA ATCGTTTAAC TACTAAGGCT AAAGATCCTA TCTGTAAAAT TCCTGAACTA AAAAGTAGTG CAGTTAGAAT CGAGAAAGAA TAG
|
Protein sequence | MMTDSQNQNT ITIMADGNSY QTEPGESILN ALERHGFHIP TLCYEEGLPI IGACRLCVVE ISGARKLHPA CSTPVADGMN IFTESEQVVE ARREILKLLL ANHDLRCLTC ERNADCRLQD YCYRYKVEDT PYVGDTKNYP IDRSNPFFFR DYGKCIMCGK CVSICAEING AYVYDFMDRG FDSKVTSAFD DELQNTTCTF CGMCVNVCPV GALVERSVEW RSRPWEVDKV LTTCPYCGVG CQMYLKVQDG EIVGVSRKKD SFTKGHLCGK GQFGWDFVHS KDRLTKPMIR KDGQLTEVSW DVAMEYISEN LSRLYNESGS DAIAGLSSAK VTNEENYLFQ KLLRTAFKTN HIDHCARLCH SSTVWGLATS FGSGAMTNSI NELLETDCIV VIGSNTTEAH PVTGYRIKQA NKMGKNLIVI DPRKIELTNY ADEWLKLRPA TNVALLNGIA RVIYKEGLWD QEFVEARSEG FQDWIESIEG YTPERVQEIT GVPKDKLIRT ARLIGNSNKV TFVYAMGVTQ YTQGTQNVLS IANLAILTGN VGRRGTGVNP LRGQNNVQGA CDMGALPNYF PSYQPIEDPV ARDKFKKVWG TVPEPKPGLT VTEIIEKAGT GDIRGLYIMG ENPAVSDPDA DEVERSLEET ELLIVQDIFM TETAKLADVV LPATSFAETE GTFTNTERRV QRVRKAVEPP GQAKTDREIL TLLANRLELN WNYNNDEEVM DEINKLAPFY GGITFARLEN EGLQWPCYHT SHPGTEFLHK DQFSRGKGKF HPVKYIPPSE EPSKKYPLIL NTGRWLYHFH TGTMSLRSQR LKWLRDEELA MVNPQVANEL DIEDGDIVKI ASRRGKVKSK VQVTDHVPQD MVFMTFHFPE TLTNRLTTKA KDPICKIPEL KSSAVRIEKE
|
| |