Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1408 |
Symbol | |
ID | 4601822 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1359691 |
End bp | 1363203 |
Gene Length | 3513 bp |
Protein Length | 1170 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 639774183 |
Product | formate dehydrogenase |
Protein accession | YP_920808 |
Protein GI | 119720313 |
COG category | [R] General function prediction only |
COG ID | [COG3383] Uncharacterized anaerobic dehydrogenase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGGAG AGATCGATAG GCGTAGCTTT TTGAAGCTGG CAGCCATAGC GGCGTCCGCG CTAGCTCTAC CGGTCGAAGC GCAACCAATA CCCCTTAAAC CCTGGAAGAC GCAGTGGGGC TTCGGGGAGA TAGGCGGCAA GGGGGATCTC GCGAAGGCGA GGATAACGCC AGTTATATGC CCGTACTGCT CGATGGGGTG TTCCATTGAT TTCTACACGT TCGGCGACCA AATCCTGTGG ACCGAGGGGT CTCCCGACTC CTACATAAAC TTCGGCGCGC TGTGCCCGAA GGGTAAGGCG GCGTTCGAGC TAGTCGAAAA CGAGATGAGA GTTACGCAAC CGATGATAAG GACGGGTCCG AAGCCGCCTC CCGAGGAGAT TCTCAGCGCT AAGAGCTGGG ACGAGCTAGT AGCCCTTGTC AAGAAGTACC CGCCACAGTG GAAGCCTGTA AGCTGGGACG AGGCCTTCAG GTTCATAGCG AGCAGGGTGG CGAAGATACT CAACGAGTGG CGTAGCTCTA GCGGCGCGCC CAAGCAGCCT GACGGTTACT ACTACGTCGG CTCGAAGGTT CCCATACAGC TCATAGGCTC GTCCATAATG GTGAACGAGG CGGGCTACCT TACAACTAAG CTCGCCGAAT TCCTGGGGAC TACGAACGTA GACTCTCAGT ACAGGAAGTG CCACAGTTCG ACGGTGTCGT CGCTGGCGCT GACCTACGGG TGGGGCGCGG AGACCGCGAC TATAGAGGAC GTGGCACTCG CCGACGTGGT CCTATTCTTC TCGAGCCCCG CCGAGGCCCA CCCGCTGTCG TTCCAGTACT TCCTCAAGGG GAAGAAGGAG AGGGGGACGA TATTCATCAC GTTCGACCCG CGGTACAGCA GAACCGCCAT GGCGTCCGAC CTGTGGGTTC CCTTCAGGAG CGGTACCGAC ACGGCGATCT TCCTCTACAT CCTCCACTAC GCCTTCTTCG AGAGGGATCC CCCGATAGAC TCCCTCGACG CGTTCAAGGC TTTAAGGTCG AGGTGGAACG TAACGGACGA CGACCTGGCA GACTTCAAGG AGCTTATCAA GGAGTACGAC GCGGAGACCG TCTCCAGGAT TACCGGAGTC CCGGTCGACA TGCTTAGAAC TGTTGCCCGG ATCTACGTCG AAAACAGCGG TGTGGCGACT AACCACAAGA AGCACGGCGT CGTCCAGTGG GCTATGGGGA TGACGCAACA CACGAACGCC ACCATAAACA TAATCAGGGC CGCTGCGATA ATGCAGCTCC TCTTGGGCAA CGTAGGGTTC CCGGGCGGCG GCGCCCACCC GTTCAGGGGG CATAGCAACG TGCAGGGAGT CACCGACGTC CAGGGAGGAG GGCTGGGCGC CCTCCCAGGG TACCACGCGT CCCCCTCCTC TACGTTCTAC GTGCGCCTCT ACCAGGATTG GAAGCTCCAG GGGATGCCGG ACGCCTGGAA CTGGGTCGTC CCCGAGTGGG CGAGGAAGAC CTTTACAACC ACGACGCCCG ACAAGGGTAG CGCTGACCTC ACGAAGATAC TCCAGGTGTA CACGTTCTAC GGCTGGAGGA GGTTCGAGCT TCTCTGGGGC TTCTTCTGCG GAACGGTTCC GGAGGACGAC CCCGTCAACG GGACCGTCGT ATGCGACTTC CCGTTCGGGA CGGGGTCCAC GGAGATAACC TTCCCGAGGA GAGTCCTCAA CGGCGAGATA AGGGCGGCTT TCATATTCGG GGAGAACCCC GCGGTGACCA ACCCGAACGC AAAGGTGATA TGGGCGGCTC TCTCGAAGCT CGACCTCCTC GTAGTGTCGG ACATATTCGA GACTGAGACC GCGTGGTTCG CCGACGTGCT CCTACCGGCG TCCTCCTTCG CTGAGGTGGA GGGCACGAAG ACGAACGGAA ACAGGGTTAT ACAGTGGACC TACGCCGCTC TGAACCCCAG GGGCGAGTCT AGGCCCGATT ACTGGATAAT CACCAAGCTA TTCCAGTACC TGAGGAACTA CGGCGCCGTG AAGCTACCGA GCGAGGTCTT CGGGTTGAAG AGCGAGAAGG TGAAGGTGAG AAAGGGCGGC AGGGTCGTCC TCCTCTACGA GCGCCCGCTC AGGCCCGACG CGAGCTGGGA CTACTCTGGA GGCAAGGGCG CCGCGTCGCC GATAAGGAGC ATCGAGGCAG AGGTCAACCC CAGGATAATA AACAAGGAGA TAAACTTCGC GGTTCTCATC TACCAGGGGA TGTACGACCC GGTTAGGGAC GAGTTCACCT CGATGCGGAG GGACAAGAGG CTGAGGCAGC CGGGCGAGAT CGACGGGCTC TTCTCGTCTA CGTTCAAGGT CTACAAGAAC TGGGGCTGGT CCTGGCCCAT GAACGTCAGG ATACTCTACA GCTACACAGG CCTCGCCGAC ACTCTCGGCA CGACGGACAC CGTGTACGCG GCTGGGCGTC AGTGGCAGGC CACCGGGGAG ACCGGGGAAT GGATAGACGA GTACACGGGC GAGTACAGGC CCGCCTTCAT ACCTGGGCAC AACTTCTGGC TTCCCAGAGC GTTTAAACGC AGGCTAAGCG GGGTTGCCGA CCTCTACGGG GGGATCGACG TCATGCACCT CATAAGGCAC AACGAGCTGA GGCCTCTGGG GCTCTTCGCT GTCGAGGACG GCGGCGAGGT AAAGCTGCTC ACGTTCGAGG AGTACGTCGC GAGCACCGGC ATGAAGTACC TCTGGGCCAA CGATACCCTG TACTGGGATC AGGACACGGC GATAGCCGTG AAGGCTACCG TGAAGAGGGC CTTCTTCCCG GGCGGCGGTT GGAGGCAGTT CAAGCCCACC TACGAGCAGA TGAGGGCTAC CCTGAAGAAG TACTACGAGC AGACGGGTAA CATGAGGGAC GCCGTGAACA AGACGATCCA GGAGCTGAAA GGGTGGTACC CGGGCTACTC CTTCACGTGG CCTATACACA CGGAGCCCGT CGAAAGCCCC GACCTGGAGA TGGCGATACG GTACCCGACC CTCGCCTGGC TGAACAGCTA CAACCTCCAG GTACTCAACG AGCAACCCGA CATCGTGAGG GGCAAGCCCG TCGGGGTTGC GCTTACACCG CAGGACCTAT CGAGCATCCC GGGAGAGCTC GTGGTTATAA CCACGAACAG GCTCACGGAG CACTGGCACA GCGGGTCAAT GACTAGGAGG ACTCCGTTGC TGGCAGAGCT GGATCCGGAG CCCTTCGTCT ACGTTCCGAG GGAGCTTGCG AGGAAGCTGG GCGTGAACTC GGGGGACTAC GTGGAGATAA TCACTGCTAG AGGGTCGATA AAGATGAGGG CATACGTGAC GGAGGGCGAG GCCTACCTAA CAGTGAACGG CAGGCAGCTA CCGCATGTAA ACGTTGTGTG GGCGTTCAGC TTCCTCGGGT ACGTGACCGG CCCCCAGGGG AACTTCATCT CGCCCGACGT AGGTGACGTG GTTACGACGA TCCAGGAGAC TAAGGCTTGG ATCGGTAAAA TTAGGAAGGC GGAGGTGGTG TAG
|
Protein sequence | MPGEIDRRSF LKLAAIAASA LALPVEAQPI PLKPWKTQWG FGEIGGKGDL AKARITPVIC PYCSMGCSID FYTFGDQILW TEGSPDSYIN FGALCPKGKA AFELVENEMR VTQPMIRTGP KPPPEEILSA KSWDELVALV KKYPPQWKPV SWDEAFRFIA SRVAKILNEW RSSSGAPKQP DGYYYVGSKV PIQLIGSSIM VNEAGYLTTK LAEFLGTTNV DSQYRKCHSS TVSSLALTYG WGAETATIED VALADVVLFF SSPAEAHPLS FQYFLKGKKE RGTIFITFDP RYSRTAMASD LWVPFRSGTD TAIFLYILHY AFFERDPPID SLDAFKALRS RWNVTDDDLA DFKELIKEYD AETVSRITGV PVDMLRTVAR IYVENSGVAT NHKKHGVVQW AMGMTQHTNA TINIIRAAAI MQLLLGNVGF PGGGAHPFRG HSNVQGVTDV QGGGLGALPG YHASPSSTFY VRLYQDWKLQ GMPDAWNWVV PEWARKTFTT TTPDKGSADL TKILQVYTFY GWRRFELLWG FFCGTVPEDD PVNGTVVCDF PFGTGSTEIT FPRRVLNGEI RAAFIFGENP AVTNPNAKVI WAALSKLDLL VVSDIFETET AWFADVLLPA SSFAEVEGTK TNGNRVIQWT YAALNPRGES RPDYWIITKL FQYLRNYGAV KLPSEVFGLK SEKVKVRKGG RVVLLYERPL RPDASWDYSG GKGAASPIRS IEAEVNPRII NKEINFAVLI YQGMYDPVRD EFTSMRRDKR LRQPGEIDGL FSSTFKVYKN WGWSWPMNVR ILYSYTGLAD TLGTTDTVYA AGRQWQATGE TGEWIDEYTG EYRPAFIPGH NFWLPRAFKR RLSGVADLYG GIDVMHLIRH NELRPLGLFA VEDGGEVKLL TFEEYVASTG MKYLWANDTL YWDQDTAIAV KATVKRAFFP GGGWRQFKPT YEQMRATLKK YYEQTGNMRD AVNKTIQELK GWYPGYSFTW PIHTEPVESP DLEMAIRYPT LAWLNSYNLQ VLNEQPDIVR GKPVGVALTP QDLSSIPGEL VVITTNRLTE HWHSGSMTRR TPLLAELDPE PFVYVPRELA RKLGVNSGDY VEIITARGSI KMRAYVTEGE AYLTVNGRQL PHVNVVWAFS FLGYVTGPQG NFISPDVGDV VTTIQETKAW IGKIRKAEVV
|
| |