Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1631 |
Symbol | |
ID | 4600910 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1577130 |
End bp | 1579073 |
Gene Length | 1944 bp |
Protein Length | 647 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639774404 |
Product | formate dehydrogenase |
Protein accession | YP_921029 |
Protein GI | 119720534 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCGGCGC TGAAGTCTGT CTGCCCGAGG GACTGCTACG ACACATGCCA CCTAGCGGTC TCGCTGGACG GCGGAGAACT GAGAGTAGCG CCGGACCCGG GCTTCGCGTT TACGGCGGGC TTCCTGTGCC CGCGGGGAGC GGTCGAGGCT AGGAGGGTTT TCTCGGCGGG GCGCGTCCTC TTCCCGTACA GGAGGGCCGG GGGAAAGCCG GGAAGGAGCT TCGAGAGGGT TGAGTGGGGT TCCGCGCTGG ACGAGGTTGC TTCGAGGCTG AAGGAGGTTC TCGAGGAGCA CGGGCCGGGC GCCGTTCTGC ACTTAGAGTA CGCTGGGAAC ATGGGGCTGT TGACGTGGTA TTACCCTCAG AGGCTCTGGA ACTGGCTGGG CGCGGCGAGG ACTGACTACA GCATTTGCAG TAAGAGCGGG CACGAAGCCC TCTCCCTGCA CTACGGCTTG AGCTACGGCA GGACTCCCGA GGAGGCAGAG GGCTCGAGGC TCTTCGTGTT CTGGGGCTTT AACGCGAGTG TCAGCTCGCC GCACCTCTGG GCCGCGGCGC TGAGGGGGAG GCGTAGGGGG TCCGTGATCG CGGCAGTTGA CCCCAGGAGG AGCGAGACCG CGCTGAAGAG CGACTTCGCC GTTCACCCGA GGCCGGGCAC CGACGTCGCG CTGGCGTACG GCGTTATCAA CTACCTTATC TCGGAGGGGC TCTACGACGA GGATTTCGTA GAGCGCTACA CGGTCGGCTT CGAGGAGCTG AGGAGGGAGG CTTCGAGGTG GAGCCTCTCC AGGGTTTCGG GGGTGACGGG TGTAGGCGAG AAGGACGTTG CGAGGCTCGC GGAGCTCTAC GCCGAGCTGA AGCCTAGCAC TACCTTCATC GGGTTCGGGG TTCAGAAGGG TGTTAACGGG GCGGAGGCTG TGAGGGCAGT CTCCCTTATA CCGGCCCTCG TCGGCCAGCA CAGGGGCTTC TACTACTCGA ACAGTAGGAG GTGGCTCGTC GACCTCGCCG CCGTCACGGG CGAGAGGCAC GCGCCCCCGG GGAGGGTTGT CAGCCAGGTG GCACTCGCGG AGCTCGTGGA GAGGGGCGAG TTCAAGTTCA TCTACGTGTA CAACATGAAC CCCCTCTTGA CGTTGCCGGG GCAGGGTAAG CTGAGGAGAG GGCTTAGCAG GAGCGACGTG TTCGTCGTGC TCCACGACAC GCACTGGAAC GAGACCGCGG ACTACGCGGA CGTGGTTCTG CCCGCCGCCA CCTACCTCGA GAAGGACGAC GTGGTGATCC CCTACGCGCA CGGCTACGTG GCGATGTCCA GGAAGGTTAT CGAGCCCTTG GGGGAGAGTA GGAGCGAGGT CTGGGTTACG TGCGAGCTCG CCAGGAGGCT CGGAGCCCCC GAGTGGGTCT GCAGGGACCC CCTCGACGTC CTCAGGGAGG CTCTGGGCGG AGCGCTCGAG GGAAGCTTCG AGGACCTACT AGCCGGCAAG ACCCTTAGGC TCAAAGCCAG GAGGCTAGAC GAGTACCAGA CCCCCTCCGG GAGGATCGAG CTCTACTCGC GGAGGGCCCT CGAGCTGGGT TTCAGCCCGC TACCGGTGCA GGGGGAGTAC GACGGGGAGG GCTTCGTCCT CTTAAACTCC GCCACGCCCC TCTACACCCA CACGCAGTTC CGCGACGTCT ACGGCCCCAT ACCCGCCGTC GTACACGTGA ACCCGGTGGA CGCCGAACGG CTCGGAGTTA GGGACGGGGA CCTCGTCGAG CTCTACAACG AGCATGGAAG CGTGGTTGTA AAAGCCCAGG TAACGGAGCT CGTACCCCCG GGCGTCCTCT GGTCCCCGCG CCAGCTAGTC GGGCTGGACG GCTCCCCCCA GAACTCGCTC GTACCCACGG AGACGCAGAG GATAGGCGGA GGACCGGTCT TCAACTCTAC GAGGGTATTC GCGAGACCCG CGCGGGTAAT TTAA
|
Protein sequence | MAALKSVCPR DCYDTCHLAV SLDGGELRVA PDPGFAFTAG FLCPRGAVEA RRVFSAGRVL FPYRRAGGKP GRSFERVEWG SALDEVASRL KEVLEEHGPG AVLHLEYAGN MGLLTWYYPQ RLWNWLGAAR TDYSICSKSG HEALSLHYGL SYGRTPEEAE GSRLFVFWGF NASVSSPHLW AAALRGRRRG SVIAAVDPRR SETALKSDFA VHPRPGTDVA LAYGVINYLI SEGLYDEDFV ERYTVGFEEL RREASRWSLS RVSGVTGVGE KDVARLAELY AELKPSTTFI GFGVQKGVNG AEAVRAVSLI PALVGQHRGF YYSNSRRWLV DLAAVTGERH APPGRVVSQV ALAELVERGE FKFIYVYNMN PLLTLPGQGK LRRGLSRSDV FVVLHDTHWN ETADYADVVL PAATYLEKDD VVIPYAHGYV AMSRKVIEPL GESRSEVWVT CELARRLGAP EWVCRDPLDV LREALGGALE GSFEDLLAGK TLRLKARRLD EYQTPSGRIE LYSRRALELG FSPLPVQGEY DGEGFVLLNS ATPLYTHTQF RDVYGPIPAV VHVNPVDAER LGVRDGDLVE LYNEHGSVVV KAQVTELVPP GVLWSPRQLV GLDGSPQNSL VPTETQRIGG GPVFNSTRVF ARPARVI
|
| |