Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1081 |
Symbol | |
ID | 4601693 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1017576 |
End bp | 1018790 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 639773858 |
Product | NADH dehydrogenase subunit D |
Protein accession | YP_920483 |
Protein GI | 119719988 |
COG category | [C] Energy production and conversion |
COG ID | [COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.152309 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGAGTA CGAGTAGCGG TGTCGTATAC TCGAGGCTGT TACTCAGGGA GGGCGACGAG TCCGTCTACG AGCTCTTCAT AGGGCCGCAA CACCCATCCT CGGGCCACAT GAGGTTCATA GTGAGGCTTC AGGGAGACGT AATAGTGTCC GTCGACCCGG ACATAGGCTA CGTGCACCGC ACGATGGAGA AGCTCGCCGA GGGGCGGGAG GCCATAAAGG CGATACCGCT CCTCGAAAGG CTGACGATAA TAGACTCGCA TAACGCCACG GTAGGCCTGG TAACGGCGAT GGAGAGGCTT CTCGACGTGG AGCCTCCGCC GCGGGCCCTC TACCTCAGGA CTCTTCTCTC GGAGATAAAC AGGATAGCGA GCCACCTGTA CGGGATGGGC ATAGCCGGGA TCATGCTGAA CCACTCGACG ATGTTCATGT GGGCGTTCGG GGACCGCGAG GTGTGGCTCC AGCTCGCCGA GGAATTGACG GGGGCCAGGC TCACACACAC CTACAGCGTG CCAGGGGGTG TGAGGAGGGA TCTTCCGCAG GGCTTCGGGG AGAAGTTCGA GAAAGCAGCT AGGTACATGG AGAGGAGGTT GCAGGATTAC ATGAGGATCT TCCTGGAGAA CCCCCAGGTG GTCGCGAGGT ACGAAGGCGT AGGGGTGTTG AAGAAGTCCG AGGCCTCCAG GCTCGGGGTC GTGGGCCCGA ACCTACGCGC GAGCGGCGTG AAATACGACG CTAGGCTCGC GGACGACTAC GGTGCCTACA AGGACCTCGA GTTCGAGGTT CCAACCCGAG AGGAGGGCGA CTGTATGGCT AGGATGCTGG TTAGAGTGGA GGAGATAAAG CAGAGCATCT CGATCATACG CCAAGTACTC CGGAAGATGC CGGATGGACC CATACTCTCC GAGAAGTACC TCAAGCTCCT GCCGCCCAAG ACTCGCGAGA GGGTTTTGCA GGAGGGGAGG GTCAAGTTCC CGGCGCTCTT CGCCTCCCTG AAGTTGCCCG CCGGCGAAGC CGTGGCTAGG GCCGAGATGG GGCACGGCGA GATATTCTAC CACATCACGG GGGACGGGTC GGCGAAGCCG TACAGGCTCC GGGTCGTAAC GCCCTCCTTC AGGAACGTCA TACTGTTCAG GTACCTGGCC CCCGGTCACA GGTTTATGGA TTTCCCCGCG ATATACGGTT CTCTGGACTA CTTCCCTCCC GAGGCGGATA GGTGA
|
Protein sequence | MLSTSSGVVY SRLLLREGDE SVYELFIGPQ HPSSGHMRFI VRLQGDVIVS VDPDIGYVHR TMEKLAEGRE AIKAIPLLER LTIIDSHNAT VGLVTAMERL LDVEPPPRAL YLRTLLSEIN RIASHLYGMG IAGIMLNHST MFMWAFGDRE VWLQLAEELT GARLTHTYSV PGGVRRDLPQ GFGEKFEKAA RYMERRLQDY MRIFLENPQV VARYEGVGVL KKSEASRLGV VGPNLRASGV KYDARLADDY GAYKDLEFEV PTREEGDCMA RMLVRVEEIK QSISIIRQVL RKMPDGPILS EKYLKLLPPK TRERVLQEGR VKFPALFASL KLPAGEAVAR AEMGHGEIFY HITGDGSAKP YRLRVVTPSF RNVILFRYLA PGHRFMDFPA IYGSLDYFPP EADR
|
| |