Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1082 |
Symbol | |
ID | 4601694 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1018797 |
End bp | 1019810 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 639773859 |
Product | respiratory-chain NADH dehydrogenase, subunit 1 |
Protein accession | YP_920484 |
Protein GI | 119719989 |
COG category | [C] Energy production and conversion |
COG ID | [COG1005] NADH:ubiquinone oxidoreductase subunit 1 (chain H) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.524281 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGCTAG TCGAGTGGAT AGTAAGGCTC GTGCTCTCCC CCGGGGTGTT CGCCCCGGTG ATATTCCCGG GTTTGCTCAC AGCGCTGGCG GTACTGCTCA TAGTAATATG GGCCGAGAGA AAGATAGCCG CTAGGGTCCA GATGCGCGTT GGACCCCTCT ACGTTACGAG GCACTTCGGA GGAGTACTGC AGATGCTGGC CGATGGTACG CGCTACATGT TCCAGGAGTT CATCGTACCG GAGACCGCGG ACAAGGTTCC CTACATGCTT GCACCGGCGC TCGCGCTAAC CCTGGCTATA GCCCCCTTCG CCCTGATACC CTCTGCTCCG GGCTTCGCGC CGGTAAAGTC TCCGTACTCC CTCCCCGCCC TGCTGGCTAT CCTGGCGAGC ACGCCGCTCT CAGTCCTGCT CATGGGCTGG TCTTCTAACA ACAAGTTCAG CATACAGGGC GCTGTCAGGG AGGCGTTCAT GACTCTAGCC TACGAGGTCC CCCTCTTCCT GTCGGCTCTC TCCATGGCTA TACTCTACGG GTCGATGGAT CTCGAAGAGA TCGTTGGCAG GCAGTTCCTG CCCGGAGCGC TCCTGAACCC CGTCGCGGCC TTCACCTTCT TCGTCGCGAT GGTCATGAGC TCCGGGAGGC TACCCTTCGA CATTGTTGAG GGCGAACAGG AGATAGTGGC AGGCCCCTAC GTCGAGTACA CGGGGATCGT TTTCGGGATA GGCATGGGCT TAGCGTACCT AAAGCTCTAC GCCCTCTCGC TACTCTACTC GCTACTCTTT CTGTCGGGCT GGGAGCCCCT GCCCCGCGCC CTCTACTCGG TGTACCCCGG GCTCGCCGGC GTATGGCTCT TCGCGAAAGC GTTTACGCTG ATGCTGTTCG TAGTCTTCCT CAGGTCCGTG TACGGGAGGT ACAGGCTGGA CCAGGCTCTC CGCGCAGGGT GGAGGGTGTA CCTGGTCCTA GCCGTTGTTT CGATTCTTCT TTCATGCTTG CTTAGGGTGG TGGTGAATGT CTAG
|
Protein sequence | MGLVEWIVRL VLSPGVFAPV IFPGLLTALA VLLIVIWAER KIAARVQMRV GPLYVTRHFG GVLQMLADGT RYMFQEFIVP ETADKVPYML APALALTLAI APFALIPSAP GFAPVKSPYS LPALLAILAS TPLSVLLMGW SSNNKFSIQG AVREAFMTLA YEVPLFLSAL SMAILYGSMD LEEIVGRQFL PGALLNPVAA FTFFVAMVMS SGRLPFDIVE GEQEIVAGPY VEYTGIVFGI GMGLAYLKLY ALSLLYSLLF LSGWEPLPRA LYSVYPGLAG VWLFAKAFTL MLFVVFLRSV YGRYRLDQAL RAGWRVYLVL AVVSILLSCL LRVVVNV
|
| |