Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1086 |
Symbol | |
ID | 4600934 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1021055 |
End bp | 1022557 |
Gene Length | 1503 bp |
Protein Length | 500 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639773863 |
Product | proton-translocating NADH-quinone oxidoreductase, chain M |
Protein accession | YP_920488 |
Protein GI | 119719993 |
COG category | [C] Energy production and conversion |
COG ID | [COG1008] NADH:ubiquinone oxidoreductase subunit 4 (chain M) |
TIGRFAM ID | [TIGR01972] proton-translocating NADH-quinone oxidoreductase, chain M |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTGTAC CGTACCTCTG GTTGGCGTTG CTGGTACCCT TAGCGGCGTC ACTGGCCTCG CTGGCGTTAA AGAGCAGGAA AGCTCTAGCC GCGCTGAACT CCTCTGCCCT AGCGTTCTCG GCCGCCGTCC TCCTCCTACT CTACCTCACC AACGGCTCGA ACAGGTGGTT CGACCCCCTC TCGTTCAAGC TGGGAAGCCT GGGGACTTTC TCGCTGGTTA TGGACCCCAT GGTATTCCTG GTAGCGTTCA GCGTGGCGGT TACAACCTCT GTCATCGCGC TCTACAGCTC CCCCTACATG GAGCATAGAT TCGAGGAGCT TGAACGCGAA GGGGTCTCGG CGCCCGGGTG GGGGGCCTAC TACTTCCTCT ACACGCTCTT TGCCCTCTCG ATGATGGGCA CCGTCATGTC CACGAACATA ATTGAGTTCT ACGTGTTCCT CGAGCTAACG CTCATCCCCA GCTTCCTGCT GATAGCCTTC TACGGCTACG GCGAGAGGCT GAAGATAGCC ATAATGTACC TCATCTGGAC CCACGTGGGC GCCCTCCTCT TCCTCATAGG TGCGCTGACC GTGGGCTCCA AGGTGGGCTT CGACTTCGTA GATCCCGAGA AGGGCTTCCT CCTGGGGCTG GGCGAAGGCG TGGGCGTGCT CGCCTTCTGG CTTATGGTGG TAGGGCTCTC GGTGAAGCTA GCGGCGGCGG GCCTGCACAT GTGGCTCCCC TACGCCCACG CTGAGGCCCC CACGCCCATC TCGGCGCTCC TAAGCCCGAA CCTCATCGGG CTGGGAGGAG CGATGATGTT CCGCGTCGTC TACGTGCTTT TCCCGAAAAC CTTCGCCGCC GCCTCGCCGG TACTGATGGC GTGGGCCCTC GTAACGATGA TCTACGGCGG GCTTATGGCG CTGAGCCAGT CCGATTTCAA GAGGCTCCTC GCGTACAGTA GCATCAGCCA GATGGGCTAC CTCCTGCTGG GGCTCGCCTC GGTCGACGTG TACGGCGTCG CCGGGACGTT CCTGCACTAC ATGGTGCACG CCTTCGGCAA GGCTATACTC TTCGCCGTGG CGGGCATACT GATAGCCACG TACCACGGCC TGCGGGACAT AACGAGGATG GGAGGCCTCG CCTCGAAGAT GCCCTACACG GCCTCGCTGG CGCTCATCGG CTTCATGCAC ATCACGGGTA TACCTCCAAC CCTGGGCATC TGGAGCGAGT ACCTGATACT AAGAGGAGCC GTCGCGCACG CCCTAGCCCT TGGAGCCCCC TCGTACGTGC TCCTGGCGGC GGCCCTTCTC GTGGGTATAG GGCTCTCGAC AGCCTACTCC TTCCTGACGA TGAGGAGGGT GTTCTACGGG CCCCTAAAGG TACCTGAGGC GCGTGAGGCC GGTAAAGCCC TCTGGGCGCC GCTCCTAGCC TTCGCAGTGC TGGGCGTGCT GTTCTTCGTG TGCGCCTCCC TGCTCATAGA CCCGCTGGTC TCGTCGCTCG GAGGGCTCGG GCTGGGTGGT TGA
|
Protein sequence | MGVPYLWLAL LVPLAASLAS LALKSRKALA ALNSSALAFS AAVLLLLYLT NGSNRWFDPL SFKLGSLGTF SLVMDPMVFL VAFSVAVTTS VIALYSSPYM EHRFEELERE GVSAPGWGAY YFLYTLFALS MMGTVMSTNI IEFYVFLELT LIPSFLLIAF YGYGERLKIA IMYLIWTHVG ALLFLIGALT VGSKVGFDFV DPEKGFLLGL GEGVGVLAFW LMVVGLSVKL AAAGLHMWLP YAHAEAPTPI SALLSPNLIG LGGAMMFRVV YVLFPKTFAA ASPVLMAWAL VTMIYGGLMA LSQSDFKRLL AYSSISQMGY LLLGLASVDV YGVAGTFLHY MVHAFGKAIL FAVAGILIAT YHGLRDITRM GGLASKMPYT ASLALIGFMH ITGIPPTLGI WSEYLILRGA VAHALALGAP SYVLLAAALL VGIGLSTAYS FLTMRRVFYG PLKVPEAREA GKALWAPLLA FAVLGVLFFV CASLLIDPLV SSLGGLGLGG
|
| |