Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1076 |
Symbol | |
ID | 4601688 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1013127 |
End bp | 1014788 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639773853 |
Product | proton-translocating NADH-quinone oxidoreductase, chain L |
Protein accession | YP_920478 |
Protein GI | 119719983 |
COG category | [C] Energy production and conversion [P] Inorganic ion transport and metabolism |
COG ID | [COG1009] NADH:ubiquinone oxidoreductase subunit 5 (chain L)/Multisubunit Na+/H+ antiporter, MnhA subunit |
TIGRFAM ID | [TIGR01974] proton-translocating NADH-quinone oxidoreductase, chain L |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.031536 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGGTGTAG CAGTTGTACC GGCTTTACTG ATACTTTCGA GCCTCGTATG CGCTGTGCTA GGACCGCTGT CGCGTCGTCT ATGCGAGGCT CTCGCGGTGC TCGCAACGTT CGTAGGCGCC TGCCTATCCG CCGCTTTCTT CGCCGGATAC CTGCCACCGG GCCCCGGAGC TTACGGGGAC TGGATTTCCC GCCTGATGCT AGCCGTAGTG AACGTCCTAG GCTTCCTCAT AACGCTGTAC AGCGTCGGCT ACATGTCGGG AGAGGCCGGC TACGTCCGCT ACTACTCGCT GATACTCCTG TTCATAGGCT CGATGTCGGG GCTCGTCCTC ACCGAGGACC TAGTCCTCCT CTACTTGTTC TGGGAGCTCG TAGGGGTATG TAGCGCACTG CTCATCGCGT ACTGGTGGGA GAAGCCGGAA GCAAGGAGGG CGGGCCTCAA GGCGTTCACC GTGACGAGAG TGGGAGACGT AGGGCTCCTA CTGGCTCTGG CAACGGTAAT CCAAGCTACC GGGACGACGC GCATCCCCGG GGTGATCGCC TCCCTCTCGG GCAACGCGCA GCTAGCCCTG ACGGTGTGCA GTCTCCTGCT CGTAGCGGCT GTGGGCAAGT CCGCTCAATT CCCCCTCTTC GTCTGGTTGC CGGACGCAAT GGAGGGGCCC ACGTCTGTCA GCGCCCTGAT ACACGCCGCG ACAATGGTCA AGGCCGGTGT CTACCTGCTC GCGCGCTTCT ACCCGCTCCT ACAGGTTGCG CCCGGGCTAG GCGAGCTCGT TGTCTACCTG TCCGTATTCA CGGCTCTACT CTCGGCGCTC GCAGCCCTGG GAGCCTCTGA CCTGAAGAAG GTATTAGCGT ATAGCACGAT AAACCACCTA GCGCTAATGT TCCTGGCGAT AGGTGCAGGC GCACTCGGAG CGGCGATGTA CCACTTGCTC GCGCACTCGA TGTTCAAAGC GCTCCTATTC CTCTGCGCCG GTCTAATAAT CCATGAGACG AAGACGAGGA GCCTTGACAA GCTCGAAGGG CTTTGGGACG CCGGGCTGAA GTTTACCGCT GTGGCGTTCC TCGTCGGGGC GCTAAGCCTT GCCGGGCTAC CCCCGCTCCC CGGCTTCTTC ACCAAGGAGG CTGTCCTCGC CTCGCTGGAA AGCGCTATAC ACGGCGAGGC TGTCCTAGCC CTCTGCTTCG CGCTGAGTGC TCTCTCCTCC CTATACATCT TCAGGCTGTT CTTCAGGCTA TTCACTGGTT GTTGCTTCAG GCCTCTACAC GAGGAGCTCG ACGCGATGAC TGTGCCTATA GCTGTGCTCG CAGTACTTAC CGCCCTAGGG CTACCAGTGT TACAGGCTGT GCATGCCTAC CTCGGAGTTC CCCTCGAGCT CTCCGAGGTG AACCTCCCGG CCGTCGTCGG CGCTTTCGCC GGGCTAGCCG TTTCGTACGC GGTGTGGGGT AGGCACGCTC TCGCCGAGCT GAGGCTGTTC CTGAGACCCC TAGCGCGGGT AGCGGACAGG GGCTTCTACT TCGACGACCT CTACACCTTC CTGGCAAAGA GGGTAGTCAG CATTCTCTCG GGCACGTTTA CGAGGCTTCA GTACGGCAAC CCAGCTGTTA ACACCCTGTG GCTCCTCGGC TTCCTCCTCG TCTTGTTGAT AGTTGTCCTG GGGGTGATCT AG
|
Protein sequence | MGVAVVPALL ILSSLVCAVL GPLSRRLCEA LAVLATFVGA CLSAAFFAGY LPPGPGAYGD WISRLMLAVV NVLGFLITLY SVGYMSGEAG YVRYYSLILL FIGSMSGLVL TEDLVLLYLF WELVGVCSAL LIAYWWEKPE ARRAGLKAFT VTRVGDVGLL LALATVIQAT GTTRIPGVIA SLSGNAQLAL TVCSLLLVAA VGKSAQFPLF VWLPDAMEGP TSVSALIHAA TMVKAGVYLL ARFYPLLQVA PGLGELVVYL SVFTALLSAL AALGASDLKK VLAYSTINHL ALMFLAIGAG ALGAAMYHLL AHSMFKALLF LCAGLIIHET KTRSLDKLEG LWDAGLKFTA VAFLVGALSL AGLPPLPGFF TKEAVLASLE SAIHGEAVLA LCFALSALSS LYIFRLFFRL FTGCCFRPLH EELDAMTVPI AVLAVLTALG LPVLQAVHAY LGVPLELSEV NLPAVVGAFA GLAVSYAVWG RHALAELRLF LRPLARVADR GFYFDDLYTF LAKRVVSILS GTFTRLQYGN PAVNTLWLLG FLLVLLIVVL GVI
|
| |