Gene Information |
Locus tag | Tpen_1505 |
Symbol | |
ID | 4601206 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1453078 |
End bp | 1454268 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 639774280 |
Product | cytochrome bd-type quinol oxidase subunit 1-like |
Protein accession | YP_920905 |
Protein GI | 119720410 |
COG category | [C] Energy production and conversion |
COG ID | [COG1271] Cytochrome bd-type quinol oxidase, subunit 1 |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00635225 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGCCC CCGTGCTCTT CATGGCGCTG GTATTCGGGG TACACATCGT TGCAGTGAAT ATAGGGATAG CGCTCTCGAC GATAGTGCCG CTCCTCGTCA AGAGGTCTAG GGCGCTCGGT GACAAGGGGC TAGAAGGCGT GGCGCGCTCC TTGTTCAAAG TGTACGCGGC TACGTACGGG CTGGCGGGAG TCATGGGTAC GGCGTTCACA GTTTTCCTGG CGAGCTTCTA CCCAGAATTC GTCGGCGTAG CCGGGAACCT GACGATGGTA CCATTCGGCA TATCGATTGT ATCCATTATG CTCCACTTCT TCGCCATAGT CGCGTTCTGG TACGGCTGGG ACAGGTTCAG CCCGAAGGTC CACGAGGCCG TAGGCTGGCT ACTAGCGGCC ACAGCGTACA CGATACCGCT AGGCTTCAGG GCTGTCTTCG CTTTTCTCAA CACGCCCGCG GGGCTCGTTC TCGGGGAGAA GCCGAGCCTC AACGTAGTTC AGGCGCTTCT CAACCCGACG TTCCCGCCGC TCTACCTTAA AAGCGTGGTA GGCGCCCTCG TTGCTGGCTT CCTCTTCGTA GCGGCGGTAC TGGCGTTCAA GGGGCTTAGG ACAGCCTTGA CGGACGCTGA GAGGGGGTTG TACGTCTTCA GCCTCCGCTA CGCCTCGATG CTTCTCTTCG CGATGATGTT CCTAGGCGCC TGGTACGCAG TCTCCCTAGT GAACGTCCCT GTGAAGTTCA ACAATATCTT CGCCCCGCTG GGCGCCTCGC TACCAGCACC TACAACGTCT AACTACGCCT GGCTCTTCCT CGTGAAGATG GTTCTCGTAG CTGTGCAGGG CGCCGCACTG CTACTGGTGC TGCTCCCGCG CGGCAAGGGA GAAAGCCTCC TAGAATCCCC GGCGGGCTAC AAGGCAACCA TAACCGCGGC AGTAGCGGCT GCCCTAACCG TGCTTACAGG GGAATACCTC AACGCCTTCA GCCAGTACCC GTTCTTCGTA GCTAACGCTC CCCTTATAGC CGACAAGCTC CCGGAACCCT ACAGGACCAT CCTCACGCGA GCGTTAAACC TTGAAAATGT CAGCCCACTA GCCCAGGATC CGGCGCTCTA CGCGGCAACA GTGCTGGGCG TGCTCGTGCT CTTCGCGGCA GCCGGGTACA TGATCTACGT GGTCTTCTTC AAGAAGGGAG GCGAGGAGTA A
|
Protein sequence | MNAPVLFMAL VFGVHIVAVN IGIALSTIVP LLVKRSRALG DKGLEGVARS LFKVYAATYG LAGVMGTAFT VFLASFYPEF VGVAGNLTMV PFGISIVSIM LHFFAIVAFW YGWDRFSPKV HEAVGWLLAA TAYTIPLGFR AVFAFLNTPA GLVLGEKPSL NVVQALLNPT FPPLYLKSVV GALVAGFLFV AAVLAFKGLR TALTDAERGL YVFSLRYASM LLFAMMFLGA WYAVSLVNVP VKFNNIFAPL GASLPAPTTS NYAWLFLVKM VLVAVQGAAL LLVLLPRGKG ESLLESPAGY KATITAAVAA ALTVLTGEYL NAFSQYPFFV ANAPLIADKL PEPYRTILTR ALNLENVSPL AQDPALYAAT VLGVLVLFAA AGYMIYVVFF KKGGEE
|
| |
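The length fields in this record can be cross-checked against the coordinates and the sequence. The sketch below is a minimal illustration, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS is unspliced and ends in a single stop codon (the record lists "Is gene spliced: No", and the gene sequence ends in TAA). The function names are hypothetical helpers and not part of any database API.

```python
# Consistency checks for a gene record.
# Assumptions (not stated by the record itself): coordinates are 1-based and
# inclusive, and the CDS ends in exactly one stop codon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp for 1-based inclusive coordinates."""
    return end_bp - start_bp + 1

def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1

def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string; spaces and newlines are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(b in "GC" for b in bases)
    return 100.0 * gc / len(bases)

if __name__ == "__main__":
    length = gene_length(1453078, 1454268)
    print(length, protein_length(length))  # 1191 396
```

Passing the Gene sequence field (with its spaces) to `gc_percent` gives a value that can be compared with the reported GC content after rounding.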