Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0114 |
Symbol | |
ID | 4600646 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 89300 |
End bp | 90499 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 639772868 |
Product | FAD dependent oxidoreductase |
Protein accession | YP_919527 |
Protein GI | 119719032 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00154267 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGGCCTTA TAGAGGGAAG GTACGAGGCA GTAGTGATAG GGCTAGGACC CGCCGGGGCG GCGGCTCTCA AGAGACTACA GGAACTGGGG ATCAAGGCTG TAGGTATCGA GAGGAAGAAG GCTCCAGAAG AGCCGGCTGT CTGCGGAGAG TTCCTTCCAG AGCCCTCCGC GATAGAGTTC ATCTCCAGGT TTCCCTCTGT GAGAAAAGCC TACGAGTACA TAGGGCTGGC TAGGCGTACG AACGCTATAC GCAGAATCAT ACTCAGCTTT GCCGGCGGGA AGACTTACAA CCTCGAAATC CCAGGATTCA CAGTGAGTAG AAAGGAGATG GTGAGCAAGC TGATCGAGGG GTCTGACTAC GTAACGGGTA GCGACGTTGT TGGAATCAGG CGCGTCGGAG ACGCCTACCT GGTGAAGACG AGGAGGGGTA AAGAGTTCTC GGCGAGCTAC GTGATAGCCG CGGATGGTTT TCCTTCAGCT ACGAGGAGAC TTCTAGGACT CCCAGCAGGC CTGCAACCGG AGGACTACGC CGTGGGGGTC AACCTGAAGA TGGAGACTCC GAGCATGCCC AGAGATACCA TCTTCATGTA CGCTTCCAGC TTTACCCAAG GAGGTTACGC ATGGATAATA CCGGTTGGAG ACGGCTTATC GAACGTCGGC ATAGGTCTGC GCTTCAACTA CGTGAAGAAA GGGGCTAACC CCTTAAAGGC TCTCACCAGC TTCGTGGAGC TGGAAAGGCG GGGCTTCCTA GACTCCTCCA GGCAGCTCGA AGAGCCTCGC TCTAGGATGA TCCCCGTAAG CGGCTTCTAC TCGGAGCCTT CGACGGGGAA AGTGCTTTTC GCCGGTGACT CTTTGGGGGC GGTTAACCCC ATTAACGGGG GCGGGATATT CACAGCGATG GCTCTAGGGA TTCTCGCGGC GGAGAGCGTC TGGCTGGGGA ACCCCGGGAT CTACGACGAG AGGTCCTGGA GGGAGATAGG GCAGGTACTC TCGATAGGGC GTGCCTACCG CGTGCTGGTA GACTTTCTCT ACGAACACTT CGACCACGTA GCAGTGCTAA CGGCTCTCTT CCCGAGGTTC CTCTTAGAGA AGATACTGAA AGGGGAGAAA ACGCCTATTA AGGGGATTCT CGAGATTGGA ATAGGAAGGA AAACCTCCTA TCGACCTCCA CGGGGTCAAG CTTCTCGTAA GCCTCCGTAG
|
Protein sequence | MGLIEGRYEA VVIGLGPAGA AALKRLQELG IKAVGIERKK APEEPAVCGE FLPEPSAIEF ISRFPSVRKA YEYIGLARRT NAIRRIILSF AGGKTYNLEI PGFTVSRKEM VSKLIEGSDY VTGSDVVGIR RVGDAYLVKT RRGKEFSASY VIAADGFPSA TRRLLGLPAG LQPEDYAVGV NLKMETPSMP RDTIFMYASS FTQGGYAWII PVGDGLSNVG IGLRFNYVKK GANPLKALTS FVELERRGFL DSSRQLEEPR SRMIPVSGFY SEPSTGKVLF AGDSLGAVNP INGGGIFTAM ALGILAAESV WLGNPGIYDE RSWREIGQVL SIGRAYRVLV DFLYEHFDHV AVLTALFPRF LLEKILKGEK TPIKGILEIG IGRKTSYRPP RGQASRKPP
|
| |