Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1761 |
Symbol | |
ID | 4601959 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1701319 |
End bp | 1702725 |
Gene Length | 1407 bp |
Protein Length | 468 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639774534 |
Product | hypothetical protein |
Protein accession | YP_921159 |
Protein GI | 119720664 |
COG category | [S] Function unknown |
COG ID | [COG3372] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.987591 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTTGCCGT CCAACCTCCT CGTGGCACGC GCGCGGCGGG GGCGCATCGA GCCCCTGCTA CTCGAGCCTT CCGGGCTACC GGTGGCCCTC GCCTCGGAGG TCCTGGAGCT CTTCGAGCGG GGGGTGGGCA AGGGCAGGAG GGAGCTTGAG GAGGGCCTCG CCGACCTGGA GGAACTCGCC CTCGAGCTGG GCCTCGACCT CAGGGTAGTC AGGGCTATGT TCACGCTCGC GGCGCGGCAC GCGAGGTTCG AGCCCCCGAA GGCCCCCGTC GACCCCGTGA AGGCGAGGAT GGAGGTCTTC GAGGAGGCTT GCAGGGAGTT CGGAGTAGCG GTCACGGAGG AGGAGAGGTC TACGGTGCTC CGGCGGGTAG CGGAGAGGCT CGGCTGCAGC GTCGAGGACC TGGAGTCGGT GCTCGGCGGG TACCTCGAGG AAGTACTGGC GGAGCCCCCG GCGCTGAAGG CCGAGGACCT CGTGAGGATG CTTAACCTCT CGATGGTGCA GACGCTGCTC TTCAAGGCCT CGCAGCTCGA AGTGGTCTTC AGGAGCGACG GCGCGACGGC GAAGAAGCTG CTGAGAGCGG TGAAGAGGCT GGGCCTACTC TACGTCGCGG AGCAGCTCGG CGAGGGGGTG AGGCTAACCG TGGACGGGCC CGCGTCGCTG CTGAGGCAGA CCAGGAGGTA CGGGACCAGG CTGGCGAAGC TAGTCCCACT CGTGATGCTG GCTGAGCGCT GGAGGATCAG GGCTAGGGTC CCGGCGAGGG GGAGGGACCT CTTCTTCGAG CTCGGCGACG AGAAGTCGGG GCTGTTCCCA AGGGTGGAGG AGGAGCCGGA GCCCCTCTTC GACAGCGAGG TCGAGAGGGA GTTCTACAGG AGCATCTCGA GCCTCGCGCG CGGCTGGAGG GTGGAGCGGG AGCCCGAGCC GCTCGTAGCC GGGAACAAGG TCCTGATACC CGACTTCTCG GTGTCGAACG GCGAGAGGAA GGTGTACATA GAGATACTCG GCTTCTGGAC CAAGGACTAC CTGGAGAGGA AGATGCAGAA GCTTAGAGAG CTACGCGGGG TCAACATCGT GGTAGCCGTG AACGAGGAGC TAGCGTGCTC CTCCGTGAAG GACCTGCCCC ACGACGTCGT GGTCTTCAAG GGGCGGCTGA GGGGGGCGGA CGTCTACCCG GTACTCAAGC GCTACCTCGG CGAGCCCCCC GAGGAGCCGC GAGAAGAAGT CGAGTACCGC CTGGAGGGGC TACCGGACCT CTCCGGGAGA ACACTCTCCG AGGCTATCAG GGAGCTCAGG AAGCTGGGGG TCCCCGAGAG CGAGGCAGTC AAGGTGCTCG AAAAGCTGGG CTACGGGGTC GACTGGACAA CGCTAGACCC CGACAAGCTC GTACTGAAAA AGAATAAACG CCGGTAG
|
Protein sequence | MLPSNLLVAR ARRGRIEPLL LEPSGLPVAL ASEVLELFER GVGKGRRELE EGLADLEELA LELGLDLRVV RAMFTLAARH ARFEPPKAPV DPVKARMEVF EEACREFGVA VTEEERSTVL RRVAERLGCS VEDLESVLGG YLEEVLAEPP ALKAEDLVRM LNLSMVQTLL FKASQLEVVF RSDGATAKKL LRAVKRLGLL YVAEQLGEGV RLTVDGPASL LRQTRRYGTR LAKLVPLVML AERWRIRARV PARGRDLFFE LGDEKSGLFP RVEEEPEPLF DSEVEREFYR SISSLARGWR VEREPEPLVA GNKVLIPDFS VSNGERKVYI EILGFWTKDY LERKMQKLRE LRGVNIVVAV NEELACSSVK DLPHDVVVFK GRLRGADVYP VLKRYLGEPP EEPREEVEYR LEGLPDLSGR TLSEAIRELR KLGVPESEAV KVLEKLGYGV DWTTLDPDKL VLKKNKRR
|
| |