Gene Information |
Locus tag | Tpen_1661 |
Symbol | |
ID | 4601243 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1608039 |
End bp | 1609646 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639774434 |
Product | hypothetical protein |
Protein accession | YP_921059 |
Protein GI | 119720564 |
COG category | [S] Function unknown |
COG ID | [COG4879] Uncharacterized protein conserved in archaea |
TIGRFAM ID | |
|
|
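The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. Both assumptions match the numbers in this record but are not stated by it. The function names are hypothetical and chosen for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1608039, 1609646)
print(length_bp)                  # 1608
print(protein_length(length_bp))  # 535
```

The strand field does not enter the calculation: for a gene on the minus strand, the start and end positions still delimit the same interval on the replicon.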
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGTGGAGG CCGAGGTTCG CCCGACAAAT AGTTTAACTA AGCCTTTAGG CTTTGCATCG GCAGGAGTCA CGGAGCACTG GGGGAGGTGC TTGGCGTACC TCCAGAAGAG AAGGCTCCAG GTAGTCACCG ACGTCCTGAG CAGGTTGCTG GCGGGCGAGG CCGCTACGAG GGACGAAGCC ATCGGCTTGC TGCGCAGAGC ATACCTAGAG GCGGGCCTGG AGCCGCTGAG GGGGCTTTCC ACTCCTCGGA GCTTCTACAG GGAGATGTCG CTGGTGTACG CCGTGGGCAA GTACGGGCTT GGGCTCGACG AGGAGCTCGA AGGATTGCGG GAAATATTCG AGGTGGAGGC TCTCTGCGAC GCGGCGCTGG GGCTCGTGAG GAGCGGCGCC GGCGTCGAGG AGTCGGTTTT CAGGGTGTTC GGGCGCCTAA AGGAGTCCGA CGTTAGGAGT TTCCTCAACT ACGTCTTGGC GGCGAGCCTG CTCGGCTACG TGGGCTTCGA GGAGCTCTTG AAGGTGCTGG GAGGTCTAGA GGCTTACGGC AACGTAGCGC GGGACTACAG GATCCTGGTG GTAGCGTACA GGCTCGCGGA CTTCGTGGGC TCCAACAAGT TTTCCAGGAG GTCCGAGAAG GAGGCGGCTA AGCTGGACCT AGCGAGGAGC CTCGGCGACG AGAAGGCTTT GCCGTCGGAC GGGCTCGTCT GGAGGATAGC TGTAAACGTC TACGGCGTAG ACGAGGGTGT GGCTAACCGC GTCCTCAGGA TGAGCCCCAG CGACCTCGCA AGGGTAGTTC TCGAGTCTAG CACGTGGTGG TACGGGTACG TGGTGTCTCC CGGAGAGCTG GAGGAATACC TCGCCGGCCT CGAGCCGGAG TGGCTCCAGG CGTACCTCTA CCTCGAAAAG AAGCTCAGGA AGGCTTTCCC GGCTTCCTCG AGGCTAGTCG CGGCGGCAGC CGTAGACCAG GCGAAGGCGG AGGGGGCGAG CCCCGAGTCC CTGACGGCTA GGAGCGTAAA CATCGAGAAC CCCGTCTCCA GCCTGCTGGA GTGGGGGTGC TCCGGCTGGA GGTTTACCTA CATGAACCTG GCTCCCAAGG GGGAGTTCGA GCTGAGGCTT GAGGACAAGC ACGAGGTAAT AGTCTTCGAC CGCGTCCGGG CGTACGAGGC CCTCGCCTTG GGGGTCAGGA GGCTCAGGGA GAGGATAGCG GAGAAAGCTA GCGGGGACCT CGACGTGAAG GTGAGGCTCG GCGGGAGGCT CAGCGGTATG TGGCTCAGGG TCCAGGCCAT GCTCCTCGCG GTTAAAGTCG TCGGGGAGGC CTACGTGTTG ACCGCGCCGC CCGCAGCCCA GGCGAGGCTC CGGGAGGGCC TCCTGGAGGA GAGGAGGATC CCGGTGGACG GCGGGGAGCT CGTCGCGAGG CTTGTTGAGC GAGGGTTTAA CCGGTACGTC ACGCTGGAGC TAGGCGGGAG GAGGATCGCG ACGCTGAAGC TGGGCTCCGA CCCCGGGAAG CTGGAGGAAA AGGTCGAGAA GATACTCTCG CATAACCTCC CGAAGGGCGT GAGCGAGGAG AAAAGGCGGC TACTGGGAGA AGAGGTGAGG AGCCTGCTCA AGCGCTGA
|
Protein sequence | MVEAEVRPTN SLTKPLGFAS AGVTEHWGRC LAYLQKRRLQ VVTDVLSRLL AGEAATRDEA IGLLRRAYLE AGLEPLRGLS TPRSFYREMS LVYAVGKYGL GLDEELEGLR EIFEVEALCD AALGLVRSGA GVEESVFRVF GRLKESDVRS FLNYVLAASL LGYVGFEELL KVLGGLEAYG NVARDYRILV VAYRLADFVG SNKFSRRSEK EAAKLDLARS LGDEKALPSD GLVWRIAVNV YGVDEGVANR VLRMSPSDLA RVVLESSTWW YGYVVSPGEL EEYLAGLEPE WLQAYLYLEK KLRKAFPASS RLVAAAAVDQ AKAEGASPES LTARSVNIEN PVSSLLEWGC SGWRFTYMNL APKGEFELRL EDKHEVIVFD RVRAYEALAL GVRRLRERIA EKASGDLDVK VRLGGRLSGM WLRVQAMLLA VKVVGEAYVL TAPPAAQARL REGLLEERRI PVDGGELVAR LVERGFNRYV TLELGGRRIA TLKLGSDPGK LEEKVEKILS HNLPKGVSEE KRRLLGEEVR SLLKR
|
| |
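The sequences above are printed in blocks of ten characters separated by spaces, so they need cleaning before any downstream use. The following is a minimal sketch, not the database's own method, of how one might strip the spacing, compute a GC fraction comparable to the GC content field, and confirm that a coding sequence begins with a start codon and ends with a stop codon of translation table 11. In that table GTG is an accepted alternative start codon and is read as methionine at the initiation position, which is why the gene begins with GTG while the protein begins with M. All function names are hypothetical.

```python
# Start and stop codons of NCBI translation table 11
# (bacterial, archaeal and plant plastid code).
TABLE_11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}
TABLE_11_STOPS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


def has_valid_start_and_stop(raw: str) -> bool:
    """True if the CDS length is a multiple of 3 and it is bounded by
    a table 11 start codon and a table 11 stop codon."""
    seq = clean_sequence(raw)
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in TABLE_11_STARTS
        and seq[-3:] in TABLE_11_STOPS
    )


# First two blocks of the gene sequence in this record.
print(gc_fraction("GTGGTGGAGG CCGAGGTTCG"))  # 0.7
```

Passing the complete gene sequence field to `gc_fraction` gives a value to compare against the reported GC content; the result is not quoted here because it has not been recomputed for this record.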