Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1191 |
Symbol | |
ID | 4600436 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1130627 |
End bp | 1131784 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639773967 |
Product | hypothetical protein |
Protein accession | YP_920592 |
Protein GI | 119720097 |
COG category | [S] Function unknown |
COG ID | [COG1415] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGACCG GGGTAGCAGA GCTGCCTCTG CACGACGGGA GCGTCCCCCG CTGGCTCATA GCCAGGATGG AGAGGCTAGC GGGGATCCTC GTAGAGATGA TAGTCGAGGA GTACGGTACG CGGGGCCTTC TCGAAAGGTT AGCCGACCCG GTGTACTTCC AGGCTATCAA CAACATAATA GGCATGGACT GGGACAGCTC GGGCTCGACG ACGGTGACTA CGGCCGTGCT GAAGAAAGTG CTGGAGGAGA GGGAGCTGGG GGTAAAGGCT TGCGGTGGGA AGGGCTCTTC GAGCAGGAAG GCCCCCGAGG AGATAAGGCT CCACGCCGAG AAGTACGGGC TTGACCCGTC GGGGCTCGTG TCCACCTCCT ACCTCGTCGC GAAGGTGGAC AGCGCGGCGC TACAGGCCGG CTACCAGCTG TACCACCACG CGTTCTTCTT CGACGAGGAG GGCAGGTGGG CTGTCGTCCA GCAGGGTATG AAGCCTTCTA CTCGCACTGC TAGGAGGTAC CACTGGTTCT CGGAGCGCGT GGGCGACGTC ACCGTCGAGC CTCACAGCGG CATCCATGGG TTCAGGGAGC CCTTCGCGCT CAACACGGTA GCCGCCGAGG CGGGGGAGTT CCGCAGGCTG GTAGTCGACC TCGTAGGTGA GGGGGCCTCC AGGCTTGAAC GCCTCGTGAG CGAAGCGCTC CGCGTTCTGG AAGGCTACAG CCCGCTCGTC AGCTACGCGC CCTACAGCGC CGAGAAGGCG CGGTCCCTGC GCGAGAGGAT GAGGCGCCTG GGCAAGCCCT CCCTAAGCCG CGAGGCTCTA GCATCGCTCG CAGGCAGGGG CGTGGAGAGC TTCAGGGATA TTCTCGCCGC GAAAGCCGTG GGGCCCTCAG CTATAAGGGC GCTCGCGCTA GTCGCCGAGC TCGTATACGA GACGCCCCCG TCGTGGCGCG ACCCCGTAAC GCACCAAGTG GACCCCTTCA AGTTCGCGTA CGCGGTGGGG GGAAAGGACG GGGTACCGTT CCCCGTGGAC AGGAAGACGT ACGACGAGCT AATCTCGATA CTCGAAGAGT TGAAGCAACG CTTCAGAGGC GAGCCGGGAG TATTCAGAAG ACTCGCCGAG CTTACGAAGA ACTGGACACC GCCGCCCGAG GAGAAAGTAC CCACCTAG
|
Protein sequence | MKTGVAELPL HDGSVPRWLI ARMERLAGIL VEMIVEEYGT RGLLERLADP VYFQAINNII GMDWDSSGST TVTTAVLKKV LEERELGVKA CGGKGSSSRK APEEIRLHAE KYGLDPSGLV STSYLVAKVD SAALQAGYQL YHHAFFFDEE GRWAVVQQGM KPSTRTARRY HWFSERVGDV TVEPHSGIHG FREPFALNTV AAEAGEFRRL VVDLVGEGAS RLERLVSEAL RVLEGYSPLV SYAPYSAEKA RSLRERMRRL GKPSLSREAL ASLAGRGVES FRDILAAKAV GPSAIRALAL VAELVYETPP SWRDPVTHQV DPFKFAYAVG GKDGVPFPVD RKTYDELISI LEELKQRFRG EPGVFRRLAE LTKNWTPPPE EKVPT
|
| |