Gene Information |
Locus tag | Tpen_1035 |
Symbol | |
ID | 4600940 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 976062 |
End bp | 977039 |
Gene Length | 978 bp |
Protein Length | 325 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639773813 |
Product | hypothetical protein |
Protein accession | YP_920438 |
Protein GI | 119719943 |
COG category | [S] Function unknown |
COG ID | [COG4260] Putative virion core protein (lumpy skin disease virus) |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.275565 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCCTCAAG TAATCGAGTG GGTCAACCCA GGGCCCGACG ACATAGTCTG GCGCTACCCC GACGAGAACA TAACGTGGGG CGCGCAGCTC ATAGTCCACG AGTACGAGGT CGCCGTCTTC TTCCGGGACG GCAAGGCGTA CGACGTGCTT GGACCCGGGA GGCACACGCT GACAACGCAG AACCTACCCC TGCTGACGAG GGTGCTGTCC GCCATCGCCG GCTACCCGAC AACCCCGTTC AAGGCTACCG TCATATTCGT CTCAACGAAG CAGTTCAGAG GCTTGTTCGG TGGCAGGAGC CAGACGACGG AGCTAGCCCC ACTCATGTTC AGGGGTTCCT ACTGGTTCAG GGTCGGAGAC CCCAAGCTCT TCGTAACCGA GGTCGTCGGA GGCCAAGGGA AGTATACCAG CGCCGAGGTA AACGAGTTCA TAAGGGGCTT CATAAACGAG AAGGTAATAA AGCACCTCTC CGGGTACTCC TTGGCGGAGG CGTTCTCGAG CCTCGAGCAG GTATCCTTCA AGACGAAGGC CTTCCTCCTG GAGGAGGTGA GGAGGATAGG CCTCGAGCTC ATAGACCTTA AGTTCGAAGC CATAGACACG ACCCCCGAGT ACAGGGACAG GCTCTTCTGG ATAAAGCAGA CCGGCGCGGC TGGCTACGTG CTACAAATGG ACACAGCGAA GGAAGTCGCC AGGGAGCTTG GCAAATCCCC AGGCGCAGCC GTAGGAGCAG GGGTAGTTAT GATACCCCCA CTCTTCCAGC CCCCACCACA GCAACCCCTG CCCCAGCAAC CTCCCGCCGC CCAGCCGCCG GCCACCAAGA CCTGCCCACA GTGCGGGCGC CCGGTACCCT TAGACGCCCT GTTCTGCCCG TACTGTGGCT ACCGATTCCA GCCCGCCACG AAGAAGTGCC CCAACTGCGG TAGAGAGGTA CCGGCAGACG CCCTCTACTG TCCGTACTGC GGTACGAAGC TCGCCTAG
Protein sequence | MPQVIEWVNP GPDDIVWRYP DENITWGAQL IVHEYEVAVF FRDGKAYDVL GPGRHTLTTQ NLPLLTRVLS AIAGYPTTPF KATVIFVSTK QFRGLFGGRS QTTELAPLMF RGSYWFRVGD PKLFVTEVVG GQGKYTSAEV NEFIRGFINE KVIKHLSGYS LAEAFSSLEQ VSFKTKAFLL EEVRRIGLEL IDLKFEAIDT TPEYRDRLFW IKQTGAAGYV LQMDTAKEVA RELGKSPGAA VGAGVVMIPP LFQPPPQQPL PQQPPAAQPP ATKTCPQCGR PVPLDALFCP YCGYRFQPAT KKCPNCGREV PADALYCPYC GTKLA
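
The length fields in this record can be cross-checked against its coordinates. The sketch below is illustrative only and is not part of the original record: it assumes Start bp and End bp are 1-based inclusive positions and that Protein Length excludes the stop codon. The function names are hypothetical helpers introduced here.

```python
# Values copied from the record above.
START_BP = 976062
END_BP = 977039
GENE_LENGTH_BP = 978
PROTEIN_LENGTH_AA = 325


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases; whitespace between 10-base groups is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


# Both checks pass for this record under the stated assumptions.
assert gene_length(START_BP, END_BP) == GENE_LENGTH_BP
assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Passing the Gene sequence field to `gc_fraction` would give a value to compare with the GC content field, which the record reports rounded to a whole percent.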