Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0035 |
Symbol | |
ID | 4601383 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 25762 |
End bp | 26838 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 639772788 |
Product | hypothetical protein |
Protein accession | YP_919448 |
Protein GI | 119718953 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.101929 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCCTGTTG TGGCGCTACT TGAAGGCTGG TACGCGTCGA TAGCGAGAAG CGGGGAAGCC TTCAGGCTGA CTCCGAACGA GTGGAAAGTC CTGGTATCCG TCGTGAACTC TACGACGGTC TCGCGCGTAT CGGAGCAGGC GAAGCTACCC TACAGCACTG TGATAGACGT TATGAGGAGG CTCTCGGGGA ACGGTCTCGG GATACACTTC ATACCGTACT TCGACCTAGC GGGCCTGAGG AAAGTCTTCG CGTTGATAGA GGGTGCTCCC ATTTCCGGGT ACCCCCTCTA CACTACAAAC GTTTACAGGC TCATCGGGAG AGGTTTCTTC ACGGGGGTTC TGGGTCTTGT CCCGGACTAC CTGGTCAACG AGTTCTTCGC TAGCCTGCCG CGGGAGCCCG TCGCTACGGT GGTCGGGGAT GAGTACAGGC ACTGGGCTCC AACGGGGAGG CTAACGAGGT ACGTCGCGGC GCACGGCATA GTAGTCCCCG CCTACGACCA GCTGGAGGAT GTGCTCTACG CGTCGAGGGG ACCCGTGACG AGGAGGGAGA AGAAGTGGGT GGACTGGGTG GACGTGCTGA TAGTCTACTT CAAGATGAAG TACGCGTTTA CGAAGCTGAG CGACGTCTAC TCCCTGGCGA AGAGCGTTTT CGGCACGGCT CCCCCCAGCA GGCAGCTGAT GAGCTACCAT TACAGAACCC ACGTCTCGCC CCTATGGTCC TACAACGGCG TGAGCTTCAG GATAGACAGG CACCTCGCGC CCGAAAGGGT TTACGTTCTC CGGGGTAACG CGAGCAAAGT CGCCGCCAGG ACGCTTATCG AGGCACCCTT CTTCTTCGAA GCGCTTGTGA ACGAGGAGTC TTCCGTCCTC CTCGCGCAGA CCCCGTGCTA CATGTCCCCC CTTGTATACA GGGTTCTTGC CGAGACGCGC GTGGACCTTC CCTACGGCGA GCTCTTAGTA GCCGAGTCCT GGAGCTTCGA GTGGCTTACT AGCGACGCGG TGAAGCATTA CAGGGACCAC AACGAGTGGC TCCACCCATC TGTGAGCGTG AGGATGCTCC CACCCGATGC AAAGTAA
|
Protein sequence | MPVVALLEGW YASIARSGEA FRLTPNEWKV LVSVVNSTTV SRVSEQAKLP YSTVIDVMRR LSGNGLGIHF IPYFDLAGLR KVFALIEGAP ISGYPLYTTN VYRLIGRGFF TGVLGLVPDY LVNEFFASLP REPVATVVGD EYRHWAPTGR LTRYVAAHGI VVPAYDQLED VLYASRGPVT RREKKWVDWV DVLIVYFKMK YAFTKLSDVY SLAKSVFGTA PPSRQLMSYH YRTHVSPLWS YNGVSFRIDR HLAPERVYVL RGNASKVAAR TLIEAPFFFE ALVNEESSVL LAQTPCYMSP LVYRVLAETR VDLPYGELLV AESWSFEWLT SDAVKHYRDH NEWLHPSVSV RMLPPDAK
|
| |