Gene Information |
Locus tag | Tpen_1815 |
Symbol | |
ID | 4602052 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1757635 |
End bp | 1758762 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639774588 |
Product | signal-transduction protein |
Protein accession | YP_921213 |
Protein GI | 119720718 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
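The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers for illustration, not part of any database tooling.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for Tpen_1815.
start_bp, end_bp = 1757635, 1758762

length = gene_length_bp(start_bp, end_bp)
print(length)                     # 1128, matches "Gene Length"
print(protein_length_aa(length))  # 375, matches "Protein Length"
```

The strand ("-") does not affect either calculation, since both depend only on the span of the feature on the replicon.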
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0955331 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTTTA CTGGGCCCCC TTCCCAGGGG GTTTTCACGA AGATAGACCC CCTAGGCTTA AACGACACCG TTTCTGTTGC CGCTGAACGC ATGTGGAGGT TTGAGCTCCC CGCAGTGCCC GTGGTCGACG GGGAGGGGAG GTACGCGGGG ATAGTGTCTA TCTTTTCGCT TTTGAGGACA AGGTACCAAG CGGGCACGAA GCTTGGGAGT GTCCTCGAAA AGGCGCCCGT GGTGGAGCCG TCCCTCCCCT TGACAGAGGT AGCGAGGATG CTCGTCAAGA CGGGGCAACC CGGGCTAGCA GTAGTGGAGG GTGGAAGGGT CGTGGGGATA GTCTCGGCGA GAAGGTTGCT CGCAGGTATG GGTCTCTCTT CGAGGGTTAC CGCGAGGCAT ATTGCCTACA GGCTCGACCC ACTGGCCCCC AGCGATCCCT TGGAGAAGGC TAGGAAGCTC ATCGTGGACC TGGGCTTGAG GCTCGTACCG GTAGCCGAGG ACGGGAAGGC CGTGGGGGTG GTCAGAGTCT ACGATCTAGT GAACTTCGTG TACAATACTC CCCTGAGGCG TGAGAGGCTG GGCGAGGTGA AGGGCGAGAC CTCCTACTTC CTCGAACAGC CGGTCGCGAA AATAGCTACA GCGAACTTTA GGACGGTTCA CGTAGACGGC TACCCCTCGG TGGAAGACAT AGCCGAGGGG TCTGTCGTCG TCGACTCCTC CGGGAAGGTC TACGGGATCA TATCGCCCTA CCTGCTACTG AGGAGGCTTC TGCCCGCGAT CGAGGAAGCA AAGGTCCCTC TAAGGGTAGA AGGGGTGGAC GAGCTGGACT TCATCCAGAG GAACCTCATA TACAGGAAGA GCCTGGATAT AGCTTCGGAG GTCTCGAGGA GGGCTAGGTT ACTCGAAATG AGCGTAGTGC TAAAGTCTAG GGAGAAGTCG GGGAACAGGA GGAGGTACGA CGCGATAGTA TCCATCAAGC TCGACGTCGA CAGCTATAGC GCGAGGTCCT CTGGGTGGGA CGCTGTCGAG ACAGTGTACG AAGCGCTAGA CTCGGCGTAC AAGGTTTTCT CGAAGTCGAA GGAAAAGAAG AGGGAGAAGA GGATCTCGCT AGCAAGGCTC AGGAAGATCC TGGAGTAA
|
Protein sequence | MSFTGPPSQG VFTKIDPLGL NDTVSVAAER MWRFELPAVP VVDGEGRYAG IVSIFSLLRT RYQAGTKLGS VLEKAPVVEP SLPLTEVARM LVKTGQPGLA VVEGGRVVGI VSARRLLAGM GLSSRVTARH IAYRLDPLAP SDPLEKARKL IVDLGLRLVP VAEDGKAVGV VRVYDLVNFV YNTPLRRERL GEVKGETSYF LEQPVAKIAT ANFRTVHVDG YPSVEDIAEG SVVVDSSGKV YGIISPYLLL RRLLPAIEEA KVPLRVEGVD ELDFIQRNLI YRKSLDIASE VSRRARLLEM SVVLKSREKS GNRRRYDAIV SIKLDVDSYS ARSSGWDAVE TVYEALDSAY KVFSKSKEKK REKRISLARL RKILE