Gene Information |
Locus tag | Tpen_0269 |
Symbol | |
ID | 4601890 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 237423 |
End bp | 238871 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 639773024 |
Product | hypothetical protein |
Protein accession | YP_919682 |
Protein GI | 119719187 |
COG category | [S] Function unknown |
COG ID | [COG1690] Uncharacterized conserved protein |
TIGRFAM ID | |
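The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; neither convention is stated in the record itself. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the Start bp and End bp rows of the record.
length = gene_length_bp(237423, 238871)
print(length)                     # 1449
print(protein_length_aa(length))  # 482
```

Under these assumptions the computed values match the Gene Length (1449 bp) and Protein Length (482 aa) rows.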
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTCCGC CACTTAGGAG GATCACCGAG TACCTATGGG AAATACCGGA GAAGTACAAG CCCGGCATGA ACGTGCCGGG GCTCGTCATA GCGGATGAAG TGTTAATATC GAAGATGAAG GAGGATTTAA CGCTCGAGCA GGTCGCTAAC GTTGCCATGT TGCCGGGCAT CTACAAGTAC TCTATAGTTC TTCCAGACGG CCACCAGGGT TACGGCTTCC CAATAGGCGG CGTCGCCGCG TTCGATGCAG AGAAAGGCGT GCTTAGCCCC GGAGGCGTAG GCTACGATAT CAACTGCGGC GTAAGAGTCC TGTCCACGAA CCTTACGGAG CAGGAGGTTA GGCCGAAACT CAGGGAGCTC GCAGAGACCC TTTTCAGGAA GATCCCCTCG GGGCTTGGAA GCACGAGTGG TTTAAGGCTG AGCCACGCGG AGCTGGACCG CGTACTCGAG GAGGGGGTCG AGTGGGCTAT TGAAAGAGGC TACGGCTGGA GAGAGGACAT GGAGCATATA GAGGAGAAGG GGAGAATGGA GGGCGCAGAT GCAGACGCGG TGTCCAACGA GGCCAAGCAG AGGGGTAGCA ACCAGCTGGG CACGCTGGGT AGCGGGAACC ACTTCCTGGA AGTCCAGAGG GTTGACAAAA TCTACGACCC CGAGGTTGCA AAGGTTTTCG GGATTGAGAG GGAGGGACAA GTAACCGTAA TGATACACAC AGGTAGCAGG GGGCTTGGAC ACCAGGTTGC AAGCGACTAT CTGAAGATAA TGGAGAGAGT CGTAAGAAAG TACAACATGC CGCTACCGGA CAGGGAGCTT GTATCCGTGC CGGCGACGTC CCCGGAGGCC GAAAGGTACT TCGCAGCCAT GAAGGCCGCG GCGAACTTTG CCTGGACGAA TAGGCAGGTT ATCACACACT GGGTTAGGGA AAGCTTCCGA GCGGTTTTCA AGACAGACCC GGATAAGCTC GGTCTCAATG TGATCTACGA CGTAGCTCAC AACATTGCCA AGCTCGAGGA GCACGTGGTG GACGGTAAAA GAGTGAAAGT CTACGTTCAC AGGAAGGGGG CTACGCGGGC CTTTCCACCC GGGCACCCAG AGATCCCCGC AGACTACAAA TCCATAGGTC AACCCGTCCT GATACCCGGC TCTATGGGTA CTGCGAGCTA CATACTCGTA GGAACGCAGA AAGCCATGGA TTTGACGTTC GGCTCCTCTC CGCACGGCGC TGGAAGGATG CAGAGCCGCG CTGAAGCGCG TAGAAGCGTG AGGGGACAGG AGATAAAGTC CGAGCTTGAG AGTAGAGGTA TAGTGGTTAG GGCTGCGAGC CTAGCTGTCG TCGCCGAGGA GGCTCCAGAC GCGTACAAGG ACGTGGACAG AGTAGTTATG GTCGCGGATG CAGTCGGCAT TGCTAGGAAG ATAGTGAGGA TGACGCCCAT AGCAGTGGTG AAAGGCTAA
|
Protein sequence | MAPPLRRITE YLWEIPEKYK PGMNVPGLVI ADEVLISKMK EDLTLEQVAN VAMLPGIYKY SIVLPDGHQG YGFPIGGVAA FDAEKGVLSP GGVGYDINCG VRVLSTNLTE QEVRPKLREL AETLFRKIPS GLGSTSGLRL SHAELDRVLE EGVEWAIERG YGWREDMEHI EEKGRMEGAD ADAVSNEAKQ RGSNQLGTLG SGNHFLEVQR VDKIYDPEVA KVFGIEREGQ VTVMIHTGSR GLGHQVASDY LKIMERVVRK YNMPLPDREL VSVPATSPEA ERYFAAMKAA ANFAWTNRQV ITHWVRESFR AVFKTDPDKL GLNVIYDVAH NIAKLEEHVV DGKRVKVYVH RKGATRAFPP GHPEIPADYK SIGQPVLIPG SMGTASYILV GTQKAMDLTF GSSPHGAGRM QSRAEARRSV RGQEIKSELE SRGIVVRAAS LAVVAEEAPD AYKDVDRVVM VADAVGIARK IVRMTPIAVV KG
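The sequences above are printed in space-separated blocks of ten characters. A minimal sketch for joining the blocks and computing GC content is given below; `join_blocks` and `gc_percent` are hypothetical helper names, and the example input is only the first two blocks of the gene sequence, so its result is not the whole-gene GC content listed in the record.

```python
def join_blocks(blocked: str) -> str:
    """Remove the whitespace between printed blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence, copied from the record.
example = "ATGGCTCCGC CACTTAGGAG"
seq = join_blocks(example)
print(len(seq))
print(gc_percent(seq))
```

Passing the full Gene sequence value through `join_blocks` and then `gc_percent` gives a figure that can be compared against the GC content row.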