Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1762 |
Symbol | |
ID | 4601960 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1702725 |
End bp | 1704080 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639774535 |
Product | type III restriction enzyme, res subunit |
Protein accession | YP_921160 |
Protein GI | 119720665 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1061] DNA or RNA helicases of superfamily II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGAAGC TTGAGTGGAG CGCGGGGACT ATACTGCTCA GGGGTTCTCC CCCGCCGAGC GTGGCTCCCT ACTTCCGCTT CGACCCTAGG GTCAAGGGTT ACCGCGCCCT CGCGATACAG TACAGGTGGA TAGTGGAGGC CTTGAGGGAG GCTGGCGTCG AGTTCGAGGA CGACGTCCTG CACCCTCCCC AGTGCAAGCT CCAAGCGCGC GAGGTCCAGC TCAGGGACTA CCAGGAGGAG GCCCTGGAGA GGTGGATGGC TGGGAGGAGG GGGGTCGTGG TCCTGCCTAC GGGGGCAGGC AAGACGATGG TCGCCCTCGC GGCGATCGCT AGGCTCGCTT GCCCCACGCT GATAGTCGTG CCGACACTGG AGCTCATGGA CCAGTGGGAG GAGGGGGTTA GGAGGCACCT GGGGGTCGCG CCGGGGAGGT ACGGGGGAGG GGAGAAGGAG GTGGGCTGTG TCACTGTAGC CACGTACGAC TCGGCGTACG TGAACGCGGA GTTCCTGGGA GACAAGTTCG AGCTCCTAGT GTTCGACGAG GTCCACCACC TCCCGAGCCC GGGCTACAGG CAGGTAGCAG AGCTCTCGGC CGCCCCCTGG AGGATGGGCC TCACGGCGAC CCCGGAGCGG GAGGACGGGC TCCACGAGCT CCTGCCGTAC CTCGTCGGCC CCGTCGTGTA TAGGCGCGGC GTGGGCGAGA TGGCGGGGAA GTGGCTCGCG GAGTTCGACG TTGTCCGCGT GTACGCGGAG ATGTCGCCGG AGGAGAGGGA GGAGTACGAG AGGCTTACGA GGACGTACAG GTCGTTCCTG AGGAAGAGGG GGCTCAGGAT CCGGGGCCCC CGGGACTTCG AGAGGCTCGC CGCGCTCAGC GTGAAGGACC CCGAGGCCAG GGAAGCCCTC CTCGCGTGGT ACAGGGCCAG GAGGATAGCC CTGCACGCCT CCTCGAAGAT GGAGGTCCTC GAAGAGCTCC TGGCGAGGCA CAGGGGCGAC AAGGTGCTGA TATTCGCCGA GCACGGCGAC GTGGTGAGGA GGATATCCTC CCGCTTCCTG GTACCCGAGA TAACGTACAG GACGCCCGAG GAGGAGCGGA GAGCCGTGAT GTCCGCCTTC AGGAAGGGGC TCGTGCGCGC CATAGTGACG AGCAAGGTGC TGGAGGAGGG CGTCGACGTC CCGGACGCGA ACGTCGCGGT GATCCTGAGC GGAACGGCGA GCAGGAGGGA GTTCGTCCAG AGGCTTGGGA GGGTCCTAAG GCCGCGCGAG GGGAAGAGGG CCGTGGTCTA CGAGGTCGTG ACCTCCGGAA CCAAGGAGGT GGAGATCTCG CGGAAGCGCA GAAAGGCGCT GAAGGGTGGG CAGTAG
|
Protein sequence | MLKLEWSAGT ILLRGSPPPS VAPYFRFDPR VKGYRALAIQ YRWIVEALRE AGVEFEDDVL HPPQCKLQAR EVQLRDYQEE ALERWMAGRR GVVVLPTGAG KTMVALAAIA RLACPTLIVV PTLELMDQWE EGVRRHLGVA PGRYGGGEKE VGCVTVATYD SAYVNAEFLG DKFELLVFDE VHHLPSPGYR QVAELSAAPW RMGLTATPER EDGLHELLPY LVGPVVYRRG VGEMAGKWLA EFDVVRVYAE MSPEEREEYE RLTRTYRSFL RKRGLRIRGP RDFERLAALS VKDPEAREAL LAWYRARRIA LHASSKMEVL EELLARHRGD KVLIFAEHGD VVRRISSRFL VPEITYRTPE EERRAVMSAF RKGLVRAIVT SKVLEEGVDV PDANVAVILS GTASRREFVQ RLGRVLRPRE GKRAVVYEVV TSGTKEVEIS RKRRKALKGG Q
|
| |