Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0652 |
Symbol | |
ID | 4601610 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 602242 |
End bp | 604134 |
Gene Length | 1893 bp |
Protein Length | 630 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 639773425 |
Product | type III restriction enzyme, res subunit |
Protein accession | YP_920057 |
Protein GI | 119719562 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1061] DNA or RNA helicases of superfamily II |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000732435 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGACCG ACTATGGGGG GCTGAACGAC AGCTATCTCA GTATATCTTT TCGTAAGAGA AGAACCCTGA CGGAGGCGGA GTTCAGAGAG TTCTTGGAGC TTGCGAGAAA GGTGGCGAAC TACGACCCAG AAAGCAAGGA GTGGAGGATT TCTGTGGAGA AGGTTTCCTC GCTAGACAAC GAGCTGGAAG AAGTACTCGA AAAGCTGAAG AAACTGTCAA CCTTGAGTGA CGCGGATCTT CAGAGGGTGG TCGCATATAC CCGTGGGAGG AGCGCGGGTA GGGTTTGCTG GATAGGCTAC GATCTAAGGG TGAAGGGTTT GCCTCCCACC GTCGTGGAGG CACTGCGGAA TGACACGACT CTAGGCGGGC TCTTCCTAGT GGAGGGACAG ACGCCGAGGT TGCGTTCCGT GCTTTTTCTC CACGAAGCAT CACGCGCTTT GAAGGAGAAG TTCAACGTTT CTTTAAGCTT CGACGAGAAG ATGACCAGCG TGGAGGTGCG TAGGGAGAAT GGCGTGCTTG TCTGGCGTTT CCAGTACCTT GACAAGGTGC TAGCTGAGAA GCTTGTAGAG GCCTCGACGT TGAAATTCTT CGTAGAGAAG GCAGTGCTAA ACGAGGAGGG TGAGTTCGAA GGCACGGAAC TCGTTGAGAG AAGAATGAGG ACTGCACACG TCGACTGGCA GAGGAAGGAA GTCTCGACAC CGGTGGCTCT ACTAGATAGC CTTAAAACCT TTCTCGAAGC GCACGGCTTT AGGGTTCTCG TCTCGATAGA GGAGAAGCCG CCCATAACTG TTCCCCTTGA ACACAACTTT AAGCTCTTAC CGCACCAGGT AGAAGCGTAT AAACAATGGA CGAGGAAACG TAGGGGCACC ATTTCGATAT TTACTAGAGG GGGGAAGTCG TTCATAGCAC TCGAAGCTAT CTACTCGCTG AGAAAGCCTA CCATAGTCTT TGTCACTACT CAGGAACTCG TTGAAACTTG GATTAGCTAC TTCGAGAAGT ACCTTGGGCT ACCGCGCTCA TTTGTAGGTG TTCTTGGCGG GGGAGAGCAG AAAATAAGGG AGATCACGGT CGCAACCTAC AGCAGTGCGG TTAAGTACAT AGATCTCATT AAGTCAAGGT TTGAGCTAGC GATATTTGAC GAGGCTCACC ACGTACCGGC GGCTACGTTC AAGCAGGTAG CGCTTGGTGT CGATGCCCTG TACAGAATGG CCCTTTCCGC CACTCCCGAG CGGAGGGATA GGAACGAAGG GCTTCTTTTC ACGCTGTGCG GAGGTTTGCT GTACCGGCTT ACGTACGAAG ATCTCGTGAG GCTTAAGGTC GTAGCTCCCA TAGAGGTCCT GGATGCCGTC TTCGTGGAGG GACCAGAGGA AAAGAAGAAG AAGCTCCTGG AGATTCTGCG CCGACATGCC GACGGAAAAG TAATCGTGTA CACGCAGTAC CTCCAGACTG CGGAAGATGT CTATGACTTG CTGAGGAGGA ACGGCTTTAA CGCGGAGATA GTAACAGGGG ATACACCGGC GCACAAAAGA GAGCTCGCCT TCAAGAACTT TGTCGAGGGC AGGTCTAACG TAATAGTCAC GACTACCGTC CTCGATGAGG GAATAACTGT GCCGGACGCC GACGTCGCCG TGATCTACGA GGGGACAGGC GAAGGAAGAC AGATGATACA GAGGATAGGG AGAGTTCTAG GCTATTACCC CGGGAAGACG GCCAAGGTGT ACGAGATAGT CGACTTAACG AACCCCAGAG AGAAATCAGC CTATAGGCGC AGGTCGTGGG TTAGAGAGCT TTACAGGGTC AGGGGTCTAG AGGAAATTGT GAGGAGAGTT AAAGAAGGGG ACGAGGAGGG GTATAAGCCC AGCTATCAGT TTCGCATAGA TTACTTTGAT TAG
|
Protein sequence | MMTDYGGLND SYLSISFRKR RTLTEAEFRE FLELARKVAN YDPESKEWRI SVEKVSSLDN ELEEVLEKLK KLSTLSDADL QRVVAYTRGR SAGRVCWIGY DLRVKGLPPT VVEALRNDTT LGGLFLVEGQ TPRLRSVLFL HEASRALKEK FNVSLSFDEK MTSVEVRREN GVLVWRFQYL DKVLAEKLVE ASTLKFFVEK AVLNEEGEFE GTELVERRMR TAHVDWQRKE VSTPVALLDS LKTFLEAHGF RVLVSIEEKP PITVPLEHNF KLLPHQVEAY KQWTRKRRGT ISIFTRGGKS FIALEAIYSL RKPTIVFVTT QELVETWISY FEKYLGLPRS FVGVLGGGEQ KIREITVATY SSAVKYIDLI KSRFELAIFD EAHHVPAATF KQVALGVDAL YRMALSATPE RRDRNEGLLF TLCGGLLYRL TYEDLVRLKV VAPIEVLDAV FVEGPEEKKK KLLEILRRHA DGKVIVYTQY LQTAEDVYDL LRRNGFNAEI VTGDTPAHKR ELAFKNFVEG RSNVIVTTTV LDEGITVPDA DVAVIYEGTG EGRQMIQRIG RVLGYYPGKT AKVYEIVDLT NPREKSAYRR RSWVRELYRV RGLEEIVRRV KEGDEEGYKP SYQFRIDYFD
|
| |