Gene Tpen_0652 details

Gene Information

Locus tag: Tpen_0652
Symbol:
ID: 4601610
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 602242
End bp: 604134
Gene Length: 1893 bp
Protein Length: 630 aa
Translation table: 11
GC content: 52%
IMG OID: 639773425
Product: type III restriction enzyme, res subunit
Protein accession: YP_920057
Protein GI: 119719562
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG1061] DNA or RNA helicases of superfamily II
TIGRFAM ID:
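
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record. The function names are hypothetical helpers for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa of the product, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(602242, 604134)
print(length_bp)                  # 1893
print(protein_length(length_bp))  # 630
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.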


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000732435
Plasmid hitchhiking: No
Plasmid clonability: unclonable

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGATGACCG ACTATGGGGG GCTGAACGAC AGCTATCTCA GTATATCTTT TCGTAAGAGA 
AGAACCCTGA CGGAGGCGGA GTTCAGAGAG TTCTTGGAGC TTGCGAGAAA GGTGGCGAAC
TACGACCCAG AAAGCAAGGA GTGGAGGATT TCTGTGGAGA AGGTTTCCTC GCTAGACAAC
GAGCTGGAAG AAGTACTCGA AAAGCTGAAG AAACTGTCAA CCTTGAGTGA CGCGGATCTT
CAGAGGGTGG TCGCATATAC CCGTGGGAGG AGCGCGGGTA GGGTTTGCTG GATAGGCTAC
GATCTAAGGG TGAAGGGTTT GCCTCCCACC GTCGTGGAGG CACTGCGGAA TGACACGACT
CTAGGCGGGC TCTTCCTAGT GGAGGGACAG ACGCCGAGGT TGCGTTCCGT GCTTTTTCTC
CACGAAGCAT CACGCGCTTT GAAGGAGAAG TTCAACGTTT CTTTAAGCTT CGACGAGAAG
ATGACCAGCG TGGAGGTGCG TAGGGAGAAT GGCGTGCTTG TCTGGCGTTT CCAGTACCTT
GACAAGGTGC TAGCTGAGAA GCTTGTAGAG GCCTCGACGT TGAAATTCTT CGTAGAGAAG
GCAGTGCTAA ACGAGGAGGG TGAGTTCGAA GGCACGGAAC TCGTTGAGAG AAGAATGAGG
ACTGCACACG TCGACTGGCA GAGGAAGGAA GTCTCGACAC CGGTGGCTCT ACTAGATAGC
CTTAAAACCT TTCTCGAAGC GCACGGCTTT AGGGTTCTCG TCTCGATAGA GGAGAAGCCG
CCCATAACTG TTCCCCTTGA ACACAACTTT AAGCTCTTAC CGCACCAGGT AGAAGCGTAT
AAACAATGGA CGAGGAAACG TAGGGGCACC ATTTCGATAT TTACTAGAGG GGGGAAGTCG
TTCATAGCAC TCGAAGCTAT CTACTCGCTG AGAAAGCCTA CCATAGTCTT TGTCACTACT
CAGGAACTCG TTGAAACTTG GATTAGCTAC TTCGAGAAGT ACCTTGGGCT ACCGCGCTCA
TTTGTAGGTG TTCTTGGCGG GGGAGAGCAG AAAATAAGGG AGATCACGGT CGCAACCTAC
AGCAGTGCGG TTAAGTACAT AGATCTCATT AAGTCAAGGT TTGAGCTAGC GATATTTGAC
GAGGCTCACC ACGTACCGGC GGCTACGTTC AAGCAGGTAG CGCTTGGTGT CGATGCCCTG
TACAGAATGG CCCTTTCCGC CACTCCCGAG CGGAGGGATA GGAACGAAGG GCTTCTTTTC
ACGCTGTGCG GAGGTTTGCT GTACCGGCTT ACGTACGAAG ATCTCGTGAG GCTTAAGGTC
GTAGCTCCCA TAGAGGTCCT GGATGCCGTC TTCGTGGAGG GACCAGAGGA AAAGAAGAAG
AAGCTCCTGG AGATTCTGCG CCGACATGCC GACGGAAAAG TAATCGTGTA CACGCAGTAC
CTCCAGACTG CGGAAGATGT CTATGACTTG CTGAGGAGGA ACGGCTTTAA CGCGGAGATA
GTAACAGGGG ATACACCGGC GCACAAAAGA GAGCTCGCCT TCAAGAACTT TGTCGAGGGC
AGGTCTAACG TAATAGTCAC GACTACCGTC CTCGATGAGG GAATAACTGT GCCGGACGCC
GACGTCGCCG TGATCTACGA GGGGACAGGC GAAGGAAGAC AGATGATACA GAGGATAGGG
AGAGTTCTAG GCTATTACCC CGGGAAGACG GCCAAGGTGT ACGAGATAGT CGACTTAACG
AACCCCAGAG AGAAATCAGC CTATAGGCGC AGGTCGTGGG TTAGAGAGCT TTACAGGGTC
AGGGGTCTAG AGGAAATTGT GAGGAGAGTT AAAGAAGGGG ACGAGGAGGG GTATAAGCCC
AGCTATCAGT TTCGCATAGA TTACTTTGAT TAG
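
The gene sequence is laid out in space-separated blocks of 10 bases, so the whitespace has to be stripped before any calculation. The following is a minimal sketch of how a GC fraction could be computed from text in this layout. The helper names are hypothetical, and the small input is just the first two blocks of the sequence, not the whole gene, so its result is not expected to match the GC content field of the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks, and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence, used as a small input.
sample = clean_sequence("ATGATGACCG ACTATGGGGG")
print(len(sample))                   # 20
print(f"{gc_fraction(sample):.0%}")  # 55%
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` gives a value that can be compared with the reported GC content.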
 
Protein sequence
MMTDYGGLND SYLSISFRKR RTLTEAEFRE FLELARKVAN YDPESKEWRI SVEKVSSLDN 
ELEEVLEKLK KLSTLSDADL QRVVAYTRGR SAGRVCWIGY DLRVKGLPPT VVEALRNDTT
LGGLFLVEGQ TPRLRSVLFL HEASRALKEK FNVSLSFDEK MTSVEVRREN GVLVWRFQYL
DKVLAEKLVE ASTLKFFVEK AVLNEEGEFE GTELVERRMR TAHVDWQRKE VSTPVALLDS
LKTFLEAHGF RVLVSIEEKP PITVPLEHNF KLLPHQVEAY KQWTRKRRGT ISIFTRGGKS
FIALEAIYSL RKPTIVFVTT QELVETWISY FEKYLGLPRS FVGVLGGGEQ KIREITVATY
SSAVKYIDLI KSRFELAIFD EAHHVPAATF KQVALGVDAL YRMALSATPE RRDRNEGLLF
TLCGGLLYRL TYEDLVRLKV VAPIEVLDAV FVEGPEEKKK KLLEILRRHA DGKVIVYTQY
LQTAEDVYDL LRRNGFNAEI VTGDTPAHKR ELAFKNFVEG RSNVIVTTTV LDEGITVPDA
DVAVIYEGTG EGRQMIQRIG RVLGYYPGKT AKVYEIVDLT NPREKSAYRR RSWVRELYRV
RGLEEIVRRV KEGDEEGYKP SYQFRIDYFD
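
The protein sequence can be related back to the gene sequence by translating codons. The sketch below is a hedged illustration rather than the database's own method: it builds the standard codon table, whose amino acid assignments translation table 11 shares, and it ignores the alternative start codons that table 11 permits. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard amino acid assignments in TCAG codon order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw_dna: str) -> str:
    """Translate DNA (whitespace allowed) up to the first stop codon."""
    dna = "".join(raw_dna.split()).upper()
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGATGACCG ACTATGGGGG GCTGAACGAC"))  # MMTDYGGLND
```

The output matches the first 10 residues of the protein sequence listed above.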