Gene Tpen_1762 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1762 
Symbol 
ID4601960 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1702725 
End bp1704080 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content66% 
IMG OID639774535 
Producttype III restriction enzyme, res subunit 
Protein accessionYP_921160 
Protein GI119720665 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG1061] DNA or RNA helicases of superfamily II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTGAAGC TTGAGTGGAG CGCGGGGACT ATACTGCTCA GGGGTTCTCC CCCGCCGAGC 
GTGGCTCCCT ACTTCCGCTT CGACCCTAGG GTCAAGGGTT ACCGCGCCCT CGCGATACAG
TACAGGTGGA TAGTGGAGGC CTTGAGGGAG GCTGGCGTCG AGTTCGAGGA CGACGTCCTG
CACCCTCCCC AGTGCAAGCT CCAAGCGCGC GAGGTCCAGC TCAGGGACTA CCAGGAGGAG
GCCCTGGAGA GGTGGATGGC TGGGAGGAGG GGGGTCGTGG TCCTGCCTAC GGGGGCAGGC
AAGACGATGG TCGCCCTCGC GGCGATCGCT AGGCTCGCTT GCCCCACGCT GATAGTCGTG
CCGACACTGG AGCTCATGGA CCAGTGGGAG GAGGGGGTTA GGAGGCACCT GGGGGTCGCG
CCGGGGAGGT ACGGGGGAGG GGAGAAGGAG GTGGGCTGTG TCACTGTAGC CACGTACGAC
TCGGCGTACG TGAACGCGGA GTTCCTGGGA GACAAGTTCG AGCTCCTAGT GTTCGACGAG
GTCCACCACC TCCCGAGCCC GGGCTACAGG CAGGTAGCAG AGCTCTCGGC CGCCCCCTGG
AGGATGGGCC TCACGGCGAC CCCGGAGCGG GAGGACGGGC TCCACGAGCT CCTGCCGTAC
CTCGTCGGCC CCGTCGTGTA TAGGCGCGGC GTGGGCGAGA TGGCGGGGAA GTGGCTCGCG
GAGTTCGACG TTGTCCGCGT GTACGCGGAG ATGTCGCCGG AGGAGAGGGA GGAGTACGAG
AGGCTTACGA GGACGTACAG GTCGTTCCTG AGGAAGAGGG GGCTCAGGAT CCGGGGCCCC
CGGGACTTCG AGAGGCTCGC CGCGCTCAGC GTGAAGGACC CCGAGGCCAG GGAAGCCCTC
CTCGCGTGGT ACAGGGCCAG GAGGATAGCC CTGCACGCCT CCTCGAAGAT GGAGGTCCTC
GAAGAGCTCC TGGCGAGGCA CAGGGGCGAC AAGGTGCTGA TATTCGCCGA GCACGGCGAC
GTGGTGAGGA GGATATCCTC CCGCTTCCTG GTACCCGAGA TAACGTACAG GACGCCCGAG
GAGGAGCGGA GAGCCGTGAT GTCCGCCTTC AGGAAGGGGC TCGTGCGCGC CATAGTGACG
AGCAAGGTGC TGGAGGAGGG CGTCGACGTC CCGGACGCGA ACGTCGCGGT GATCCTGAGC
GGAACGGCGA GCAGGAGGGA GTTCGTCCAG AGGCTTGGGA GGGTCCTAAG GCCGCGCGAG
GGGAAGAGGG CCGTGGTCTA CGAGGTCGTG ACCTCCGGAA CCAAGGAGGT GGAGATCTCG
CGGAAGCGCA GAAAGGCGCT GAAGGGTGGG CAGTAG
 
Protein sequence
MLKLEWSAGT ILLRGSPPPS VAPYFRFDPR VKGYRALAIQ YRWIVEALRE AGVEFEDDVL 
HPPQCKLQAR EVQLRDYQEE ALERWMAGRR GVVVLPTGAG KTMVALAAIA RLACPTLIVV
PTLELMDQWE EGVRRHLGVA PGRYGGGEKE VGCVTVATYD SAYVNAEFLG DKFELLVFDE
VHHLPSPGYR QVAELSAAPW RMGLTATPER EDGLHELLPY LVGPVVYRRG VGEMAGKWLA
EFDVVRVYAE MSPEEREEYE RLTRTYRSFL RKRGLRIRGP RDFERLAALS VKDPEAREAL
LAWYRARRIA LHASSKMEVL EELLARHRGD KVLIFAEHGD VVRRISSRFL VPEITYRTPE
EERRAVMSAF RKGLVRAIVT SKVLEEGVDV PDANVAVILS GTASRREFVQ RLGRVLRPRE
GKRAVVYEVV TSGTKEVEIS RKRRKALKGG Q