Gene Tpen_0426 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0426 
Symbol 
ID4602105 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp386639 
End bp387664 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content54% 
IMG OID639773191 
Productflap endonuclease-1 
Protein accessionYP_919838 
Protein GI119719343 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) 
TIGRFAM ID[TIGR03674] flap structure-specific endonuclease 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCGTAA ACCTTACTCC TCTGGTAAAG CCAAAGAAGA TAGGTCTTCA AGGGGTACGC 
GGGAGGGTGC TCGCGGTAGA CGCTTTGAAC TCGGTGTACC AGTTCCTGGC TCTAGTAAGG
GATGAACGCG GAATGTTGTT TACGAATTCG AGAGGCGAGG TAACGTCGCA CCTCATCGGG
TTGCTGTCCC GTTACTCGAG GCTCGCCTAC GAGTACGACG CGAGCTTTAT CTTCGTCTTC
GATGGATCCC CTCACCCCCT AAAGGCAAGG GAGCTAGAAA AGAGGAAAAA ACAAAGAGAG
AAGGCTAAGC AGGAGTACGC GGAGCTTCTC TCGAAGGGTG ACCTCAGGAA GGCCTTCTCG
AAGGCTGTGG TCTCTGCCGA GGTCGACGAC CGCATAGTTG AGAGCACAAA GAGGCTAGTT
AGGCTCATGG GGTTCCCCGT TGTTGACGCG GTGCACGACG CTGAAGCTCA AGCCGCCTAC
CTCGTGAAAA GGGGGGAGGC CTGGGCTGTA AGCTCCATGG ACTGGGACTC GTTGCTGTAC
GGTTCGCCGA GGCTCGTAAG GTACCTCACA CTCACGGGGT TCGAGTGGCT TCCCAGCAAG
CAAAAAGCGA GGAAACTCAT TCCGGAACTA GTAACGTTAG AGGAGCTCCT AAGCGGGCAC
GGGATTACTC TGAGGCAACT CGTAGACATA GCCATACTCG TGGGGACAGA CTACAACGAG
GGCGTAAAAG GAGTTGGGCC TCTCCGGGCA TTAAAGATGA TTAAACGCTA CGGATCGCTG
GAAAACTTAC CCGTTAGCGT ACGGAGGCAT CTCCCAGAGA ACTACGAGGC CGTGAGGCAA
ATCTTTCTAA ATCCTCCCTT AAACGAGAAA TACTCCCTGA GTTTCTCGCC ACCAGATAAA
GAGGGTTTAA TGAAGTTCCT GGTGGACGAG AACGACTTCT CGCCTAAGAG AGTAGAAGTG
TACTTGAGGA GGCTCGAGGC CGCGTATGAG AGAAGGAAAG GCGGACTCAC GAGGTTCCTC
GCGTAG
 
Protein sequence
MGVNLTPLVK PKKIGLQGVR GRVLAVDALN SVYQFLALVR DERGMLFTNS RGEVTSHLIG 
LLSRYSRLAY EYDASFIFVF DGSPHPLKAR ELEKRKKQRE KAKQEYAELL SKGDLRKAFS
KAVVSAEVDD RIVESTKRLV RLMGFPVVDA VHDAEAQAAY LVKRGEAWAV SSMDWDSLLY
GSPRLVRYLT LTGFEWLPSK QKARKLIPEL VTLEELLSGH GITLRQLVDI AILVGTDYNE
GVKGVGPLRA LKMIKRYGSL ENLPVSVRRH LPENYEAVRQ IFLNPPLNEK YSLSFSPPDK
EGLMKFLVDE NDFSPKRVEV YLRRLEAAYE RRKGGLTRFL A