Gene Tpen_0302 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0302 
Symbol 
ID4601125 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp268962 
End bp270002 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content52% 
IMG OID639773063 
Productflap endonuclease-1 
Protein accessionYP_919715 
Protein GI119719220 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) 
TIGRFAM ID[TIGR03674] flap structure-specific endonuclease 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGGGGTTG ACATAAAGGA GCTTGTAGAG CCGGTAGCAA AGGAGGTGGA GCTGGGGTTC 
TTCTCGAAAA AAGTGATAGC CATAGATGCC TACAACTCGC TGTACCAGTT CTTGGCTACT
ATAAGGCAGA AGGACGGAAC GCCGTTACTT GACGCGCAAG GAAACGTAAC AAGCCATCTC
AACGGACTAT TCTATAGGAC GATAAACTAC ATAGAGTTAG GGATAAAGCC TGTGTACGTC
TTCGACGGGA GGCCGCCGGA GTTGAAGCAG AAGGAACTGG AGAGGAGGTA CCAGATAAAA
GTAGAAGCCG AGAAAAAATA CCGAGAAGCT ATAGAGAGGG GAGACCTCGA GGAAGCCCGT
ATCTACGCTC AGCAGACTAG CAGGCTAACG GCAGCCATGG TTCATGACGC GAAGCTCCTT
CTGCGCTACA TGGGCGTCCC CTACGTGGAG GCTCCGAGCG AGGGAGAAGC GCAGGCCGCT
TACATGGTGA AGAAAGGCGA CGCCTGGGCC TCGGGAAGCC AGGACTTCGA CTCGCTACTA
TTCGGAAGCC CCAGGCTCGT TAGGAATTTA GCCATAACTG GTAAGCGTAA ACTACCGCGG
AAAGACGTCT ACGTCGAAGT AAAGCCCGAG ATTGTCGAGC TCGAAGAACT ACTTAGGGTA
CACGGCATAA CACACCAGCA ACTCGTAGTT ATAGGTATTC TCGTAGGGAC GGACTATGCG
CCAGAGGGAG CTAGGGGTAT TGGGGTGAAA AAGGCTTTGA AGCTAGTGAA GGAGCTGAAG
GACCCCGAGA AAATTTTCAG ATCGGTGGAG TGGTCGAGCG ACGTGCCACC CGAGAAGATC
CTAGAGCTAT TCCTGCACCC AGAAGTCACG GATAGCTACG AGCTTACCTG GAAAGAGCCC
GACAAGGAGA AAGTAATCGA ACTACTGGTC GAAAGACATC AATTCTCGAT GGAACGCGTA
ACGAACGCCT TGGATAGATT AGAAAAGGCA GTCAAAACCC ACTTTAAGCA ACAATCTTTG
GAGTCCTGGT TCGGCTTCTA G
 
Protein sequence
MGVDIKELVE PVAKEVELGF FSKKVIAIDA YNSLYQFLAT IRQKDGTPLL DAQGNVTSHL 
NGLFYRTINY IELGIKPVYV FDGRPPELKQ KELERRYQIK VEAEKKYREA IERGDLEEAR
IYAQQTSRLT AAMVHDAKLL LRYMGVPYVE APSEGEAQAA YMVKKGDAWA SGSQDFDSLL
FGSPRLVRNL AITGKRKLPR KDVYVEVKPE IVELEELLRV HGITHQQLVV IGILVGTDYA
PEGARGIGVK KALKLVKELK DPEKIFRSVE WSSDVPPEKI LELFLHPEVT DSYELTWKEP
DKEKVIELLV ERHQFSMERV TNALDRLEKA VKTHFKQQSL ESWFGF