Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0302 |
Symbol | |
ID | 4601125 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 268962 |
End bp | 270002 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 639773063 |
Product | flap endonuclease-1 |
Protein accession | YP_919715 |
Protein GI | 119719220 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) |
TIGRFAM ID | [TIGR03674] flap structure-specific endonuclease |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGGGGTTG ACATAAAGGA GCTTGTAGAG CCGGTAGCAA AGGAGGTGGA GCTGGGGTTC TTCTCGAAAA AAGTGATAGC CATAGATGCC TACAACTCGC TGTACCAGTT CTTGGCTACT ATAAGGCAGA AGGACGGAAC GCCGTTACTT GACGCGCAAG GAAACGTAAC AAGCCATCTC AACGGACTAT TCTATAGGAC GATAAACTAC ATAGAGTTAG GGATAAAGCC TGTGTACGTC TTCGACGGGA GGCCGCCGGA GTTGAAGCAG AAGGAACTGG AGAGGAGGTA CCAGATAAAA GTAGAAGCCG AGAAAAAATA CCGAGAAGCT ATAGAGAGGG GAGACCTCGA GGAAGCCCGT ATCTACGCTC AGCAGACTAG CAGGCTAACG GCAGCCATGG TTCATGACGC GAAGCTCCTT CTGCGCTACA TGGGCGTCCC CTACGTGGAG GCTCCGAGCG AGGGAGAAGC GCAGGCCGCT TACATGGTGA AGAAAGGCGA CGCCTGGGCC TCGGGAAGCC AGGACTTCGA CTCGCTACTA TTCGGAAGCC CCAGGCTCGT TAGGAATTTA GCCATAACTG GTAAGCGTAA ACTACCGCGG AAAGACGTCT ACGTCGAAGT AAAGCCCGAG ATTGTCGAGC TCGAAGAACT ACTTAGGGTA CACGGCATAA CACACCAGCA ACTCGTAGTT ATAGGTATTC TCGTAGGGAC GGACTATGCG CCAGAGGGAG CTAGGGGTAT TGGGGTGAAA AAGGCTTTGA AGCTAGTGAA GGAGCTGAAG GACCCCGAGA AAATTTTCAG ATCGGTGGAG TGGTCGAGCG ACGTGCCACC CGAGAAGATC CTAGAGCTAT TCCTGCACCC AGAAGTCACG GATAGCTACG AGCTTACCTG GAAAGAGCCC GACAAGGAGA AAGTAATCGA ACTACTGGTC GAAAGACATC AATTCTCGAT GGAACGCGTA ACGAACGCCT TGGATAGATT AGAAAAGGCA GTCAAAACCC ACTTTAAGCA ACAATCTTTG GAGTCCTGGT TCGGCTTCTA G
|
Protein sequence | MGVDIKELVE PVAKEVELGF FSKKVIAIDA YNSLYQFLAT IRQKDGTPLL DAQGNVTSHL NGLFYRTINY IELGIKPVYV FDGRPPELKQ KELERRYQIK VEAEKKYREA IERGDLEEAR IYAQQTSRLT AAMVHDAKLL LRYMGVPYVE APSEGEAQAA YMVKKGDAWA SGSQDFDSLL FGSPRLVRNL AITGKRKLPR KDVYVEVKPE IVELEELLRV HGITHQQLVV IGILVGTDYA PEGARGIGVK KALKLVKELK DPEKIFRSVE WSSDVPPEKI LELFLHPEVT DSYELTWKEP DKEKVIELLV ERHQFSMERV TNALDRLEKA VKTHFKQQSL ESWFGF
|
| |