Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0885 |
Symbol | |
ID | 4600828 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 832839 |
End bp | 834734 |
Gene Length | 1896 bp |
Protein Length | 631 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 639773663 |
Product | DNA topoisomerase |
Protein accession | YP_920289 |
Protein GI | 119719794 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0843416 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTACA AGGCAGTCAT AGTAGCAGAG AAAAACTCTG TTGCGCGTGC AATAGCCAGC TTCCTAGGCG GGAGTAGCGT CCGGAGGTTC CGCGTCGAGA GGGTGCCTGC CTACGAGTTC CTCTGGGAGG GAGGAAAGCA CCTATCCATA GGCGTGAGCG GGCACATACT TGACTTCGAC TTCCCCGAGG AGTATAACAA GTGGGAGAGC GTCGACCCCC GGGTACTCTT CTTCACGAAC CCCGTGCTCG TCACTCGGGA AGGCGCGTAC ATCTACGTCA GAGCCTTGAA GACTCTGGCC CGGCAGACGT CGACAGTGAT ACTCGCGCTT GACGCGGACG TCGAGGGCGA GGCGATAGCC TTCGAAGTGA TGAGGATAAT GAAGAGCGTG AACCCGGAGC TCGAGTTCCG TAGAGCGTGG TTCAGCGCGG TGACTAAAAG CGATATACTG GAGGCTTTCA GGAAGCTGAG GGAGCCGAAC GAGAACCTGG CGAACAAGGC TTTTGCTAGG ATGGTTGTAG ACCTCACGAT AGGTGCGAGC TTCACGCGTA TACTCACTCT CTCGGCTAGG AGGAACGGCG GGGTAATGCC GAGGGGTAGC TTCCTGAGCT ACGGGCCATG CCAGACGCCG GTCCTCTACC TGGTGGTTAA GAGGGAGCTC GAAAGGGAGC AGTTCAAGAA GAAGAAGTAC TACGTCTTGA GGGTCAGGTT TAGGAGCCAG GAAGGCGTGT TCACGGCCTC CGCCACGTTC GAAGACCAGG AGAAAGCGAA GAGTGCTCTC GATAGCGTTA AGAGGACCGG CGTGGGGGTC GTTGTATCAG CGGAGTTCAA CGCGGTAGAG GTGGAGCCCC CCGTTCCTCT GAACACGGTG TACCTCGAGA GCAGGGCGAG CCTTTTCTTG AACCTCAGGC CCAAGGAAAC TCTCTCGATA GCGGAGAAGC TTTACTCCTT CGGCTACATA TCCTACCCGA GAACCGAGAC CACAATCTAC CCGCCGACGC TTAACCTGCG CGGAATAGCG TCCATGTTCA CCAGGTGGGA GGACGCCGGC TGGTACGTAG CCAAGGTGCT TGCAAAAGGC TTCACGCCGA CGCGGGGACG GGAAGACGAC AAAGCCCACC CGCCCATCTA CCCGACGAGG AGCGCGTCCC GCGAGGAGAT AACCAGAAGG TTCGGCGAGA AGGCGTGGAA AGTCTACGAG TACGTCGTCA GGCACTTCCT AGCAACGATC AGCGAGAACG CAGTAGTCGA GAGGCAGAGG GTAGTCGTGA AGGTGGGGGA GCTCTCCCTG CAGTCGGAGG GCCGAAAGCT GGTCTATGCT GGTTTCTACA ACGTCTACAA GTACTCCGTC CCGAAGGACG AGCCCTTGCC GTACGTCGTG GAAGGAGAGG AAGTCGAAGT CGAGGATGCA AAAATCGAGG CGAGAACCAC CCAGCCTCCA CCCTACCTCA GCGAGTCTGA GCTACTCGCC CTGATGAAGA AGTACGGCAT TGGTACGGAT GCCACAATGC AGGAGCACAT ACACACCAAC ATAGAGAGGA GATACTTCGT AGTTAAGAAC AAGCGGTGCA TACCCACGCC TCTAGGCAGG ACTCTCGCCC TGGCGCTCTA CGAGACCGTC CCGGAGCTCG TACTACCAGA GGTGCGCGGC AAGATGGAAG CCTCACTCTC GAAGATAGCT ACTGGCGAGA GAACCCCCGA GGAAGTCGTA AACGAGATGC GTAGCGAGTT CCTGGAGTAC TACGACCGCC TAGTCGAGCG CATAGACTAC GTCTCCAAGA AGATAGTAGA GGGGCTGAAA ATGGTCTTCC AGGACGAAAA GCAGCAAGCT CGAGCAGGCT CAGTGGGCGA AGAACGTTCG CGCTCCACGC GTACACACCG TAAAGGTAAA AAGTAG
|
Protein sequence | MSYKAVIVAE KNSVARAIAS FLGGSSVRRF RVERVPAYEF LWEGGKHLSI GVSGHILDFD FPEEYNKWES VDPRVLFFTN PVLVTREGAY IYVRALKTLA RQTSTVILAL DADVEGEAIA FEVMRIMKSV NPELEFRRAW FSAVTKSDIL EAFRKLREPN ENLANKAFAR MVVDLTIGAS FTRILTLSAR RNGGVMPRGS FLSYGPCQTP VLYLVVKREL EREQFKKKKY YVLRVRFRSQ EGVFTASATF EDQEKAKSAL DSVKRTGVGV VVSAEFNAVE VEPPVPLNTV YLESRASLFL NLRPKETLSI AEKLYSFGYI SYPRTETTIY PPTLNLRGIA SMFTRWEDAG WYVAKVLAKG FTPTRGREDD KAHPPIYPTR SASREEITRR FGEKAWKVYE YVVRHFLATI SENAVVERQR VVVKVGELSL QSEGRKLVYA GFYNVYKYSV PKDEPLPYVV EGEEVEVEDA KIEARTTQPP PYLSESELLA LMKKYGIGTD ATMQEHIHTN IERRYFVVKN KRCIPTPLGR TLALALYETV PELVLPEVRG KMEASLSKIA TGERTPEEVV NEMRSEFLEY YDRLVERIDY VSKKIVEGLK MVFQDEKQQA RAGSVGEERS RSTRTHRKGK K
|
| |