Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0666 |
Symbol | |
ID | 4601624 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 616421 |
End bp | 617971 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 639773439 |
Product | DNA topoisomerase VI subunit B |
Protein accession | YP_920071 |
Protein GI | 119719576 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1389] DNA topoisomerase VI, subunit B |
TIGRFAM ID | [TIGR01052] DNA topoisomerase VI, B subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.448413 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGAGA AGTTCCGAGG CCTCAGTCCA GCGGAATTTT TCCATCGAAA CCGCGAGATA GCGGGGTTCT CGAATCCCGC CAGGGCGCTT TACCAAGCTG TAAGAGAGCT CGTCGAAAAC TCTCTTGACG CGACTGAGAC TCATGGGATT CTACCGTTTA TAGACGTCGA GATAAGGTTG CACGAGGAGA GGCCTGAGTG GGTTGTTTTA AGAGTCGCGG ACAACGGTAT CGGTATACCT CTAAGCGAAG TGCCAAATGT GTTCGGCAGA GTGTTCTACG GGTCTAAGTA CGTTGTTCGT CAGACGCGCG GTGTATTCGG GTTAGGAGTG AAGATGGCAG TGCTCTACGC GCAGATGACG ACGGCTAGAC CTATCTATGT TAAGAGTTCT CCTATAAACT CGCGGTACGT CGGGGAGTAC CTACTCTACA TAGACATTAG CAGGAACATT CCTCATGTGC AGAAAATGCG TATAAAGAAG AAGACGAAGA ACTGGCACGG CACCATAGTT AAGCTAACGC TGGAAGGGTC TTGGGTTCAG GCAAAGAAGA GAATAGAGGA CTACATTAGG CGTACCGCGC TTATCTCGCC TTACGCCACC ATTAGGTACA GGTCTCCAGA CGGCGAGCTG ATTTTCAAAA GGGTTTCAAG GGAGCTCCCC CAGCCTCCCG AGATCGGAAA GTATCATCCT CGGGGCGTTG ACGTAGAGGT ATTGAAGGAA CTCATAAGGG CTACTAACAA TGCGTCTGAA GTTACGCTTC TAGAGTTTCT AGTAAAGCAC TTCGAGGGCG TCGGGGAGAA GAAGGCTACG GAGTTCCTCC AGTGGAGCGG CTTCTCGCCG GATACCAAGC TGACCGAGCT GAAGCTGGCG GACCTCGAAG TCCTCGCGTC GAAGATGAAG ACTTTCCCTG GTTGGCGCCG CCCACGCCCG CTGACACTCT CGCCGCTAGG CGCGGATCTA CTGAAGAAGG GCGTTAAGAG CATCCTGAAA CCAGAGTTCG TAGCCGCGGT GACGCGCCCC CCCTCCTCGT ACAGTGGCCA CGCCTTTATA GTTGAGGCTG CGATAGCCTA TGGCGGCGAG ATTCCTCCCC AAGATACTGT TATGCTACTC CGCTTTGCGA ATAAGATGCC TCTTCTCTAC GACGAGGGTG TAGACGTGTC CAGGAAGATC ATTGACAGCA TAGACTGGAG TATCTACAAG GTGAAGCTAC CTGCTCCCGT TGCCGTTGTG ACGCATGTGT GTTCTACGAA AATACCCTTC AAGGGTGTTG GGAAAGAGGC TATAGCCGAT GTTCCGGAGG TTGAGCACGA GCTGGAGATA GCTATTAGGG ACGTAGCAAG AAGGCTTAGG GCGTACTTGT CTAGGATGGA GAAGCTCTAC GAGGTGAAGA GAAAGGAGGT AACAATCAGG AAGTACATGG GGGAAGTTTC AAGCGCGCTA GCGTACATAG TCAACAGGGA TCCCGAGGAG ATTAACGCTT TAATCGAAGA GCTACTTAAG AAAGAACTAG CGAAGAAAGA GGTGAGGCCG GATGTCGTCT CAGAGTCCTA A
|
Protein sequence | MSEKFRGLSP AEFFHRNREI AGFSNPARAL YQAVRELVEN SLDATETHGI LPFIDVEIRL HEERPEWVVL RVADNGIGIP LSEVPNVFGR VFYGSKYVVR QTRGVFGLGV KMAVLYAQMT TARPIYVKSS PINSRYVGEY LLYIDISRNI PHVQKMRIKK KTKNWHGTIV KLTLEGSWVQ AKKRIEDYIR RTALISPYAT IRYRSPDGEL IFKRVSRELP QPPEIGKYHP RGVDVEVLKE LIRATNNASE VTLLEFLVKH FEGVGEKKAT EFLQWSGFSP DTKLTELKLA DLEVLASKMK TFPGWRRPRP LTLSPLGADL LKKGVKSILK PEFVAAVTRP PSSYSGHAFI VEAAIAYGGE IPPQDTVMLL RFANKMPLLY DEGVDVSRKI IDSIDWSIYK VKLPAPVAVV THVCSTKIPF KGVGKEAIAD VPEVEHELEI AIRDVARRLR AYLSRMEKLY EVKRKEVTIR KYMGEVSSAL AYIVNRDPEE INALIEELLK KELAKKEVRP DVVSES
|
| |