Gene Tpen_0666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0666 
Symbol 
ID4601624 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp616421 
End bp617971 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content52% 
IMG OID639773439 
ProductDNA topoisomerase VI subunit B 
Protein accessionYP_920071 
Protein GI119719576 
COG category[L] Replication, recombination and repair 
COG ID[COG1389] DNA topoisomerase VI, subunit B 
TIGRFAM ID[TIGR01052] DNA topoisomerase VI, B subunit 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.448413 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGAGA AGTTCCGAGG CCTCAGTCCA GCGGAATTTT TCCATCGAAA CCGCGAGATA 
GCGGGGTTCT CGAATCCCGC CAGGGCGCTT TACCAAGCTG TAAGAGAGCT CGTCGAAAAC
TCTCTTGACG CGACTGAGAC TCATGGGATT CTACCGTTTA TAGACGTCGA GATAAGGTTG
CACGAGGAGA GGCCTGAGTG GGTTGTTTTA AGAGTCGCGG ACAACGGTAT CGGTATACCT
CTAAGCGAAG TGCCAAATGT GTTCGGCAGA GTGTTCTACG GGTCTAAGTA CGTTGTTCGT
CAGACGCGCG GTGTATTCGG GTTAGGAGTG AAGATGGCAG TGCTCTACGC GCAGATGACG
ACGGCTAGAC CTATCTATGT TAAGAGTTCT CCTATAAACT CGCGGTACGT CGGGGAGTAC
CTACTCTACA TAGACATTAG CAGGAACATT CCTCATGTGC AGAAAATGCG TATAAAGAAG
AAGACGAAGA ACTGGCACGG CACCATAGTT AAGCTAACGC TGGAAGGGTC TTGGGTTCAG
GCAAAGAAGA GAATAGAGGA CTACATTAGG CGTACCGCGC TTATCTCGCC TTACGCCACC
ATTAGGTACA GGTCTCCAGA CGGCGAGCTG ATTTTCAAAA GGGTTTCAAG GGAGCTCCCC
CAGCCTCCCG AGATCGGAAA GTATCATCCT CGGGGCGTTG ACGTAGAGGT ATTGAAGGAA
CTCATAAGGG CTACTAACAA TGCGTCTGAA GTTACGCTTC TAGAGTTTCT AGTAAAGCAC
TTCGAGGGCG TCGGGGAGAA GAAGGCTACG GAGTTCCTCC AGTGGAGCGG CTTCTCGCCG
GATACCAAGC TGACCGAGCT GAAGCTGGCG GACCTCGAAG TCCTCGCGTC GAAGATGAAG
ACTTTCCCTG GTTGGCGCCG CCCACGCCCG CTGACACTCT CGCCGCTAGG CGCGGATCTA
CTGAAGAAGG GCGTTAAGAG CATCCTGAAA CCAGAGTTCG TAGCCGCGGT GACGCGCCCC
CCCTCCTCGT ACAGTGGCCA CGCCTTTATA GTTGAGGCTG CGATAGCCTA TGGCGGCGAG
ATTCCTCCCC AAGATACTGT TATGCTACTC CGCTTTGCGA ATAAGATGCC TCTTCTCTAC
GACGAGGGTG TAGACGTGTC CAGGAAGATC ATTGACAGCA TAGACTGGAG TATCTACAAG
GTGAAGCTAC CTGCTCCCGT TGCCGTTGTG ACGCATGTGT GTTCTACGAA AATACCCTTC
AAGGGTGTTG GGAAAGAGGC TATAGCCGAT GTTCCGGAGG TTGAGCACGA GCTGGAGATA
GCTATTAGGG ACGTAGCAAG AAGGCTTAGG GCGTACTTGT CTAGGATGGA GAAGCTCTAC
GAGGTGAAGA GAAAGGAGGT AACAATCAGG AAGTACATGG GGGAAGTTTC AAGCGCGCTA
GCGTACATAG TCAACAGGGA TCCCGAGGAG ATTAACGCTT TAATCGAAGA GCTACTTAAG
AAAGAACTAG CGAAGAAAGA GGTGAGGCCG GATGTCGTCT CAGAGTCCTA A
 
Protein sequence
MSEKFRGLSP AEFFHRNREI AGFSNPARAL YQAVRELVEN SLDATETHGI LPFIDVEIRL 
HEERPEWVVL RVADNGIGIP LSEVPNVFGR VFYGSKYVVR QTRGVFGLGV KMAVLYAQMT
TARPIYVKSS PINSRYVGEY LLYIDISRNI PHVQKMRIKK KTKNWHGTIV KLTLEGSWVQ
AKKRIEDYIR RTALISPYAT IRYRSPDGEL IFKRVSRELP QPPEIGKYHP RGVDVEVLKE
LIRATNNASE VTLLEFLVKH FEGVGEKKAT EFLQWSGFSP DTKLTELKLA DLEVLASKMK
TFPGWRRPRP LTLSPLGADL LKKGVKSILK PEFVAAVTRP PSSYSGHAFI VEAAIAYGGE
IPPQDTVMLL RFANKMPLLY DEGVDVSRKI IDSIDWSIYK VKLPAPVAVV THVCSTKIPF
KGVGKEAIAD VPEVEHELEI AIRDVARRLR AYLSRMEKLY EVKRKEVTIR KYMGEVSSAL
AYIVNRDPEE INALIEELLK KELAKKEVRP DVVSES