Gene Tpen_0885 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0885 
Symbol 
ID4600828 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp832839 
End bp834734 
Gene Length1896 bp 
Protein Length631 aa 
Translation table11 
GC content57% 
IMG OID639773663 
ProductDNA topoisomerase 
Protein accessionYP_920289 
Protein GI119719794 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0843416 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTACA AGGCAGTCAT AGTAGCAGAG AAAAACTCTG TTGCGCGTGC AATAGCCAGC 
TTCCTAGGCG GGAGTAGCGT CCGGAGGTTC CGCGTCGAGA GGGTGCCTGC CTACGAGTTC
CTCTGGGAGG GAGGAAAGCA CCTATCCATA GGCGTGAGCG GGCACATACT TGACTTCGAC
TTCCCCGAGG AGTATAACAA GTGGGAGAGC GTCGACCCCC GGGTACTCTT CTTCACGAAC
CCCGTGCTCG TCACTCGGGA AGGCGCGTAC ATCTACGTCA GAGCCTTGAA GACTCTGGCC
CGGCAGACGT CGACAGTGAT ACTCGCGCTT GACGCGGACG TCGAGGGCGA GGCGATAGCC
TTCGAAGTGA TGAGGATAAT GAAGAGCGTG AACCCGGAGC TCGAGTTCCG TAGAGCGTGG
TTCAGCGCGG TGACTAAAAG CGATATACTG GAGGCTTTCA GGAAGCTGAG GGAGCCGAAC
GAGAACCTGG CGAACAAGGC TTTTGCTAGG ATGGTTGTAG ACCTCACGAT AGGTGCGAGC
TTCACGCGTA TACTCACTCT CTCGGCTAGG AGGAACGGCG GGGTAATGCC GAGGGGTAGC
TTCCTGAGCT ACGGGCCATG CCAGACGCCG GTCCTCTACC TGGTGGTTAA GAGGGAGCTC
GAAAGGGAGC AGTTCAAGAA GAAGAAGTAC TACGTCTTGA GGGTCAGGTT TAGGAGCCAG
GAAGGCGTGT TCACGGCCTC CGCCACGTTC GAAGACCAGG AGAAAGCGAA GAGTGCTCTC
GATAGCGTTA AGAGGACCGG CGTGGGGGTC GTTGTATCAG CGGAGTTCAA CGCGGTAGAG
GTGGAGCCCC CCGTTCCTCT GAACACGGTG TACCTCGAGA GCAGGGCGAG CCTTTTCTTG
AACCTCAGGC CCAAGGAAAC TCTCTCGATA GCGGAGAAGC TTTACTCCTT CGGCTACATA
TCCTACCCGA GAACCGAGAC CACAATCTAC CCGCCGACGC TTAACCTGCG CGGAATAGCG
TCCATGTTCA CCAGGTGGGA GGACGCCGGC TGGTACGTAG CCAAGGTGCT TGCAAAAGGC
TTCACGCCGA CGCGGGGACG GGAAGACGAC AAAGCCCACC CGCCCATCTA CCCGACGAGG
AGCGCGTCCC GCGAGGAGAT AACCAGAAGG TTCGGCGAGA AGGCGTGGAA AGTCTACGAG
TACGTCGTCA GGCACTTCCT AGCAACGATC AGCGAGAACG CAGTAGTCGA GAGGCAGAGG
GTAGTCGTGA AGGTGGGGGA GCTCTCCCTG CAGTCGGAGG GCCGAAAGCT GGTCTATGCT
GGTTTCTACA ACGTCTACAA GTACTCCGTC CCGAAGGACG AGCCCTTGCC GTACGTCGTG
GAAGGAGAGG AAGTCGAAGT CGAGGATGCA AAAATCGAGG CGAGAACCAC CCAGCCTCCA
CCCTACCTCA GCGAGTCTGA GCTACTCGCC CTGATGAAGA AGTACGGCAT TGGTACGGAT
GCCACAATGC AGGAGCACAT ACACACCAAC ATAGAGAGGA GATACTTCGT AGTTAAGAAC
AAGCGGTGCA TACCCACGCC TCTAGGCAGG ACTCTCGCCC TGGCGCTCTA CGAGACCGTC
CCGGAGCTCG TACTACCAGA GGTGCGCGGC AAGATGGAAG CCTCACTCTC GAAGATAGCT
ACTGGCGAGA GAACCCCCGA GGAAGTCGTA AACGAGATGC GTAGCGAGTT CCTGGAGTAC
TACGACCGCC TAGTCGAGCG CATAGACTAC GTCTCCAAGA AGATAGTAGA GGGGCTGAAA
ATGGTCTTCC AGGACGAAAA GCAGCAAGCT CGAGCAGGCT CAGTGGGCGA AGAACGTTCG
CGCTCCACGC GTACACACCG TAAAGGTAAA AAGTAG
 
Protein sequence
MSYKAVIVAE KNSVARAIAS FLGGSSVRRF RVERVPAYEF LWEGGKHLSI GVSGHILDFD 
FPEEYNKWES VDPRVLFFTN PVLVTREGAY IYVRALKTLA RQTSTVILAL DADVEGEAIA
FEVMRIMKSV NPELEFRRAW FSAVTKSDIL EAFRKLREPN ENLANKAFAR MVVDLTIGAS
FTRILTLSAR RNGGVMPRGS FLSYGPCQTP VLYLVVKREL EREQFKKKKY YVLRVRFRSQ
EGVFTASATF EDQEKAKSAL DSVKRTGVGV VVSAEFNAVE VEPPVPLNTV YLESRASLFL
NLRPKETLSI AEKLYSFGYI SYPRTETTIY PPTLNLRGIA SMFTRWEDAG WYVAKVLAKG
FTPTRGREDD KAHPPIYPTR SASREEITRR FGEKAWKVYE YVVRHFLATI SENAVVERQR
VVVKVGELSL QSEGRKLVYA GFYNVYKYSV PKDEPLPYVV EGEEVEVEDA KIEARTTQPP
PYLSESELLA LMKKYGIGTD ATMQEHIHTN IERRYFVVKN KRCIPTPLGR TLALALYETV
PELVLPEVRG KMEASLSKIA TGERTPEEVV NEMRSEFLEY YDRLVERIDY VSKKIVEGLK
MVFQDEKQQA RAGSVGEERS RSTRTHRKGK K