Gene Mthe_0595 details


Gene Information

Locus tag: Mthe_0595
Symbol:
ID: 4461740
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosaeta thermophila PT
Kingdom: Archaea
Replicon accession: NC_008553
Strand:
Start bp: 615381
End bp: 617615
Gene Length: 2235 bp
Protein Length: 744 aa
Translation table: 11
GC content: 55%
IMG OID: 639699603
Product: DNA topoisomerase I
Protein accession: YP_843026
Protein GI: 116753908
COG category: [L] Replication, recombination and repair
COG ID: [COG0550] Topoisomerase IA
TIGRFAM ID: [TIGR01057] DNA topoisomerase I, archaeal
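
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the record or of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 615381
END_BP = 617615
GENE_LENGTH_BP = 2235
PROTEIN_LENGTH_AA = 744


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a complete CDS: codon count minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for the values in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

For this record, 617615 - 615381 + 1 gives 2235 bp, and 2235 / 3 - 1 gives 744 aa, so the reported lengths agree with the coordinates under these assumptions.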


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCACCTCA TCATTGCTGA GAAGCACGAT GCTGCGAAGA GAATATCCGA GATTCTTGCA 
GGTAAGAAAC CGACTGTTAC AAAGGTTTCC GGCATCGATA CATTTCGTTT TGATGACCGT
GTTGTTATAG GCCTCAGCGG CCATATTGTG GGCCTGGATT ATCCTGAGTG CTACAACAAC
TGGCATAAGG TCGATTACAG GGATCTCATC AGGGCGGAGA TCGTATCGAG ACCGACGCAG
GAGAGGATAG TCTCAGCTCT CAGAAAGCTC GGCAGAGAGG CAGACCGCGT AACGATCGCC
ACAGACTATG ATAGAGAGGG AGAGCTCATA GGTGTTGAGG CCCTCTGCAT ACTGCAGGAG
GTCAATCCAT CCATAAAAGC GGACAGGGTG CGGTTCAGCG CCATAACAAA AAGCGAGATT
CTGAGGGCAT TCGCCCAGCC GGAAGAGGTC GACTTCAATC TTGCCGCATC CGGCGAGGCG
AGGCAGATCA TAGACCTCGT GTGGGGCGCT GCACTGACGA GATTCATATC GATAACATCA
GGTCGGCTCG GGAAGGAGTT CCTCTCTGTG GGCAGGGTTC AGTCGCCGAC GCTGGCGCTG
ATCGTGGATC GCGAGAAGGA GATCGAGTCG TTCCAGCCGA AACCCTACTG GGAGATATAT
GCAGATCTCG AGAAGGGTAT CAGAGTAAGG CATTCAAAAT CCAGGATCTG GACCAGGGAC
GAGGTCGACC GCATATTGAA TTCTCTGAGC GATGTGGCAC GCGTCAGATC GATATCCAAA
GGGGAGCGGA GGGATAGGCC GCCAACGCCC TTCGATACCA CCAGCTTCAT CAGTGCGGCC
AGCAGCATCG GCTTCACAGC GGCAAACGCG ATGAGGATTG CAGAGTCCCT CTACACGAAT
GGCTACATCA GCTATCCAAG GACAGATAAC ACAGTGTACC CACCATCGAT CGATCTGAGA
TCGCTGCTCG AGATGCTCTC ATCAGGACCT TTCCGTGAGG ATGCACTGGA GCTGATGAAG
GGCAGCATCA CACCTACAAA AGGAAAACGC TCGACAACGG ACCATCCCCC GATATACCCG
ACGTCTGTAG CTGATAAAGA TGACCTCAAA GAGGATCAGT GGAAGATATA CGAGCTTGTT
GTAAGGCGGT TCTTCGCGAC ACTCTCTGGA GAGTGCGTCT GGGAGACCAC AAGCATCAGC
TTTGATATCG GGGATGAGGT ATTCAGGGCG AACGGCTCCA GGATTCTGGA CTTGGGATGG
AGAAGGTACT ATCCATACAG CAGGGCCGAG GAGAACGTTC TTCCTCCTCT CCGGGAGGGA
GAGGGGCTGA AGGTCCTCAG CCATGAGGTG CTGTCTAAAG AGACCCAGCC GCCTGCCAGG
TTCGGGCAGG GACGTCTCGT CAAGCTCATG GACGAGCTGG GCCTGGGTAC AAAGTCGACC
AGGCATGATA TAATCAGCAA GCTCTATGCG AGGGCCTATA TCCATGGCAA TCCGATAAGA
CCGACGAAGA CTGCGTACGC TGTTGTCGAC ACGCTCCAGC GCCATGCCCC TGCGATCACG
AAGCCTGAGA TGACAAGGAC TCTCGAGAAC GACATGCTCA AGATCGCAGA GCGGAAGATC
ACAAAGGGCG AGGTCATCGA GGAGTCGAGG CAGATGCTAG AAGCTGTATT CGATGAGCTT
CTCGCCCACA GGAAAGAGAT AGAAGATTCG CTGAGGGAGG GACTTCGTGT CGACAGGATC
GTGGGAAAGT GCAGGCTGTG CGGCTCCGAT CTGATTATAA GACGTGGCAG ACGAGGTGCC
AGGTTTGTGG GGTGTTCCGG GTATCCTGAG TGCAGGTTCA CGCTCCCCCT CCCCAGGGGT
GGCATGATCG TTGTCACGGA GCGGAGATGC GAGACGCACG ACATGAATCA CATCAGAATC
ATAAATCGTG GGAAGAGGCC ATGGGATCTC GGATGTCCAT ACTGCAACTT CAACAACTGG
CAGGCGAAAA AGGGATCAAA AACGCAGCGG ATGCCCGAGC TTGATGATAT CTCGGGGATA
GGTCCGAAGA GCAGGGAACG CCTCGAAAGC GCGGGAATAA AAACGCTGGA GGCGCTTGTC
TCCACAGACC CTGCCAGCAT CTCAGAGTCG ACTGGCATCA GCATCAAAAA GATCATCTCG
TGGCAGGAGT CGGCCAGGAG CATCATCCAC AAAGGAGAGC AGGGCGGCGA GGTGCCCGGA
TCATCCACGT GCTGA
 
Protein sequence
MHLIIAEKHD AAKRISEILA GKKPTVTKVS GIDTFRFDDR VVIGLSGHIV GLDYPECYNN 
WHKVDYRDLI RAEIVSRPTQ ERIVSALRKL GREADRVTIA TDYDREGELI GVEALCILQE
VNPSIKADRV RFSAITKSEI LRAFAQPEEV DFNLAASGEA RQIIDLVWGA ALTRFISITS
GRLGKEFLSV GRVQSPTLAL IVDREKEIES FQPKPYWEIY ADLEKGIRVR HSKSRIWTRD
EVDRILNSLS DVARVRSISK GERRDRPPTP FDTTSFISAA SSIGFTAANA MRIAESLYTN
GYISYPRTDN TVYPPSIDLR SLLEMLSSGP FREDALELMK GSITPTKGKR STTDHPPIYP
TSVADKDDLK EDQWKIYELV VRRFFATLSG ECVWETTSIS FDIGDEVFRA NGSRILDLGW
RRYYPYSRAE ENVLPPLREG EGLKVLSHEV LSKETQPPAR FGQGRLVKLM DELGLGTKST
RHDIISKLYA RAYIHGNPIR PTKTAYAVVD TLQRHAPAIT KPEMTRTLEN DMLKIAERKI
TKGEVIEESR QMLEAVFDEL LAHRKEIEDS LREGLRVDRI VGKCRLCGSD LIIRRGRRGA
RFVGCSGYPE CRFTLPLPRG GMIVVTERRC ETHDMNHIRI INRGKRPWDL GCPYCNFNNW
QAKKGSKTQR MPELDDISGI GPKSRERLES AGIKTLEALV STDPASISES TGISIKKIIS
WQESARSIIH KGEQGGEVPG SSTC
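
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch in plain Python, assuming the amino acid assignments of translation table 11, which are the same as those of the standard genetic code (table 11 differs in its set of permitted start codons; this gene starts with ATG, so no special start handling is needed here). The helper names `clean` and `translate` are illustrative. Only the first and last blocks of the gene sequence are shown; the full sequence can be pasted in the same space-separated form.

```python
from itertools import product

# Codon table built in TCAG order; the amino acid string follows the same order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to format the sequence blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-nt blocks and the final 15 nt of the gene sequence above.
first_blocks = "ATGCACCTCA TCATTGCTGA GAAGCACGAT"
last_blocks = "TCATCCACGT GCTGA"

print(translate(first_blocks))  # MHLIIAEKHD
print(translate(last_blocks))   # SSTC
```

The two outputs match the start and the end of the protein sequence listed above. The final block is in frame because the preceding 2220 nt are a multiple of three.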