Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0839 |
Symbol | |
ID | 5105199 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 768427 |
End bp | 770472 |
Gene Length | 2046 bp |
Protein Length | 681 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640506743 |
Product | DNA topoisomerase type IA central domain-containing protein |
Protein accession | YP_001190937 |
Protein GI | 146303621 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01056] DNA topoisomerase III, bacteria and conjugative plasmid |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0618738 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCTCG TAATCACGGA GAAGCCCAGC GTTGCGCTGG ATATAGCGAG GGCCCTGGGT AAGCCAACGA GAAGGCAAGG TTACCTCGAG GTTGGGGAAT ACCTAGTTAC GTGGACCTAC GGTCATCTCC TCGAGATTGG CGAGATTGCG CCGAAGAGGT GGAATTTGAA GGATCTTCCC ATTTTCCCTG AGAAGTTTGA GTACGACTTG ATCAAGGGTA AGGAGAGTCA GTTCAAGGTT GTGAGGAAAC TGCTTGAGGG AGCTGACGCG GTAATAAACT GCGGTGATGC CGGGAGAGAG GGCGAGCTCA TAGTGAGGGA ACTCCTCGAG TTCACGGGAT ATCGCGGGAA AGTGTTGAGG CTCTGGACCT CGGAGGCCCT CACACGGGAC GTGGTGTTGA GAGAGCTCAG GAGATTGAGG CCAAGCTCGG AGTTTGACAG CCTCTACTAC AGTGCCCTAG CCAGGCAGAA CGGCGACTGG ATCGTGGGGA TTAACCTGAC CAGGCTCGTC ACCTTGAAGG CAGGCGGTGG GGAGGTCTGG AGTGTTGGGA GGGTCCAAAC TCCCACCCTC GCGATGATCG TGAAGAGGGA CAGGGAGATA GAGGCGTTCA AGCCAGAGAT CTACTACGTT GTGCTCGCGG GGTTTGAGGG AGAGATGAAG GGTGTGATGT TGAGGAATGG GGAGGAGGCA AGACTCGCAA GGGAGGAGGC GGAGCAGGTC GTGAATGCCT TGAAAAGCGT GAGGAGCGGG AGAGTTGAGA AAGTGGACGT GGAGAGAAGA GAGGAGAGGC CCCCACTACT CCACTCGCTC ACTTCCCTGC AGAGGGAGGC AAACACGCTG TATGGCCTGT CCGCAAAGAG GACCCTCGAC GTTGCCCAAT CCCTGTACGA GGAGTGGAAG CTCATAAGTT ATCCCAGGAC TGACGCACGC TACCTTGGGG AGGGGAACAG GGATCTGGTT AAGGACGTCT TAAGGAAGCT TGGAAGGGGA GAACTGGTCC CTAGGGTAGA CCGCGTGGGG AAGAGGGTAT TTGATTCCTC AAAGCTCACT GACCACCACG CAATCATCCC GCTGGATAGG CCACCCGAGA ACCTTCCGGC AATTCACAGG AAGGTGTACG ACCTCGTGTA CAGAAAGTTT GTTGGGGCCT TCATGGACGA TTACGTGTAT GAGTTGCAGA GGGTATTCAT AAGACTTGAC GGTGAGCTCT TCCTGGTTGA GGGAAAGAGG AACCTCCAGC TGGGCTGGAT GGAGCTTTAT CCCCACGAGG ATAACCCCCT GAAAATCCCG AGCGGGGAGG TGAGGAAGGA GTGGGTGAAG GCCGAGGAGA GGCAGACCAA ACCCCCCGCA AGGTTCACCG AGGCCTCCCT TCTAAGGGAA ATGGAGAGGC TAGGGCTGGG GACACCGGCG ACCAGGGCCG GAATAATAGA GACACTCCTC GAGAGGGGTT ACGTGGAGAG GAGGGGGAAA TCCCTTTACT CCACGGATAA GGGGAGAGAA CTCGTGGATA AGCTGGGGGA TAGCAAGGTT GTTAGCCCCG ACATGACGGC GGAGTGGGAG AGGCAACTTG AGGAGATCTA CGTGAAGAGA TTGGGGGAGA AGGGGTATCA AGAATTCATG GAGGGAATAA GGAGGTTCAC AAGGGAGGAG GTCGAGAGGC TCATGAAGAG AGAGTTTAAG GTGGAGAGAA GGGCAACCCC TGAAATGCTG AGGTTAGCTA GGGCCGTGTC CAGGGATCTC GGGGTTAAGC TTGAGGGAAC TGGGATGGAG GAGGTAAAGA GGTTTCTGGA TGAGAACCTT CCAAAGATGA GGATTACGTG TAAATGCGGT GGGGAAGTGG TAGGGTTTTC AAGAGGGTGG AAATGCAGGA AGTGCGGGAC AGTGGTGTGG AGGGAAATAG CTGGGAAGAA GATAACCTTC AGGCAGGCTA AGTCCTTGTT TCAAGGAAAG GAGTTGAAGA TGAAGGGGTT CAGATCTAGG ACGGGGAAGA GATTCAGTGC AACCGTATAC CTAGAGGACG GGAAGGTAAA GTTCAAATTT GAGTGA
|
Protein sequence | MKLVITEKPS VALDIARALG KPTRRQGYLE VGEYLVTWTY GHLLEIGEIA PKRWNLKDLP IFPEKFEYDL IKGKESQFKV VRKLLEGADA VINCGDAGRE GELIVRELLE FTGYRGKVLR LWTSEALTRD VVLRELRRLR PSSEFDSLYY SALARQNGDW IVGINLTRLV TLKAGGGEVW SVGRVQTPTL AMIVKRDREI EAFKPEIYYV VLAGFEGEMK GVMLRNGEEA RLAREEAEQV VNALKSVRSG RVEKVDVERR EERPPLLHSL TSLQREANTL YGLSAKRTLD VAQSLYEEWK LISYPRTDAR YLGEGNRDLV KDVLRKLGRG ELVPRVDRVG KRVFDSSKLT DHHAIIPLDR PPENLPAIHR KVYDLVYRKF VGAFMDDYVY ELQRVFIRLD GELFLVEGKR NLQLGWMELY PHEDNPLKIP SGEVRKEWVK AEERQTKPPA RFTEASLLRE MERLGLGTPA TRAGIIETLL ERGYVERRGK SLYSTDKGRE LVDKLGDSKV VSPDMTAEWE RQLEEIYVKR LGEKGYQEFM EGIRRFTREE VERLMKREFK VERRATPEML RLARAVSRDL GVKLEGTGME EVKRFLDENL PKMRITCKCG GEVVGFSRGW KCRKCGTVVW REIAGKKITF RQAKSLFQGK ELKMKGFRSR TGKRFSATVY LEDGKVKFKF E
|
| |