Gene Information |
Locus tag | Huta_2731 |
Symbol | |
ID | 8385036 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 2799069 |
End bp | 2801603 |
Gene Length | 2535 bp |
Protein Length | 844 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644973805 |
Product | DNA topoisomerase I |
Protein accession | YP_003131625 |
Protein GI | 257053792 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01057] DNA topoisomerase I, archaeal |
|
|
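The coordinates, gene length, and protein length listed above appear to be mutually consistent. The following is a minimal sketch of that check, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information table above.
START_BP = 2799069
END_BP = 2801603
GENE_LENGTH_BP = 2535
PROTEIN_LENGTH_AA = 844


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Codon count minus one stop codon (assumes an unspliced, complete CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons hold for the values in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2801603 - 2799069 + 1 gives 2535 bp, and 2535 / 3 - 1 gives 844 aa, matching the table.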
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.186723 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCGCCTGA TCGTCACCGA GAAGGACAAC GCCGCCAGGC GGATCGCCGA GATCCTCAGC GACGACTCGG CGAGTGGTGA GCGGCGAAAT GGCGTCAACG TCTACAAGTG GGGCGGCCAG CGCGTCGTCG GCCTCTCCGG ACACGTCGTC GGCGTCGACT TCCCGCCGGA ATACGCCGAG TGGCGCGACG TCGAACCGGC CGAGTTGATC GACGCCGATG TCGTTACAAC GCCCACGCGG GAGAACATCG TCGCGACACT CCGTAGCCTC GCCAGAAAGG CCGACGAGGT CGTGATCGCG ACCGACTACG ACCGCGAGGG CGAACTCATC GGGAAGGAAG CCTACGAGTT GATCCGCGAG GAAACCGACG CGCCGGTAGA GCGCGTTCGC TTCTCCTCGA TTACCGAACG AGAAGTCCAG AAGGCCTTCG AGAATCCCGA CGAGATCGAC TTCGACCTCG CCGCCGCAGG CGAAGCCCGC CAGATCGTCG ACCTGATCTG GGGCGCGGCA CTGACCCGCT TCCTCTCGCT GTCTGCCAAA CAGCTCGGCA ACGACTTCAT CTCTGTCGGC CGTGTCCAGA GCCCGACACT GAAGCTGATC GTCGACCGCG AACGCGAGAT CGAGGCCTTC GATCCCGACG ACTACTGGGA GATCCTCGCC GACCTCGCGA AGGACGGCCA GGGCTTCGAG GCCCAGTACT TCTACGACGA CGACGGCAGC GAAGCCGAGC GGGTCTGGGA CGAAGCGGCC GCCGAAGCTG CCTACGCGGA TCTCTCCGAA TCCGGGACGG CCACCGTCAC CGAAGTTCGG CGGCGAACGC GTACCGACGA CCCGCCCGCG CCGTTCAACA CGACCGCGTT CATCTCCGCG GCGAGTTCGC TCGGCTACTC CGCCCAGCGC GCCATGTCGC TGGCCGAGGA ACTCTACACG GACGGCTACC TGACCTACCC ACGGACGGAC AACACCGTCT ATCCCGACGA TCTCGATCCC GAAGAGCTGT TGGGTGCCTT CGAAGGCAGT TACGAGTTCG GCGACGACGC GGCGTCGTTG CTCGATCAGG ACGAAATCGA GCCGACCCGT GGCGAGACCG AATCGACCGA CCACCCGCCG ATCCATCCGA CCGGCGAACT GCCGGATCGC GACGAGGTCG GCGCGGACGC CTGGGAGATC TACGAGCTCG TCGTCCGGCG GTTCTTCGCG ACCGTGGCCG AGCCGGCGAC CTGGGCGCAC CTGCGGGTCG TCGCCGAGGC GGCGGGCCGG TCGCTGAAGG CCAACGGCAA GCGCCTGCTC GAGGAGGGGT ACCACGCGGT CTATCCCTAC TCCTCCGCCG GCGAAACGTT CGTTCCCGAG GTCAGCGAGG GCGACAAGCT CTCGATCGAC GCGGTCGAAC TGGAGGCCAA ACAGACCCAG CCGCCGCGCC GCTACGGCCA GTCGCGGCTC ATCCAGCATA TGGAGGAGAT GGGCATCGGG ACGAAAGCGA CCCGGCACAA CATCGTCGAG AAACTCTACG ATCGCGGGTA CGTCGAGGAC GATCCACCGC GGCCGACCAA ACTCGCCGAG GCAGTGGTTT CCGCCGCCGA GGAGTACGCC GATCACGTCG TCAGCGAGGA GATGACCGCC CAGCTGGAGG CCGACATGAC CGCCATCGCC GAGGGGGAGG CGACCTTGGA AGAGGTGACC GCGGAATCCC GGGAGATGCT CGGCGAGATC TTCGAGGAAC TACGCGAGTC CCGCACGGAG ATCGGCGAGT TCCTCCAGGA GTCGCTGAAG GCCGACCGAA CGCTCGGGCC CTGTCCGGAG 
TGTGGCGAGG ATCTGCTGGT TCGCCGGAGT CGCCAGGGAT CGTACTTCGT CGGGTGTGAC GGCTTCCCAG AGTGTCGGTT CACCCTTCCA CTGCCGAGCA CGGGCGAACC ACAGGTGACC GACGAAACTT GCGAGGATCA CGATCTTCGG CACGTCAAGA TGCTCGCGGG CCGGGACACG TTCGTCCACG GCTGTCCGCG GTGCAAAGCG GAGGCAGCCG ACGAGAGCGA GGACGAGGTG ATCGGGCACT GTCCCGAATG TGGAGATAGC GCTGCGCCGG AGGGACAGCG AAACGAAGGC GGCGAAGCCG CCGAAGGGGA GGGCGGCAAG CTCGCGATCA AACACCTTCG CTCGGGTTCG CGACTAGTCG GCTGTACGCG CTATCCGGAC TGTGATTACT CGCTCCCGCT CCCCCGTCGT GGTGAGATCG TCGTGACCGA CGAGCGTTGC GAGGACCACG ACCTGCCCCA CATCGAAATC CACGACGGCG ATGACGACGA TCCCTGGGAA CTGGGCTGTC CGATCTGCAA CTACGAGGAG TTCCAGGCGC GCAACGCCGT CGAAAACTTG GAGGACCTCG ACGGGGTCGG CCCGGCGACG GCGGAGAAAC TACAGGACGC TGGCGTAGCG GACCCCGAGG ACCTCCACGA GGTGGACCCG GACGCCGTCG CCGGCGAGGT CCAGGGCGTC AGTGCCGATC GCATCCGGGA ATGGCAAGAC GAGTTGGCCG CCTGA
|
Protein sequence | MRLIVTEKDN AARRIAEILS DDSASGERRN GVNVYKWGGQ RVVGLSGHVV GVDFPPEYAE WRDVEPAELI DADVVTTPTR ENIVATLRSL ARKADEVVIA TDYDREGELI GKEAYELIRE ETDAPVERVR FSSITEREVQ KAFENPDEID FDLAAAGEAR QIVDLIWGAA LTRFLSLSAK QLGNDFISVG RVQSPTLKLI VDREREIEAF DPDDYWEILA DLAKDGQGFE AQYFYDDDGS EAERVWDEAA AEAAYADLSE SGTATVTEVR RRTRTDDPPA PFNTTAFISA ASSLGYSAQR AMSLAEELYT DGYLTYPRTD NTVYPDDLDP EELLGAFEGS YEFGDDAASL LDQDEIEPTR GETESTDHPP IHPTGELPDR DEVGADAWEI YELVVRRFFA TVAEPATWAH LRVVAEAAGR SLKANGKRLL EEGYHAVYPY SSAGETFVPE VSEGDKLSID AVELEAKQTQ PPRRYGQSRL IQHMEEMGIG TKATRHNIVE KLYDRGYVED DPPRPTKLAE AVVSAAEEYA DHVVSEEMTA QLEADMTAIA EGEATLEEVT AESREMLGEI FEELRESRTE IGEFLQESLK ADRTLGPCPE CGEDLLVRRS RQGSYFVGCD GFPECRFTLP LPSTGEPQVT DETCEDHDLR HVKMLAGRDT FVHGCPRCKA EAADESEDEV IGHCPECGDS AAPEGQRNEG GEAAEGEGGK LAIKHLRSGS RLVGCTRYPD CDYSLPLPRR GEIVVTDERC EDHDLPHIEI HDGDDDDPWE LGCPICNYEE FQARNAVENL EDLDGVGPAT AEKLQDAGVA DPEDLHEVDP DAVAGEVQGV SADRIREWQD ELAA
|
| |
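The sequences above are displayed in space-separated blocks of 10 residues, so they need to be cleaned before any downstream use. The following is a minimal sketch, not part of the original record, of stripping that grouping and computing a GC fraction. The helper names are hypothetical, and the demo input is only the first two blocks of the gene sequence, so its GC fraction is not expected to equal the 66% reported for the full gene.

```python
def clean_sequence(grouped):
    """Remove the whitespace used to group residues into blocks of 10."""
    return "".join(grouped.split())


def gc_fraction(dna):
    """Fraction of G and C bases in a DNA string (whitespace is ignored)."""
    seq = clean_sequence(dna).upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence above, used only as demo input.
demo = "GTGCGCCTGA TCGTCACCGA"
print(len(clean_sequence(demo)))  # 20
print(gc_fraction(demo))          # 0.65
```

To check the full record, paste the complete gene sequence into `demo`. The cleaned length can then be compared with the listed gene length of 2535 bp.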