Gene Information |
Locus tag | Hmuk_0800 |
Symbol | |
ID | 8410314 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | + |
Start bp | 772961 |
End bp | 775516 |
Gene Length | 2556 bp |
Protein Length | 851 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645019135 |
Product | Hef nuclease |
Protein accession | YP_003176638 |
Protein GI | 257386865 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1111] ERCC4-like helicases |
TIGRFAM ID | |
|
|
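As a consistency check on the coordinates above, the following minimal Python sketch recomputes the gene length from the start and end positions and derives the protein length from it. It assumes 1-based, inclusive coordinates (an assumption, though it agrees with the listed 2556 bp) and a single terminal stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS ending in one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(772961, 775516)
print(length)                  # 2556
print(protein_length(length))  # 851
```

Both values match the Gene Length and Protein Length rows of the record.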
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.48491 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCAGT CCGACGCCGA TCCGGGGAGC ACCGACGGCC ACGTCGATCA CCCGCTGGTG ACGCCGGGAC TACTCGAACA GCGGCGCTAC CAGCGCGAGC TGGCCGATAC GGCGCTGGCC GACCACACGC TGGTCTGTCT CCCGACCGGC CTGGGCAAGA CGACGGTCTC CTTGCTCGTG ACCGCCGAAC GGATACAGGA CGCGCAGTGG AAGTCCCTCC TGCTCGCGCC GACGAAGCCG CTCGTCCAGC AACACGCCGA GTTCTACCGG GAGGCCCTGC AGGTGCCCGA CGACGAGATC GTCGTGTTCA CCGGCGAGGT CCGGCCAGCC AAGCGGTCGG ACCTCTGGGA AGACGCCCGC GTCGTCATCG CGACGCCGCA GGTCGTCGAG AACGACCTCG TGGGCAACCG GATCTCCCTG GCGAACGTGA CCCACTGTAC CTTCGACGAG TGTCACCGCG CGACCGGCGA CTACGCCTAC AACTACATCG CCGAGCGCTA CCACGAGGAC GCATCGGATC CGCTCGTGAC GGCGATGTCG GCCTCGCCGG GCGGCGACGA GGAGGAGATC CTGACGGTGT GTGAGAACCT CGGCCTGCGT GAGGTGGCGG TGATGACCGA GGACGACGCC GACGTGGCCG AACACACCCA CGACACCGAA CTCGAGTGGA AGCGCATCGA GTTGCCCGAG ACCGTCATCG AGATACGGGA CGCGATCAAC GAGGTCGTCA GCGACCGGCT CGCACAGCTC AAAGAGCTGG GCGTCACCTC GACCACCCAG CCCGACGTGT CCGAGCGCGA GATCCAGAAG ATCCAGGGGA AGCTGTCCGA GCTGATGGAC AACGACCAGA GCGAGGGGTA CAGCGGGATG AGCCTGCTCG CCGAGGTCCG AAAGCTCCGC ACGGCCGTCA CCTACGCCGA GACCCAGAGC GTCGAGGCGC TGCGTCGGTA CTTCGAGCGC CAGAAGGAGG CCGCCCGCTC GTCGGGAGCC TCCAAGGCCG ACCAGCGACT CGTCTCGGAC CCGACGGTGA TGGAGGCCAT GCGGAAAGCC GAGAACTTCA CCGACCTCCA CCCGAAGTTC CGCCGGACCC GGATGTTGCT GGCCGAGACG CTGGGCATCG AGAACGGCGA GCGCGTCATC GTCTTCACCG AGTCCAGAGA CACCGCCGAG ACGCTGACCG ACTTCCTCTC CGATCACTTC GAGACCGAGA AGTTCGTCGG CCAGAGCGAC ACAGAGGGCA GCGAGGGGAT GACACAGACC CAGCAACAGG AGACACTCGA CCGGTTCCGA GCCGGAGAGT TCGAGGTGCT GGTCTCGACC AGCGTCGCCG AGGAGGGACT GGACGTGCCC GAAGTGGACC TCGTGTTGTT CTACGAGCCC GTCCCGACGG CGATCCGCTC GATCCAGCGC AAAGGACGGA CCGGCCGCCA GACCGAGGGT CGCGTCGTCG TCCTGCTGGC CGAGGACACC CGCGACGAGG CGTACTTCTG GAAGTCCCGA CAGGACGAGC AGCGCATGGA GGACGAACTC CGCACGCTGA AGGGCGTCGC CGGCCAGCTA GAGTCGAAAC TGGGGGGTGA GCAGACTGGC GTCGACGAGT ACGACGACGG CCAGACCGAC CGAGAGCGCG GCGAAGAACC CGGAGGAGAC GGTCTCGACG GGGACGAAGG TACCACTTCG AGTGGAAACG GCGGCGGTCC GTCCGACGCC GAGCGCCAGC CCTCGGAAGC GTCGAACACG GGCGAACGAT CCGAAGCCGA CGGCCAGACG GCCGACGGTG ACGGACAGGC CGGTCTCGAC 
GCCTTCGCCG ACGACGGCGC GACGGGAGCG GACGAAGATG AGACAGCCGG CGTTCCAGAC GAAACGAAGG GGTCGACGCC GGAGCCACAC GCCGACGGCG ACGAAACCGT CGCGATCGTC GCCGATCAGC GAGAACTCGA CTCGACGATC GCACGCGATC TCTCGACCCG CGAGGGCGTC CAGACGGAAC TGGAGACGCT GGCGGTCGGG GACTACGTGC TCTCGGATCG GGTCGTGGTC GAGCGCAAGA CCGTCAGCGA CTTCCTCGAT ACGCTGACCG GCGGCGACCG CTCGATGTTC GAGCAGGTCG GCGACGCCAC CCGTCACTAC GCCCGCCCCG TCGTCGTGAT CGAAGGCGGC GACCTCTACG GCGAACGCAA CGTCCACCAC AAGGCGATTC AGGGCGCACT GGCCTCACTC TCGGTCGACT TCGGCGCGAG CGTCCTCCAG ACGGCCGACG AAGAGGAGAC CGCGGACCTG CTGGAGACGA TCGCCCGGCG CGAACAGGAG GAGTCCGACC GCGAAGTCAG CGTCCACGGC GAGAAGCAGG CCAAGACACT CGGGGAACAG CAGGAGTACG TCGTCGCGTC GGTGGCGGAG GTCGGCCCCG TCACGGCGCG GGCGCTACTC GAACACTTCG GGAGCGTCGA GGCCGTTATG ACCGCCGACG AAGACGAGCT GATGGACGTC GACGGCGTCG GCGAGGTGAC GGCCGAGCGG TTTCGGGACG TGGTCGGCAG CGAGTTCGAG CAGTGA
|
Protein sequence | MAQSDADPGS TDGHVDHPLV TPGLLEQRRY QRELADTALA DHTLVCLPTG LGKTTVSLLV TAERIQDAQW KSLLLAPTKP LVQQHAEFYR EALQVPDDEI VVFTGEVRPA KRSDLWEDAR VVIATPQVVE NDLVGNRISL ANVTHCTFDE CHRATGDYAY NYIAERYHED ASDPLVTAMS ASPGGDEEEI LTVCENLGLR EVAVMTEDDA DVAEHTHDTE LEWKRIELPE TVIEIRDAIN EVVSDRLAQL KELGVTSTTQ PDVSEREIQK IQGKLSELMD NDQSEGYSGM SLLAEVRKLR TAVTYAETQS VEALRRYFER QKEAARSSGA SKADQRLVSD PTVMEAMRKA ENFTDLHPKF RRTRMLLAET LGIENGERVI VFTESRDTAE TLTDFLSDHF ETEKFVGQSD TEGSEGMTQT QQQETLDRFR AGEFEVLVST SVAEEGLDVP EVDLVLFYEP VPTAIRSIQR KGRTGRQTEG RVVVLLAEDT RDEAYFWKSR QDEQRMEDEL RTLKGVAGQL ESKLGGEQTG VDEYDDGQTD RERGEEPGGD GLDGDEGTTS SGNGGGPSDA ERQPSEASNT GERSEADGQT ADGDGQAGLD AFADDGATGA DEDETAGVPD ETKGSTPEPH ADGDETVAIV ADQRELDSTI ARDLSTREGV QTELETLAVG DYVLSDRVVV ERKTVSDFLD TLTGGDRSMF EQVGDATRHY ARPVVVIEGG DLYGERNVHH KAIQGALASL SVDFGASVLQ TADEEETADL LETIARREQE ESDREVSVHG EKQAKTLGEQ QEYVVASVAE VGPVTARALL EHFGSVEAVM TADEDELMDV DGVGEVTAER FRDVVGSEFE Q
|
| |
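The gene and protein sequences above can be related with a short sketch. It strips the spacing used in the listing, translates codons using the amino acid assignments of NCBI translation table 11, and computes GC content. The function names are hypothetical, and only the first 30 bases of the gene sequence are used here, so any GC value computed from that prefix is not the 67% reported for the full gene.

```python
from itertools import product

# Amino acid assignments of NCBI translation table 11, codons ordered T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Simplification: alternative start codons (e.g. GTG, TTG) are not
    converted to M; this gene starts with ATG, so that case does not arise.
    """
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


prefix = "ATGGCCCAGT CCGACGCCGA TCCGGGGAGC"  # first 30 bases of the gene sequence
print(translate(prefix))  # MAQSDADPGS
```

The translated prefix agrees with the first ten residues of the protein sequence. Passing the full gene sequence to the same functions would give the complete translation and the whole-gene GC content.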