Gene Information |
Locus tag | Huta_0101 |
Symbol | |
ID | 8382363 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 96581 |
End bp | 99541 |
Gene Length | 2961 bp |
Protein Length | 986 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644971160 |
Product | type I site-specific deoxyribonuclease, HsdR family |
Protein accession | YP_003129023 |
Protein GI | 257051190 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
|
|
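The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a stop codon; the record does not state either convention, though its numbers are consistent with both. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Huta_0101.
gene_len = gene_length_bp(96581, 99541)
print(gene_len)                     # 2961, matches "Gene Length"
print(protein_length_aa(gene_len))  # 986, matches "Protein Length"
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields of the record.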
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTAACG AACAACCATC TGAGGGCGGC CTTCAGACAT CGGTCCTCCA GTGGCTCGAC GGCCTCGGGT GGGAGACGTA TGACCCAGAC GAAGGCCACG GCGCGACGGT TCTCGACGAG CGATACGGCC GCCAACGCTC GGAGGTCATC TACTGGGACC TGCTCGCCGA GGCCGTCGTC GAGATCAACG ACGAACTGAC CGAGGCCAAC GTCGACCGCT TTCTCAACTC GCTCCGGCGC GACCTCGATC ACGACAACCT GCTCGACGGC AACGAGGCGT TCCACGAGAT CCTGACCACG GGCAAGAAAC ACACCGTCGA CCAGCAACAC AACGGCACGA AGACGATCTA CGCCGACTTG ATCGACTTCG AGCACCCCGA GAACAACCGG CTCCACGCTG TCGACGAGTT CGCCGTCTCC CGACGCGGCT CGATTCGCCC CGACGTGACC CTGCTCATCA ACGGGATCCC CATCGTCCAG ATGGAACTGA AATCCGTCAC CCAGGACAAC GACTTCTACG ACGCGATCAC CGACCTCCAG GCCTACGAGG AGAAGGTCCC ACGGGCATTC ATCCCGACGC TGTTCAACGT CGCCGCCGAC CAGAGTGTCT TCCAGTACGG AGCCGTCCGA GCCCCCCGCG AGTTCTACCA AGGGTGGACG ACCGCACCCG AGGCCTACCA GTCTGACAAC GACGTCAAGC AGGCCGTCCA GGCTTTGCTG AACCCTCAGA CGCTACTGGA CATCCTGAAG TACTTCGTCT TTTTCGAGGA GCAACCGGAC CAAGACGCGA AGATCATCCC CCGCCATATG CAGTACTACG CGGTGAAGCG GATCCTCGAC CGTGTCGAGC GCGGCGATCA CCGCAAGGGA CTGATCTGGC ACACCCAGGG CTCGGGGAAG TCGTTCACGA TGCTGTTCAC GGCGAAGAAC CTCCTCGAAC GCGACATTCT CGACGCCCCG CAACTGTTTG TCGTCGTCGA CACGGACAAA CTGAACAGCC AGATGCGCGA CCAGCTCGCC AACCTCTCCT TCGAGCGCTG GACCGAGGCC GAGAGTATCG AGGGACTCGA AGACACCATC GCGGCGGGCC GGAGCGAACT CGTCGTGACG ACCATCCAGA AGTTCCAAGA CGTCGATCCC GGCGTCCAGT CGACCGACGA GGCCGTCGTG ATGTCCGACG AAGCCCACCG GTTCTTGGAG GCCGATCTCG GGAGCAGACT CGAAGCCGCC CTTCCCGATG CGTACCACTT CGGCTTTACC GGGACGCCCG TCCGCGAAGG CGACCGCGAG AAGGACCGCA ACACGTTCGA CGAGTTTTCT CCCGAGGGCG AGGAGTACCT CCACCGCTAC TCGATCAAGG ACGGCATCGA CGACGAGCTG ATCCTCCCGG TTTTCTTCCG ACTCCGCCAC GAGATGGACT GGGACGTCGA CGAGGCCGGC CTCGACGAGG AGTTCGACGA GGCGTTCGCC ACCCTCCCCA AAGAGGAGAA GTTGGCGATC ATCCGCGATC ACGTCACCAG TCGGATGCTC GCCGAGATCG AGCCCCGCGT CGAACGTGTG GTCGCCGAGA TCGACGATCA CTTCGATGGC GTCGAGAAGA ACGGCTGGAA AGGCATGGTC GTCACCCCGA GTCGGGAGTC GGCGGCCATG TACGGGGAAC GCTTGGTCGA CCGGCGAGGC GAGGACGCCG TTGACGTTCT CTTTACCACG ACGCAAGACG ATCCGGAGTT ACTCCAGCAG TTCCATACCG ATCCCGGCGA GCGTGATCAG ATCGTCCGGG ACTTCAAAAA CGAAGACGAT 
CCCAAACTCC TCGTGGTCCA CAATATGCTA CTGACGGGCT TTGACGCCCC TGTACTGAAG ACGATGTACT TGGACCGGGA ACTCCATGAT CACACTCTCA TGCAGGCGAT CGCCCGCACG AACCGGCCCG CTGATGGCAA GGAGAATGGC GAAATCGTCG ACTTCCAGGG TGTCTTCGAG AATATCGACG ACGCCCTCGA CTACGACGAC GAGACGAAAC AGTACGCGGC CCAAGACAGC GAGCAACTGT TCGAGAAACT CCAGAACCAA CTCGACGCAG TTCTCGATAT CTTCGAGGGA ATCCCCAGGG AGGACAGCCA GGAGGTCGTC GACGAGTGCC TTGACCGAGT GAGTACCCAT CCTGAGAAGC GAGAGTTCAA GCAAGGGTTC CGACGACTCC AGGACCTCTA TGAATCGGTC TCGCCAGATC GCCGACTTGT CGAAGAAGGG ATCGATGAGG ATTACGGGTG GCTCGGGCGG ATCCACACGG CTTTCCAGCG GACGGCCAAC CGCTCGGAAC GACCTGAAGA CGAGATGCGC GAGAAGACGC GTGAAATCGT CGAGGAGCAC GTCGACATCG GCGAGATCAA GCGGGATTAT CCCGTCTACG AACTCGGCGC GGAGTACTTG GAGGACCTGG ACCACCTCAG GAGTGACGCC GCAAAGGCGT CGACGATCGC TCACGCCATC CAGGAGAGCA CGCAATCTCG AATGGGACAG AACCCCCGCT ACGAACGACT GAGCGAGCGC GTGACCGACA TCGTCGAGGA GTGGCAGGCT GGCGACAGAG CCGATCCCGA GGCCGTCGAG GCGCTCCGGG AGGTCGAAGC AGAGGTGCTT GCCATCGAGG AGGAGGCTAA CGAACGCGGG ATGTCCGATG CCGAGTTCGC TATCTTCACA GACATCACTG AGGAGCGAGA TCTCGATCTC TCTGAGGACA CCGTCGAAGC ACTTGCCCGC GACATCGTAG CCGAGTTCGA CGACCGCGTC GACACGAGTT ACGAAGGATG GGAGACGAAC GACCAGACGG TCAAGGAGAT CGAACTTGTA CTGTTGGATG TACTGGTGAA AGAACACGAC CGAGGCGAAC TGGTCACCGA CGAGTTCATC GACGCCGTCC ATACCTACCT GATTCAAAAC TATGTCGCAG ACGACGAGTA A
|
Protein sequence | MANEQPSEGG LQTSVLQWLD GLGWETYDPD EGHGATVLDE RYGRQRSEVI YWDLLAEAVV EINDELTEAN VDRFLNSLRR DLDHDNLLDG NEAFHEILTT GKKHTVDQQH NGTKTIYADL IDFEHPENNR LHAVDEFAVS RRGSIRPDVT LLINGIPIVQ MELKSVTQDN DFYDAITDLQ AYEEKVPRAF IPTLFNVAAD QSVFQYGAVR APREFYQGWT TAPEAYQSDN DVKQAVQALL NPQTLLDILK YFVFFEEQPD QDAKIIPRHM QYYAVKRILD RVERGDHRKG LIWHTQGSGK SFTMLFTAKN LLERDILDAP QLFVVVDTDK LNSQMRDQLA NLSFERWTEA ESIEGLEDTI AAGRSELVVT TIQKFQDVDP GVQSTDEAVV MSDEAHRFLE ADLGSRLEAA LPDAYHFGFT GTPVREGDRE KDRNTFDEFS PEGEEYLHRY SIKDGIDDEL ILPVFFRLRH EMDWDVDEAG LDEEFDEAFA TLPKEEKLAI IRDHVTSRML AEIEPRVERV VAEIDDHFDG VEKNGWKGMV VTPSRESAAM YGERLVDRRG EDAVDVLFTT TQDDPELLQQ FHTDPGERDQ IVRDFKNEDD PKLLVVHNML LTGFDAPVLK TMYLDRELHD HTLMQAIART NRPADGKENG EIVDFQGVFE NIDDALDYDD ETKQYAAQDS EQLFEKLQNQ LDAVLDIFEG IPREDSQEVV DECLDRVSTH PEKREFKQGF RRLQDLYESV SPDRRLVEEG IDEDYGWLGR IHTAFQRTAN RSERPEDEMR EKTREIVEEH VDIGEIKRDY PVYELGAEYL EDLDHLRSDA AKASTIAHAI QESTQSRMGQ NPRYERLSER VTDIVEEWQA GDRADPEAVE ALREVEAEVL AIEEEANERG MSDAEFAIFT DITEERDLDL SEDTVEALAR DIVAEFDDRV DTSYEGWETN DQTVKEIELV LLDVLVKEHD RGELVTDEFI DAVHTYLIQN YVADDE
|
| |
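The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the standard codon assignments, which translation table 11 shares for elongation; alternative start codons are not handled. The `translate` helper is hypothetical and written only for this example.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# assignments for elongation and differs only in permitted start codons.
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), _AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks used in the record) is ignored.
    Codons containing unrecognised characters are reported as 'X'.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence as shown in the record.
print(translate("ATGGCTAACG AACAACCATC TGAGGGCGGC"))  # MANEQPSEGG
```

The output matches the first ten residues of the protein sequence listed in the record. For real analyses an established library is preferable to a hand-rolled table.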