Gene Huta_0101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_0101 
Symbol 
ID8382363 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp96581 
End bp99541 
Gene Length2961 bp 
Protein Length986 aa 
Translation table11 
GC content61% 
IMG OID644971160 
Producttype I site-specific deoxyribonuclease, HsdR family 
Protein accessionYP_003129023 
Protein GI257051190 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTAACG AACAACCATC TGAGGGCGGC CTTCAGACAT CGGTCCTCCA GTGGCTCGAC 
GGCCTCGGGT GGGAGACGTA TGACCCAGAC GAAGGCCACG GCGCGACGGT TCTCGACGAG
CGATACGGCC GCCAACGCTC GGAGGTCATC TACTGGGACC TGCTCGCCGA GGCCGTCGTC
GAGATCAACG ACGAACTGAC CGAGGCCAAC GTCGACCGCT TTCTCAACTC GCTCCGGCGC
GACCTCGATC ACGACAACCT GCTCGACGGC AACGAGGCGT TCCACGAGAT CCTGACCACG
GGCAAGAAAC ACACCGTCGA CCAGCAACAC AACGGCACGA AGACGATCTA CGCCGACTTG
ATCGACTTCG AGCACCCCGA GAACAACCGG CTCCACGCTG TCGACGAGTT CGCCGTCTCC
CGACGCGGCT CGATTCGCCC CGACGTGACC CTGCTCATCA ACGGGATCCC CATCGTCCAG
ATGGAACTGA AATCCGTCAC CCAGGACAAC GACTTCTACG ACGCGATCAC CGACCTCCAG
GCCTACGAGG AGAAGGTCCC ACGGGCATTC ATCCCGACGC TGTTCAACGT CGCCGCCGAC
CAGAGTGTCT TCCAGTACGG AGCCGTCCGA GCCCCCCGCG AGTTCTACCA AGGGTGGACG
ACCGCACCCG AGGCCTACCA GTCTGACAAC GACGTCAAGC AGGCCGTCCA GGCTTTGCTG
AACCCTCAGA CGCTACTGGA CATCCTGAAG TACTTCGTCT TTTTCGAGGA GCAACCGGAC
CAAGACGCGA AGATCATCCC CCGCCATATG CAGTACTACG CGGTGAAGCG GATCCTCGAC
CGTGTCGAGC GCGGCGATCA CCGCAAGGGA CTGATCTGGC ACACCCAGGG CTCGGGGAAG
TCGTTCACGA TGCTGTTCAC GGCGAAGAAC CTCCTCGAAC GCGACATTCT CGACGCCCCG
CAACTGTTTG TCGTCGTCGA CACGGACAAA CTGAACAGCC AGATGCGCGA CCAGCTCGCC
AACCTCTCCT TCGAGCGCTG GACCGAGGCC GAGAGTATCG AGGGACTCGA AGACACCATC
GCGGCGGGCC GGAGCGAACT CGTCGTGACG ACCATCCAGA AGTTCCAAGA CGTCGATCCC
GGCGTCCAGT CGACCGACGA GGCCGTCGTG ATGTCCGACG AAGCCCACCG GTTCTTGGAG
GCCGATCTCG GGAGCAGACT CGAAGCCGCC CTTCCCGATG CGTACCACTT CGGCTTTACC
GGGACGCCCG TCCGCGAAGG CGACCGCGAG AAGGACCGCA ACACGTTCGA CGAGTTTTCT
CCCGAGGGCG AGGAGTACCT CCACCGCTAC TCGATCAAGG ACGGCATCGA CGACGAGCTG
ATCCTCCCGG TTTTCTTCCG ACTCCGCCAC GAGATGGACT GGGACGTCGA CGAGGCCGGC
CTCGACGAGG AGTTCGACGA GGCGTTCGCC ACCCTCCCCA AAGAGGAGAA GTTGGCGATC
ATCCGCGATC ACGTCACCAG TCGGATGCTC GCCGAGATCG AGCCCCGCGT CGAACGTGTG
GTCGCCGAGA TCGACGATCA CTTCGATGGC GTCGAGAAGA ACGGCTGGAA AGGCATGGTC
GTCACCCCGA GTCGGGAGTC GGCGGCCATG TACGGGGAAC GCTTGGTCGA CCGGCGAGGC
GAGGACGCCG TTGACGTTCT CTTTACCACG ACGCAAGACG ATCCGGAGTT ACTCCAGCAG
TTCCATACCG ATCCCGGCGA GCGTGATCAG ATCGTCCGGG ACTTCAAAAA CGAAGACGAT
CCCAAACTCC TCGTGGTCCA CAATATGCTA CTGACGGGCT TTGACGCCCC TGTACTGAAG
ACGATGTACT TGGACCGGGA ACTCCATGAT CACACTCTCA TGCAGGCGAT CGCCCGCACG
AACCGGCCCG CTGATGGCAA GGAGAATGGC GAAATCGTCG ACTTCCAGGG TGTCTTCGAG
AATATCGACG ACGCCCTCGA CTACGACGAC GAGACGAAAC AGTACGCGGC CCAAGACAGC
GAGCAACTGT TCGAGAAACT CCAGAACCAA CTCGACGCAG TTCTCGATAT CTTCGAGGGA
ATCCCCAGGG AGGACAGCCA GGAGGTCGTC GACGAGTGCC TTGACCGAGT GAGTACCCAT
CCTGAGAAGC GAGAGTTCAA GCAAGGGTTC CGACGACTCC AGGACCTCTA TGAATCGGTC
TCGCCAGATC GCCGACTTGT CGAAGAAGGG ATCGATGAGG ATTACGGGTG GCTCGGGCGG
ATCCACACGG CTTTCCAGCG GACGGCCAAC CGCTCGGAAC GACCTGAAGA CGAGATGCGC
GAGAAGACGC GTGAAATCGT CGAGGAGCAC GTCGACATCG GCGAGATCAA GCGGGATTAT
CCCGTCTACG AACTCGGCGC GGAGTACTTG GAGGACCTGG ACCACCTCAG GAGTGACGCC
GCAAAGGCGT CGACGATCGC TCACGCCATC CAGGAGAGCA CGCAATCTCG AATGGGACAG
AACCCCCGCT ACGAACGACT GAGCGAGCGC GTGACCGACA TCGTCGAGGA GTGGCAGGCT
GGCGACAGAG CCGATCCCGA GGCCGTCGAG GCGCTCCGGG AGGTCGAAGC AGAGGTGCTT
GCCATCGAGG AGGAGGCTAA CGAACGCGGG ATGTCCGATG CCGAGTTCGC TATCTTCACA
GACATCACTG AGGAGCGAGA TCTCGATCTC TCTGAGGACA CCGTCGAAGC ACTTGCCCGC
GACATCGTAG CCGAGTTCGA CGACCGCGTC GACACGAGTT ACGAAGGATG GGAGACGAAC
GACCAGACGG TCAAGGAGAT CGAACTTGTA CTGTTGGATG TACTGGTGAA AGAACACGAC
CGAGGCGAAC TGGTCACCGA CGAGTTCATC GACGCCGTCC ATACCTACCT GATTCAAAAC
TATGTCGCAG ACGACGAGTA A
 
Protein sequence
MANEQPSEGG LQTSVLQWLD GLGWETYDPD EGHGATVLDE RYGRQRSEVI YWDLLAEAVV 
EINDELTEAN VDRFLNSLRR DLDHDNLLDG NEAFHEILTT GKKHTVDQQH NGTKTIYADL
IDFEHPENNR LHAVDEFAVS RRGSIRPDVT LLINGIPIVQ MELKSVTQDN DFYDAITDLQ
AYEEKVPRAF IPTLFNVAAD QSVFQYGAVR APREFYQGWT TAPEAYQSDN DVKQAVQALL
NPQTLLDILK YFVFFEEQPD QDAKIIPRHM QYYAVKRILD RVERGDHRKG LIWHTQGSGK
SFTMLFTAKN LLERDILDAP QLFVVVDTDK LNSQMRDQLA NLSFERWTEA ESIEGLEDTI
AAGRSELVVT TIQKFQDVDP GVQSTDEAVV MSDEAHRFLE ADLGSRLEAA LPDAYHFGFT
GTPVREGDRE KDRNTFDEFS PEGEEYLHRY SIKDGIDDEL ILPVFFRLRH EMDWDVDEAG
LDEEFDEAFA TLPKEEKLAI IRDHVTSRML AEIEPRVERV VAEIDDHFDG VEKNGWKGMV
VTPSRESAAM YGERLVDRRG EDAVDVLFTT TQDDPELLQQ FHTDPGERDQ IVRDFKNEDD
PKLLVVHNML LTGFDAPVLK TMYLDRELHD HTLMQAIART NRPADGKENG EIVDFQGVFE
NIDDALDYDD ETKQYAAQDS EQLFEKLQNQ LDAVLDIFEG IPREDSQEVV DECLDRVSTH
PEKREFKQGF RRLQDLYESV SPDRRLVEEG IDEDYGWLGR IHTAFQRTAN RSERPEDEMR
EKTREIVEEH VDIGEIKRDY PVYELGAEYL EDLDHLRSDA AKASTIAHAI QESTQSRMGQ
NPRYERLSER VTDIVEEWQA GDRADPEAVE ALREVEAEVL AIEEEANERG MSDAEFAIFT
DITEERDLDL SEDTVEALAR DIVAEFDDRV DTSYEGWETN DQTVKEIELV LLDVLVKEHD
RGELVTDEFI DAVHTYLIQN YVADDE