Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_0102 |
Symbol | |
ID | 8382364 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 99534 |
End bp | 100862 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 644971161 |
Product | restriction modification system DNA specificity domain protein |
Protein accession | YP_003129024 |
Protein GI | 257051191 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGAGG AGGCGACGCT GGACGAGTTC GTAGATGAGC AGGAAGCAGG AGGAAATCAT TCTGGAGACG TTAGTGTTGG GGATTTACAG CAATTCGAAT CTTCCCCGAT TGAGTCATGG AATCTTGTTA GGCTGGGTGA GATTCTAACC TTAGAGTACG GTGATAATCT TCCATCAGAT AGTCGAGAAA GTGGAACCGT ACCTGTTTTT GGCTCTAATG GCCAGGTAGA CACGCATTCT GAGGCCGCTG TAGAGAAACC AGGCATCATA TTGGGGCGAA AGGGTTCAAT TGGTGAGATT GATTTCAGCG ATAGACCGTT TTGGCCTATC GATACGACAT ACTATATCAC AAGCGAGGAG ACGAGCCAAA ACCTGCGTTT CCTGTATTAC CTCCTTCAGA ACATCCAACT GGAACGGTTA AACGCTGCAT CTGCCATACC TGGATTAAAC CGAAATGATG CGTACGGCCT GAAAGCACTC ATGCCTCCGG CCGAAGAACA GCGCAAAATC GCCAGCGTGC TCTATACCGT CGATCAGGCG ATTCAGAAGA GCGAAGAGAT AATCGAGCAA ACTGAACGGG TCCGTCGTGG TACTGAACAA GATGTCCTTT CGAGGGGCGT TCGTGAAGAT GGGACGCTCA GGCCCGACGA CGATGTCGCA TATCGAAGCA GTTGGGTCGG CGACATTCCC TGTGACTGGG ATGTCAAACA GTACAGCAAA CTGATTTCAG ATTCCTCCGT CGGTATCGTC GTTAAGCCTT CCCAGTATTA CGACGACGAC GGAACAGTCC CGATTCTTCG CTCGAAAGAT ATCTCCAGAG ATGGCATCGT TGATGGGGAT TTCGAGTATA TGTCGGAAGA GTCGAACGCC GAAAATGAAA ACAGCCGATT GCAGGAAGGT GACGTAATAA CGGTGAGGTC GGGGGACCCC GGCCTTTCTT GCGTCGTCGA CGGTGAATTT GATGGGGCAA ACTGTGCAGA TTTACTCATT TCCACGCCGG GACCGAAATT GGACCCCCAC TACGCCGCTA TGTGGATTAA TTCCTTTGCA GGGAGAAAGC AGATCGACCG GTTTCAGGCT GGTCTGGCAC AGAAGCACTT CAACCTCGGG GCCCTCCGTA AGCTTCGAGT CGGAGTGCCA TCGCTCGATG AACAGAAGCG GATCGTCGAA AAGGTGTCAT CAATATCAGA ATCTCTCGAA AGTCAGAGAG AGTCCAAAAG GCAACTCCAG CGCCTCAAAC AGGGCCTCAT GCAAGACCTC CTCTCGGGCA AGGTCCGCAC CCACGACACA GACATCGAGA TCGTAGACGA CGCACTCCAG CATGGCTAA
|
Protein sequence | MSEEATLDEF VDEQEAGGNH SGDVSVGDLQ QFESSPIESW NLVRLGEILT LEYGDNLPSD SRESGTVPVF GSNGQVDTHS EAAVEKPGII LGRKGSIGEI DFSDRPFWPI DTTYYITSEE TSQNLRFLYY LLQNIQLERL NAASAIPGLN RNDAYGLKAL MPPAEEQRKI ASVLYTVDQA IQKSEEIIEQ TERVRRGTEQ DVLSRGVRED GTLRPDDDVA YRSSWVGDIP CDWDVKQYSK LISDSSVGIV VKPSQYYDDD GTVPILRSKD ISRDGIVDGD FEYMSEESNA ENENSRLQEG DVITVRSGDP GLSCVVDGEF DGANCADLLI STPGPKLDPH YAAMWINSFA GRKQIDRFQA GLAQKHFNLG ALRKLRVGVP SLDEQKRIVE KVSSISESLE SQRESKRQLQ RLKQGLMQDL LSGKVRTHDT DIEIVDDALQ HG
|
| |