Gene Information |
Locus tag | Hmuk_3295 |
Symbol | |
ID | 8409373 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013201 |
Strand | + |
Start bp | 95352 |
End bp | 96392 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 645018228 |
Product | restriction endonuclease |
Protein accession | YP_003175749 |
Protein GI | 257372975 |
COG category | [V] Defense mechanisms |
COG ID | [COG1715] Restriction endonuclease |
TIGRFAM ID | |
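The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only illustrative: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive positions and that the gene span includes the stop codon, neither of which the record states explicitly.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record for Hmuk_3295.
length = gene_length_bp(95352, 96392)
print(length)                     # 1041
print(protein_length_aa(length))  # 346
```

Under those assumptions, both results agree with the Gene Length and Protein Length fields of the record.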
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.978849 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.622246 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCGATT CGGTTGCGAG AGCTGTCGAG ACCCTGACGG CCGAGTACGC GGCCCACTCT GCCGTCGAGC CCGCCTTCGA CCTCGCGGAG TTTTTCGAAC CCGCGCTCGA CGACACCGAG CCACGCGAGA CCGTCGGACA GCACCGCAAA CGACGCTTTC GCTCGACGAT CGAACGCGGA CAGACGTTGT CCGAGCAGGA GGCCGTCTCA CTGCCCACCG AGGCGATCAG CGAACGACTC GTCGACGTGG CCAACGACCA GTCGATCGAA GCGCGCCGCC GCGCCGCGCT TCTCGACTAC TGTGACGCCC TCTTCGCGTT CGGCGCGCTG GCGTTCGAAC TCCAGGAGCG CTATCCCAGT GCGCCGGTCG AGACGGCCGT CGATCGACTC CGCGATGCCG GGGACGCCGC CGTCGCGGAC GATCTGAGCG ACGGTCTCTC GTCGATCCGG TCGGCCGCCC AGGAGCTGTA CCGCACCGCC GTGGTCGTCC GTCGCGCCGA TCAGCTGTTC GCCGCGCTGT CGGCCGTCGG CGAACCACAG CTCGGACGCG CCGAGGCCAC CCTCGGCGGG ACACTGGACG AGGCCATCGC CGAGCGCGAC ACGGACCGCA TCGACACCGT CGCCGAGCGG CTGAACGCCG CCACCGACGG CGAGTGGACC ACCGACGATC TGCTGTCCTG TTCACACCGC GAGTTCGAGG TCCTGATCGC GGACCTCTGG CGAGAGGGCG GGTTCGACGC CCGGACCACG AAGTACGTCC AGGACTACAA CATCGACGTC ATCGCCCAGG CCGATGGCAC GCGCGAACTG ATCCAGGCCA AACAGTACGA GCCGGGCAAC ACGGTCGGGG TGCGAACGGT CCAGCGGACC GCCGGACTGC TCGTCGAGTT CGACGCCGAC TCCGTCGCCG TCGTCACCAG TTCGAGCTTC ACCGAGAACG CCAGAGAGAG CGTCGAGCGG ATGAGCGAAC AGGTCCGCCT CGTCGACGGT GAGCGGCTCT GTGAGCTGCT GACCCGCTCC CAGCTCGTCC CGTCGCTGTA G
|
Protein sequence | MFDSVARAVE TLTAEYAAHS AVEPAFDLAE FFEPALDDTE PRETVGQHRK RRFRSTIERG QTLSEQEAVS LPTEAISERL VDVANDQSIE ARRRAALLDY CDALFAFGAL AFELQERYPS APVETAVDRL RDAGDAAVAD DLSDGLSSIR SAAQELYRTA VVVRRADQLF AALSAVGEPQ LGRAEATLGG TLDEAIAERD TDRIDTVAER LNAATDGEWT TDDLLSCSHR EFEVLIADLW REGGFDARTT KYVQDYNIDV IAQADGTREL IQAKQYEPGN TVGVRTVQRT AGLLVEFDAD SVAVVTSSSF TENARESVER MSEQVRLVDG ERLCELLTRS QLVPSL
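The sequences above are shown in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch of how the gene sequence could be translated and its GC fraction measured. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical and not part of any database tool. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code, differing only in permitted start codons, and it assumes the sequence starts in frame at its first base.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses these same assignments.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from the record.
print(translate("ATGTTCGATT CGGTTGCGAG AGCTGTCGAG"))  # MFDSVARAVE
```

Passing the full Gene sequence field to `translate` should reproduce the Protein sequence field once its spaces are removed, and `gc_fraction` can be compared with the GC content field in the same way.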