Gene Information |
Locus tag | Hmuk_3052 |
Symbol | |
ID | 8412605 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | - |
Start bp | 2937749 |
End bp | 2940856 |
Gene Length | 3108 bp |
Protein Length | 1035 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 645021399 |
Product | hypothetical protein |
Protein accession | YP_003178864 |
Protein GI | 257389091 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
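The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch makes explicit. It assumes 1-based inclusive coordinates and that the CDS ends with a stop codon not counted in the protein length; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a multiple of three and, by default,
    that the final codon is a stop codon that is not translated.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above (Hmuk_3052).
length = gene_length(2937749, 2940856)
print(length)                           # gene length in bp
print(expected_protein_length(length))  # protein length in aa
```

With the values in this record, the two results agree with the listed Gene Length (3108 bp) and Protein Length (1035 aa).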
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.18746 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGATC GCGGCCGCGT GCCGTTCGCA CTGATCGGCG TCCTCCTGCT CGTCAGCAGC GCGACGCTGG CGACGACGAT CGATCCGGGA TCGCTTCCGT CCGACAGCGA GACCGAGGTG GTCACTGAAC GGACGACTGC GACCGCACAG ACGGAACTGC GCGAGGCCGT GACGACCGCC AGCCGGGCGG CCGCAGCGGA CCCCGTCGTC GATCCGGCAG ACACCGCCGC CGGTCGCCTT CTCGACGAAG AGACGGCGTT TCGCGACGCG CTCCGGCTCC GGATCTACCT GCGGGCCCGC GACCGCCTCT CTCGGGTCGC GGTCCGTCGT GGCGAGGTCA CCGGCTCGGT CTCGCTGCCC TCGACGGAGA CGCGGGCCAA GCGCCGCGCC GCCATCGACC GCGTGACAAT CCAGCGGGCA GACGACGACG GCACCGCGAT CCGGGTGACC GTCGAGAACG TGACGGTCCG TACCCACCGT GGCGGGCAGG TGCGCTCCCG GACGACGATC TCTCCGACTG TCACCGTCGT GACGCCGGTG CTCGCGGCCC ACGATCGAGT CAGCACCTAC CAGCGGCGAC TCGACGCCGG CGTGACGGAG CGAGGATTGA GCCAGCGACT GACGACTCAG CTGTACGCGC TGGCGTGGTC GCGAGGCTAC CTCCAGTACG GCGAGGTGCC GATCAGCAAC GTCGTCTCGA ACCAGCACGT GGGCGTCGTC ACCAACGAGG CACTGCTGGA CCTCCAGCGC GAGACGATCG GCCACGCCGA TCCACGGGGC CGTCGCACGC TCGCGGTCGC GGCGGCACGG ACGGCGGCAC GGGATCTCAC CGTCGCCACA GGGACCGACT CGCGCGTGAC CGACGCCGTT CTGAGCGGAC CGACGAAGCC AGCGGCGAGC GACATCGAGG GACTGGAGCC ACCGCGCCGG TCGAGCCCCG ACGAGCGGCG CGAGGTCGCG GTCAACGAGA CGGCAGACAG AGCGTTCGTC GACGTGCTCG ACGACGGTGC CATCGACAGC ACGATACGGG ACGCCTACAG CGTCGAGGTC CGGACTGTCG GCCGCGTCGA GGGCGATCGC CACGTCGGCG TGTCGGCACC GCGCCCCGCC GGCTCGAACT GGACGCGCGT CGACAAGCGC CGTGAGCGAT CGATCCGCCA CCGAAACGTG TCCGTCGCGC CGCCGCCGAT CCCCGACGGC TGGCACGAGT TCGAGACCTA CGGGCGAGAG ACGGTCGTCA CGGAGCGAGC GGTGGGGGTC TGGGAGCGCC AGGTCGCGGG GCCAAACGGG AGCGTCGAGG TCCAGCGGAG GACCACCGAC AGGACGGGCA CGAGTCGACA GACCGTCACC CTCGCGGTCG TCGGTCGTCA CGACCGCACC TCGCCGGCTC CCGTTCGACC GATCCGACGG GCCCACCAGC GCGGTGCCGG CCCCCTGGAG GGACCGAACC TCGCCGACGC TCGCGAACGC GCCAGAGAGC GCCTGATCGA CAGCCAGGGC GGCCGGTCCG CGGCCCTCGA ATACGCGGTC CACAGCGGGC CGAACTCCGA CGTTCACACG ATCGAGCTAG AGGTACCCGC GAACGCGTCC GAGTGGGCGT ATCGCGATCT GATGGGGGTC CGAGAACGGG TCCGGTCCGT CGCCGTCGAG GTGCGGCAGG GACGAGTCGG GTCGTACGAA TCGAACCCGC CGGCAGCGCT CGCACGGGCC GTCGAGCGAG AGCGGACGCG GCTGATCGAC GCACCGTCCA GCTACGACGG CGCGGCGACC AAGGCGCGGA TCGCCGTCCG GGTCGCGTAC 
CTCGATCGCG TACAGGCCCG CCTCCGTGCG CGGGCAGACG ACCGGCGTGG CAGGGCCGAC GCGTTCGGCG ATCGGCTCGA CGAGGCCGGT ACGTCGATCG AGACGCTGCG CGAGGGGCTC GACGCTCGCG GGCGACCACC GTCGGACCGC CAGCCGCGGC TGGACGGTGT CGGCGGGCCG GTCGCGCTGA CGGCCGACGG CGCACCGGCC TACCTCACGC AGGCCTCGCT CAGCCACGAC GACGACCCGG CGATCGAGAA CAGCTCTCGT CCGCTGGTCG CGCGCAACGT CAACGTCTTT ACCGTACCCC ACCAGACCGT CTCGGACTCG CTGGTCGACG GTCTGTTCGG CGATCGATCG GGGGTCAGAC TCGACACGGC GGCCCGGACG TTGAACGCGA CCAACGCGAC CCTCGCCGAG GCGAACGCCG CTGCTGTGGA CGGCGAGGCG GTTCGACGGG CGGACGCGCG CCGACCCGAA CGCCACGCCG AGAACGTCTC GGCGCTGACC CGCGAACGCG ACGCCCTCCG CCGAGAAGTC GCGTCGGCGA ACGAGCACGT GATCGACGGC CAGCGGTCGG TGCTGTCCCG ACGGGCCGTC GCCGGCAGTG CCAGCGAGCG CGAGGCGATG CTTCGAGACG CGCTCGCGCC GTGGCAGACG ACCCACGATC GGGCCCTGGC GCTGGCGAAC GGCTCGGTCA GCCGTCGACT CGTCGCTCTC GCCGGCCGGC GGACCGACCT CTCGGTGGCT GCACGCGACC GGCTCGCCAT TCGACTGAAC GCCACGCGTC GCGCGGCGTT ACGGGAGCCG GGCGGGCGAC CGGACACCGA CGCCGTCGAC GCGAGCCGAT CGCGCACGCA GACGGTCGCG CGGGAGCTGG CTCGGGAGGC CGCTGCCGCC GGTGCGGAAC GGGCCACGAA GCGCGGCTAC GGCGCGGTGG TGAACGACAC GTTCGAGGCG ATGCCGTCCG GACTCCCGCT GGCACCCGTG CCCGGCTCGT GGTACGCCAC GACGAACGTC TGGCACGTCA CGGTCCGTGG CGAGTACGCC CGCTTCGGCG TGCGCGTCTC CCAGGGGCGG CCGACGACGC CCGGCGGCGA GTTCGTCTAC GCCAGGGACG GCGAGAACGT CAGCCTCGAC GTCGACGACG ACGGGTCTCC CGAGCGGATC GGCCGGTCAA CCCGCGTCGA CTTCGAGGCC ACGGCGACGG TCCTCGTCGT CGTGCCGCCG GGCAAGACCG GCGTCGGCGA CACCAACGGC GTCGCGATCG AGGAGTCGGA GGGGTGGCCC GATCCGGGGC CGGAGTGA
|
Protein sequence | MDDRGRVPFA LIGVLLLVSS ATLATTIDPG SLPSDSETEV VTERTTATAQ TELREAVTTA SRAAAADPVV DPADTAAGRL LDEETAFRDA LRLRIYLRAR DRLSRVAVRR GEVTGSVSLP STETRAKRRA AIDRVTIQRA DDDGTAIRVT VENVTVRTHR GGQVRSRTTI SPTVTVVTPV LAAHDRVSTY QRRLDAGVTE RGLSQRLTTQ LYALAWSRGY LQYGEVPISN VVSNQHVGVV TNEALLDLQR ETIGHADPRG RRTLAVAAAR TAARDLTVAT GTDSRVTDAV LSGPTKPAAS DIEGLEPPRR SSPDERREVA VNETADRAFV DVLDDGAIDS TIRDAYSVEV RTVGRVEGDR HVGVSAPRPA GSNWTRVDKR RERSIRHRNV SVAPPPIPDG WHEFETYGRE TVVTERAVGV WERQVAGPNG SVEVQRRTTD RTGTSRQTVT LAVVGRHDRT SPAPVRPIRR AHQRGAGPLE GPNLADARER ARERLIDSQG GRSAALEYAV HSGPNSDVHT IELEVPANAS EWAYRDLMGV RERVRSVAVE VRQGRVGSYE SNPPAALARA VERERTRLID APSSYDGAAT KARIAVRVAY LDRVQARLRA RADDRRGRAD AFGDRLDEAG TSIETLREGL DARGRPPSDR QPRLDGVGGP VALTADGAPA YLTQASLSHD DDPAIENSSR PLVARNVNVF TVPHQTVSDS LVDGLFGDRS GVRLDTAART LNATNATLAE ANAAAVDGEA VRRADARRPE RHAENVSALT RERDALRREV ASANEHVIDG QRSVLSRRAV AGSASEREAM LRDALAPWQT THDRALALAN GSVSRRLVAL AGRRTDLSVA ARDRLAIRLN ATRRAALREP GGRPDTDAVD ASRSRTQTVA RELAREAAAA GAERATKRGY GAVVNDTFEA MPSGLPLAPV PGSWYATTNV WHVTVRGEYA RFGVRVSQGR PTTPGGEFVY ARDGENVSLD VDDDGSPERI GRSTRVDFEA TATVLVVVPP GKTGVGDTNG VAIEESEGWP DPGPE
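The gene and protein sequences above can be cross-checked with a short script. The sketch below is an illustration only: it strips the 10-base block spacing, computes the GC fraction, and translates the coding sequence with the amino acid assignments of NCBI translation table 11 (which are the same as those of the standard table). The helper names are hypothetical, and the gene sequence is assumed to be given 5' to 3' on the coding strand, as it appears in this record.

```python
from itertools import product

# Amino acid assignments for NCBI translation table 11, listed in the
# conventional TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of translation
            break
        protein.append(aa)
    return "".join(protein)


# First and last blocks of the gene sequence listed above.
print(translate("ATGGACGATC GCGGCCGCGT G"))  # start of the protein
print(translate("GATCCGGGGC CGGAGTGA"))      # end of the protein
```

Running `translate` and `gc_fraction` on the full gene sequence would let a reader compare the result with the listed protein sequence and the reported GC content.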