Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_1485 |
Symbol | |
ID | 8411006 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | + |
Start bp | 1410632 |
End bp | 1412602 |
Gene Length | 1971 bp |
Protein Length | 656 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 645019811 |
Product | zinc finger SWIM domain protein |
Protein accession | YP_003177307 |
Protein GI | 257387534 |
COG category | [S] Function unknown |
COG ID | [COG4715] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCGCT CCGATCCCGA TTCCGAACTT GATCCCGACT CCCGCGACCT GACACCGACC CGCGATGCCA TCAGGAGCAT CTGCACCGCA CAGTCATTCC AGCGCGGCGT CGAGTACGTC GAGGACGGTC GAGTGCGCGA CCTCACCGTC ACAGGTACCG AGGCGGAGGC GACGGTCAGG GGAAGCCACG ACTACCGAAC GGCCGTCGAC CTGTCAGTCG AGGGGTTCGA CCCGCAGTGT TCGTGCCCGT ACGACCACGC CGGCGAGTGC AAGCACGTCG TCGCGGTGTT ACTGACTGTG ATTGAACAGT CGGCAGAACT GTTCGATGAA CATGCGGAAC ACGATTTTGG AGTGCCAGAT CACGAAATGT TGACATCAGG CGTCGCGGTT GACGATCGAT CGACACGGCA TGCCGTCGAA CGAGCTGTCG ACGAGGCTGA CCCGGAGACG ATGCGGGCGT TCCTGCGCGA CGTGCTCGCC GAGGACTCCG ACCGTCGAGA GCAGTTTCTG GTCGCCGTCG GGACGCCGGC CGAGAAGAGC GTCGCAGACT ATCGGCGTGA GATCGACCGG AAGTTCGAGC AGGCCACGGA TCGCCGCGGG ATCGTCGCGT ACGACACGCA ACTCGACTTC CGCCAGTACT ACGACCGCGC GGACACCTAT CGGGAGCACG GCGACCACGA GCGAGCGCTG ACGATCTATC GAGCGCTCGC CGAGGGAATC CAGCAGAACC TCGACCGGAT CGACGACAGC GGCGGCCACT ACGCGGGCCA GATCGAGCGC GCGATGGACG TAGCCGTCGA CTGCATCAAC GAGATGGAGT CCGACGCCGA GCGGCGACGC GAGTTCCTCG ACACTCTCTT CGAGCAGTAC GAGGCGACTG AATACGCCTT CGTCATGGAG TACTACGACG ACGCGCTCCG GTCGATCTGT GAGACGACCG CGGACCTCGA ACACCTCCGT TCGCTGCTGG AGCCACACCT CCCCTCGCCT GTGGCGGAAG GCGACAAACG CATGGACGAA ACGGAAACCA GTGAGGGCGA CGAGGGTGGC CTCCCGGACC CGACCCGGGA CCGCCTCGAC GCCGATCTGC TCACCGGTGG TGTGCTCGAC ATCGACCGTC TCTCCGAGAG CCCGCTCGAC GTATCCGATT TCGTCGGCGA GACGCTCGCG ACAAAGCTCG CGTCGACGGA GGCGGGGGGC GCGACTGCAG GCGACAGCCA GCACTCCCAG ACGGGACTCT CGGCCGAGGC ACAGACGGTG GTCTCGACCT ACGCTTGGGT GCTGTCGGAA CTCGACGAGA ACGAGGCACT CCGCGCTGTG CTCGAACCGG TCGCAACCGA GACCCCGACG CTCTGTTGCC GATACGTCGA GGCGCTCCTC GCGGACGGCA GCGACGAACG CGCCCCAGCG GCGCTCGAAT GCGGACTCGA TCGGTTCGGC CACTCACGGG AACTCCATCG CTTCGCGGCC GATTGCTACC GTGATCGCGA CGACGAGCGC TACCGCGACC TCCTGCAGAC GATGTTCGTC CGCTTTGCGG ACTGGGAGGC CTACGACGAA CTCGTGAGCG CCTGTCCCGA CGACGAGTGG GAGTCAGTCT TCCACGGGCT CGTCTCGCAA CTCGGACGAC TCGACGCCGA CCGGCTGATC GACCTCTACA TCCGCGAGGG GGAACGCGAG AAAGCCCTGT CGCGCGTGCT CGACGGCGAG GATCCCGAAC TCCTTCGTCA GTACCAGACA GACCTCGCCG ATCTTGATCC CGAGGCGTAC TTCGAGACCT ACCGAGAGGT ACTGGCGTCG CATCTGGCCG ACGACACCGG GCGAGACCAC TACCGGGCGG TGATCGGCCA CCTCCGTGAG ATGTCACAGC TGGCCTTCGA CGAGGAACTG GCCGCGTTCG TCGCCCGGCT GCGAGAGAGC CACTCGAACC GCCCGGCGCT GCTGGACGAA CTCGACGACG CCAGATTCTA G
|
Protein sequence | MTRSDPDSEL DPDSRDLTPT RDAIRSICTA QSFQRGVEYV EDGRVRDLTV TGTEAEATVR GSHDYRTAVD LSVEGFDPQC SCPYDHAGEC KHVVAVLLTV IEQSAELFDE HAEHDFGVPD HEMLTSGVAV DDRSTRHAVE RAVDEADPET MRAFLRDVLA EDSDRREQFL VAVGTPAEKS VADYRREIDR KFEQATDRRG IVAYDTQLDF RQYYDRADTY REHGDHERAL TIYRALAEGI QQNLDRIDDS GGHYAGQIER AMDVAVDCIN EMESDAERRR EFLDTLFEQY EATEYAFVME YYDDALRSIC ETTADLEHLR SLLEPHLPSP VAEGDKRMDE TETSEGDEGG LPDPTRDRLD ADLLTGGVLD IDRLSESPLD VSDFVGETLA TKLASTEAGG ATAGDSQHSQ TGLSAEAQTV VSTYAWVLSE LDENEALRAV LEPVATETPT LCCRYVEALL ADGSDERAPA ALECGLDRFG HSRELHRFAA DCYRDRDDER YRDLLQTMFV RFADWEAYDE LVSACPDDEW ESVFHGLVSQ LGRLDADRLI DLYIREGERE KALSRVLDGE DPELLRQYQT DLADLDPEAY FETYREVLAS HLADDTGRDH YRAVIGHLRE MSQLAFDEEL AAFVARLRES HSNRPALLDE LDDARF
|
| |