Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hmuk_3410 |
Symbol | |
ID | 8409488 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013201 |
Strand | + |
Start bp | 212841 |
End bp | 214676 |
Gene Length | 1836 bp |
Protein Length | 611 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 645018331 |
Product | hypothetical protein |
Protein accession | YP_003175852 |
Protein GI | 257373078 |
COG category | [S] Function unknown |
COG ID | [COG4289] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.36223 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.559533 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAAATC CAGTCACGGA GAACCCACTG GAGACGCGGT CGGACGTACA GCGAGCGGTG CGACAGCTCG TCGCACCCCT GGAAGCACAC CACAGTCCGG GCGGGGCACT CGTTCGCGTC GGCGCGGCCG GCGCGTCGTT CCCGAACCAC GAAGCGGGGA TCGAGGGGTA CGCTCGCTCG CTGTGGGGCC TCGCCACACT CGCGGCCGGC GGCGGCGAGT TCGACGGGTG GGAACGGTAT CGGGCGGGAC TGGTCAACGG AACGGACCCG GATCACCCTG AGTACTGGGG CACGATCGCC GACGGGCGAC AGAAGGTCGT CGAGGGCTCC TGTCTGGGAA TGGCGCTCGG GCTCGTCCCC GAGCGGCTCT GGGAGCCACT CGACGAGGCG GAACGCGAGC GAGTACAGAC CTGGCTGACG GCGGCCAACG GCGAGTGGAT CCCGGACAAC AACTGGCGCT TTTTCCGGGT GCTGGCCAAC GTCGGGCTGT CGTCGGTCGG TGCCTCGTTC GACCGCCAGC AGGTCGCGAC CGACCTGGAC CGTCTGGAAA CGTTCGCGCT GGAGAACGGC TGGTACGCCG ACGGTCCGGA CGCCCCGGTC GACTACTACT GTGCGTGGAC GATGCACGTC GAGGGGCTCC TCTACGCCTG GCTGGCCGAG GACGATGCCG AGCGAGCGGC CCGGTTCAGG CGGCGGGCGG TCGAGTTCGC AAGCGAGTAC CGCCACCTGT TCGAGCCCAG CGGGCGCGCG GTCCCCTACG GCCGGAGTCT CACCTACCGG TTCGCCCAGG CCGCGTTCTG GGGCGCGCTC GCGCTCGTCG GTCGAGACGC CGACCTCCCG TGGGGCGAGA TCAGGGGGCT GTGGCTCCGG AATCTCCGGT GGTGGTTCGA CCAGCCGATC TTCGCCGCCG ACGGGACGCT GACCGTCGGC TACCGCTACC CCTCACAGAA GATGACCGAG CGGTACAACT CCCCGTCCTC GCCGTACTGG GCGTTTCGGG CGTTCCTGCC CCTGATCGCC GACGCGGACC ACCCCTTCTG GACGGCCGAG GAACGGCCGC TGCCCGAACT CGACCGCCAG CGGCCGATCG AGGCGGCGGA CCTGCTCGTC CGGCGCGGGC CGGACCACGT CGTCGCGTTC ACCGGCGACA CCGCCACGCC CCGCTACCGG AACAAGTACG ACAAGTTCGC CTACTCCAGT CACTTCGGGT TCGGCGTCGA CGACGGGATC GCCGGACTCG ACGCCTGTGG AATCGACAGC ACGCTCGTGG TCAGTACGGA CGGCGAGCAC TTCCGCGGAC GGACGGCGGA CCTCGACGGC AGCGTCGACG ACGGCGTCGC GGCGTCGACG TGGGACCCCT TCGACGACGT GACAGTCAGG ACGAAGGTGC TCCCGGTCGG ACCGTGGCAC GTCCGCATCC ACCAGCTCGA AGCCGCGCGA GCGATCGAGA CCGCGGAGGG CGGCTTCGCG CTGCCGACGA CCGAGGAACG GTACGCCGAC GACGTACGAG AGACGACCGA GGGACAGATG GCGGCGGTGT CGTTCGAGGA CTTCAGCGGT CTCCGGGCCG TCGGCGGCGG GAACCGCTGT GAGCCAGCGG TGACCACACC GGTCCCGAAC ACGAACGTCC AGCATCCCCG TACCGCGGTC CCGGTCCTCT CTCGCTCCTT CGAGCCCGGC AGTTACCGGT TCGCGTCGGC CGTCCTCGGC GTGCCGGGGA CAGCCAGCCC GACGGCCTGG ACCGAACCGC CGACCGTCGA GTGGACCTGG TCGGGGGTCA CCGTCGGCGA CGGCGAGGGA GAGACGGTAC AGACAGTCAG TCTCGAACAG AAGTAG
|
Protein sequence | MGNPVTENPL ETRSDVQRAV RQLVAPLEAH HSPGGALVRV GAAGASFPNH EAGIEGYARS LWGLATLAAG GGEFDGWERY RAGLVNGTDP DHPEYWGTIA DGRQKVVEGS CLGMALGLVP ERLWEPLDEA ERERVQTWLT AANGEWIPDN NWRFFRVLAN VGLSSVGASF DRQQVATDLD RLETFALENG WYADGPDAPV DYYCAWTMHV EGLLYAWLAE DDAERAARFR RRAVEFASEY RHLFEPSGRA VPYGRSLTYR FAQAAFWGAL ALVGRDADLP WGEIRGLWLR NLRWWFDQPI FAADGTLTVG YRYPSQKMTE RYNSPSSPYW AFRAFLPLIA DADHPFWTAE ERPLPELDRQ RPIEAADLLV RRGPDHVVAF TGDTATPRYR NKYDKFAYSS HFGFGVDDGI AGLDACGIDS TLVVSTDGEH FRGRTADLDG SVDDGVAAST WDPFDDVTVR TKVLPVGPWH VRIHQLEAAR AIETAEGGFA LPTTEERYAD DVRETTEGQM AAVSFEDFSG LRAVGGGNRC EPAVTTPVPN TNVQHPRTAV PVLSRSFEPG SYRFASAVLG VPGTASPTAW TEPPTVEWTW SGVTVGDGEG ETVQTVSLEQ K
|
| |