Gene Information |
Locus tag | Hmuk_0251 |
Symbol | |
ID | 8409749 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halomicrobium mukohataei DSM 12286 |
Kingdom | Archaea |
Replicon accession | NC_013202 |
Strand | + |
Start bp | 247698 |
End bp | 248924 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 645018576 |
Product | PBS lyase HEAT domain protein repeat-containing protein |
Protein accession | YP_003176095 |
Protein GI | 257386322 |
COG category | [C] Energy production and conversion |
COG ID | [COG1413] FOG: HEAT repeat |
TIGRFAM ID | |
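The listed gene and protein lengths follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon (the record does not state either convention; `gene_length` and `protein_length` are illustrative names):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(247698, 248924)
print(length, protein_length(length))  # 1227 408
```

Under these assumptions the result agrees with the Gene Length and Protein Length fields above.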
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.169018 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.0104715 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCTCT ACCAGCTGGA ACGAGACGGC GAGGTACAGG AGATCATCCG GACGCTCCGG GAGTCGGACA ACCCGAAGGT CAGAGCGCGG GCGGCGGAGC TACTCGGGAA CTTCCCGAAC CACGACGACC GGCGAGACGT AGTCAACGCC CTCGTCGAGG CGGCCCAGGG CGAGGACAGT CGGATCGCCG CGACGGCCGT CGACTCGCTG GACGAGCTGG GCGGAGACGC GATCGAGCAG CTGATCGCCG ACATGGCCGG CGTCGACTTC GGCGACGACG GAGCCGAGTG GGTCCGCGCG AAGGCGTTCA CGCAGGCCCT CGACGCGGAC GTGCCCGAAC TCAGGATGGC GGCGGCCAAC GGCCTCGGCC AGCTCGAACA GGCAGATACG GTCGGTCCGC TGTCGAACCG CTTCGACGAC GACGACCCGC GCGTTCGGGC GCGGGCCGCG CGGGCCTGTG GGAAGATCGG CGATCCACGG GCGGTCGGTC CGCTCGAATC CCTGCTCCGG GATCCGAAGG CGGCCGTCCG CAGGGAGGCC GCCGACGCGC TGGGGTCGAT CGGGAACCGA CAGGCCCTAC AGGCGCTGCT CCCCCTGTAC GAGGACGACA ACGAGCGCGT CCGACGGATC GCCGTCGGAG CCTTCGGCAA CTTCGGCAAC GACCGGCCGG TCGACTACCT CGTCGAGTCG CTCACCGACG AGTCCTCCGG CGTCCGCCAG ACCGCCGTCT ACTCGCTGAT CGAACTGCTC TCGAACGTCC CGACAGAGCA GAGCCACGAG ATACGGGACA CCGTCGTCGA GCGACTCTCC TCGACCGACG ACCGCAGCGT GGTCGTGCCG CTGGTCGAGA TCCTCGAAGA GAGCACACAG AACGCCCAGC GGCGCAACAC CGCGTGGCTG CTGGGCCGGG TCACCGGCGA GCAAGAGCGC GTCCGCGTCA TCGAGTCGCT GATCGACGCG CTACACGAGG ACGATCAGAT GCTCCGGCAG TTCGCCGCGA CAAGCCTGGC CGAGATCGAC GGCGACGACG TGGAGCGGCG GCTCCTGTCG GTCGTCGATG ACGAGGCAGT CGACCCCGAT GTTCGCGCAC AGGCGATCTT CACGCTCGGG AAGGTCGGGA GCGAGCGCTC GCGCAAGACC CTGGACCGAA TCATCGATCA GACCGAGAAC GAGACGATCC GCAAGCGAGC GTTCTCGGCG ATCTCCAAGC TCGGCGGCCG ACGATGA
|
Protein sequence | MSLYQLERDG EVQEIIRTLR ESDNPKVRAR AAELLGNFPN HDDRRDVVNA LVEAAQGEDS RIAATAVDSL DELGGDAIEQ LIADMAGVDF GDDGAEWVRA KAFTQALDAD VPELRMAAAN GLGQLEQADT VGPLSNRFDD DDPRVRARAA RACGKIGDPR AVGPLESLLR DPKAAVRREA ADALGSIGNR QALQALLPLY EDDNERVRRI AVGAFGNFGN DRPVDYLVES LTDESSGVRQ TAVYSLIELL SNVPTEQSHE IRDTVVERLS STDDRSVVVP LVEILEESTQ NAQRRNTAWL LGRVTGEQER VRVIESLIDA LHEDDQMLRQ FAATSLAEID GDDVERRLLS VVDDEAVDPD VRAQAIFTLG KVGSERSRKT LDRIIDQTEN ETIRKRAFSA ISKLGGRR
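The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes the sequence is pasted as a string, with the spaces between 10-base blocks stripped by the code; `CODON_TABLE`, `translate`, and `gc_content` are illustrative names, not part of the record. Translation table 11 assigns codons to amino acids in the same way as the standard code (it differs in the permitted start codons), so the standard assignment is used here.

```python
from itertools import product

# Standard codon assignment in NCBI order (bases T, C, A, G); table 11 uses
# the same codon-to-amino-acid mapping and differs only in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for 10-base blocks and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGAGCCTCT ACCAGCTGGA ACGAGACGGC"))  # MSLYQLERDG
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_content` gives values to compare with the Protein sequence and GC content fields.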