Gene Hmuk_0251 details


Gene Information

Locus tag: Hmuk_0251
Symbol:
ID: 8409749
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 247698
End bp: 248924
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 69%
IMG OID: 645018576
Product: PBS lyase HEAT domain protein repeat-containing protein
Protein accession: YP_003176095
Protein GI: 257386322
COG category: [C] Energy production and conversion
COG ID: [COG1413] FOG: HEAT repeat
TIGRFAM ID:
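
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(247698, 248924)
print(length)                  # 1227
print(protein_length(length))  # 408
```

Both results agree with the Gene Length and Protein Length fields of this record.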


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.169018
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.0104715
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCTCT ACCAGCTGGA ACGAGACGGC GAGGTACAGG AGATCATCCG GACGCTCCGG 
GAGTCGGACA ACCCGAAGGT CAGAGCGCGG GCGGCGGAGC TACTCGGGAA CTTCCCGAAC
CACGACGACC GGCGAGACGT AGTCAACGCC CTCGTCGAGG CGGCCCAGGG CGAGGACAGT
CGGATCGCCG CGACGGCCGT CGACTCGCTG GACGAGCTGG GCGGAGACGC GATCGAGCAG
CTGATCGCCG ACATGGCCGG CGTCGACTTC GGCGACGACG GAGCCGAGTG GGTCCGCGCG
AAGGCGTTCA CGCAGGCCCT CGACGCGGAC GTGCCCGAAC TCAGGATGGC GGCGGCCAAC
GGCCTCGGCC AGCTCGAACA GGCAGATACG GTCGGTCCGC TGTCGAACCG CTTCGACGAC
GACGACCCGC GCGTTCGGGC GCGGGCCGCG CGGGCCTGTG GGAAGATCGG CGATCCACGG
GCGGTCGGTC CGCTCGAATC CCTGCTCCGG GATCCGAAGG CGGCCGTCCG CAGGGAGGCC
GCCGACGCGC TGGGGTCGAT CGGGAACCGA CAGGCCCTAC AGGCGCTGCT CCCCCTGTAC
GAGGACGACA ACGAGCGCGT CCGACGGATC GCCGTCGGAG CCTTCGGCAA CTTCGGCAAC
GACCGGCCGG TCGACTACCT CGTCGAGTCG CTCACCGACG AGTCCTCCGG CGTCCGCCAG
ACCGCCGTCT ACTCGCTGAT CGAACTGCTC TCGAACGTCC CGACAGAGCA GAGCCACGAG
ATACGGGACA CCGTCGTCGA GCGACTCTCC TCGACCGACG ACCGCAGCGT GGTCGTGCCG
CTGGTCGAGA TCCTCGAAGA GAGCACACAG AACGCCCAGC GGCGCAACAC CGCGTGGCTG
CTGGGCCGGG TCACCGGCGA GCAAGAGCGC GTCCGCGTCA TCGAGTCGCT GATCGACGCG
CTACACGAGG ACGATCAGAT GCTCCGGCAG TTCGCCGCGA CAAGCCTGGC CGAGATCGAC
GGCGACGACG TGGAGCGGCG GCTCCTGTCG GTCGTCGATG ACGAGGCAGT CGACCCCGAT
GTTCGCGCAC AGGCGATCTT CACGCTCGGG AAGGTCGGGA GCGAGCGCTC GCGCAAGACC
CTGGACCGAA TCATCGATCA GACCGAGAAC GAGACGATCC GCAAGCGAGC GTTCTCGGCG
ATCTCCAAGC TCGGCGGCCG ACGATGA
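
The gene sequence above is printed in space-separated blocks of ten bases, so it needs to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field; the helper names are hypothetical, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAGCCTCT ACCAGCTGGA ACGAGACGGC GAGGTACAGG AGATCATCCG GACGCTCCGG"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(f"{gc_fraction(seq):.1%}")  # GC fraction of this line only
```

Passing the full 1227 bp sequence through the same two functions gives a value that can be compared with the reported GC content.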
 
Protein sequence
MSLYQLERDG EVQEIIRTLR ESDNPKVRAR AAELLGNFPN HDDRRDVVNA LVEAAQGEDS 
RIAATAVDSL DELGGDAIEQ LIADMAGVDF GDDGAEWVRA KAFTQALDAD VPELRMAAAN
GLGQLEQADT VGPLSNRFDD DDPRVRARAA RACGKIGDPR AVGPLESLLR DPKAAVRREA
ADALGSIGNR QALQALLPLY EDDNERVRRI AVGAFGNFGN DRPVDYLVES LTDESSGVRQ
TAVYSLIELL SNVPTEQSHE IRDTVVERLS STDDRSVVVP LVEILEESTQ NAQRRNTAWL
LGRVTGEQER VRVIESLIDA LHEDDQMLRQ FAATSLAEID GDDVERRLLS VVDDEAVDPD
VRAQAIFTLG KVGSERSRKT LDRIIDQTEN ETIRKRAFSA ISKLGGRR