Gene Hmuk_3049 details

Gene Information

Locus tag: Hmuk_3049
Symbol:
ID: 8412602
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 2935159
End bp: 2936328
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 66%
IMG OID: 645021396
Product: phosphoesterase RecJ domain protein
Protein accession: YP_003178861
Protein GI: 257389088
COG category: [R] General function prediction only
COG ID: [COG0618] Exopolyphosphatase-related proteins
TIGRFAM ID:
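
The coordinates and lengths listed above are mutually consistent, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2935159
end_bp = 2936328
listed_gene_length = 1170      # bp
listed_protein_length = 389    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed to be
# the stop codon, which does not contribute a residue to the protein.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, codon_count, protein_length)
```

Under these assumptions the span works out to 1170 bp, that is 390 codons, of which 389 encode amino acids.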


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.1164
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGATCT GGCAAGTGGG TGGCCTCCAG CGAGTCGTCG ATAGCGTCAC CGTGTTCGCG 
CGTGAATCGC CGCTGGTGGC CGCGCTCGTG GTCGTCGGGA TCCTCGTCTT TCTCGTCGGG
GTTCGCCTCG CGATCGACAG ACTCCGCCGG TCGCCGGCCG AACGCTTCCA GCGACTGCTG
GCGTCGACCG ACGAGGTCGC CGTGTTGATG CATCCGAATC CCGATCCGGA CGCGATGTCG
AGTGCGCTGG CCGTCGACAG GCTGGCGACG CAGGCCGGTT CGTCGCCGAC GCTGTACTAC
CCCGGACAGA TTCGCCATCA GGAAAACCGC GCGTTTCAGA CCGTTCTGGA TCTGGACTTC
GATCGCATCG AGAAGGCCGG ACAGCTACAA GAGAGCGAGG TCGTCCTGGT CGATCACAAC
GAGGCGCGTG GGTTCCCCGG CGCGGAGAGC ATCGATCCGA TCGCCGTGAT CGACCACCAT
CCCGGCGGCG GCGAGGGGTC GGAGCTGTCC GACGTTCGCA CCGGCTACGG TGCCTGTGCG
ACGATCTTCG CCGAGTACTT CGAGACTCTC GACTGGGAAC TGGCCGACGG CGACGCGACG
GCCGACGACA ACCAGATCGA CCAACAGGTC GCGACCGGGC TGCTATACGG CATCCAGTCA
GACACGAAAC AGCTCACGAA GGGGTGTTCG TCCGCGGAGT TCTCTGCGGC CGAGTACCTC
TACGACGGGA TCGACGAAGA CCTGCTCGAC AGAATCGCGA ACCCACAGGT CGACGCCGAG
GTCCTGGACG TGAAAGCCCG TGCGATCACC GACCGCCAGA TCAAGAACGC CTTCGCGATC
AGCGACGTGG GCGCGGTCTC GAACGTGGAC GCGATTCCAC AGGCTGCCGA CGAACTGCTC
CGACTGGAGG GCGTGACGGC GGTCGTCGTG ATGGGGCGCA AAGAGGACAC GCTGCACCTC
TCCGGGCGCT CGCGCGACGA CCGCGTCCAC ATGGGTAACG TCCTCCAGAC GGTCGTCGAC
GACATTCCGA TGGGGTCGGC GGGCGGCCAC GCCCGGATGG GCGGGGGCCA GCTCTCGATC
GATCACATGA ACGGGATCGG ACCGGGAAGC GGCGTCGCGA TGACCGACTT CAAGGGGCAC
CTGTTCGACG CGATGGCCGG CGACATCTGA
 
Protein sequence
MPIWQVGGLQ RVVDSVTVFA RESPLVAALV VVGILVFLVG VRLAIDRLRR SPAERFQRLL 
ASTDEVAVLM HPNPDPDAMS SALAVDRLAT QAGSSPTLYY PGQIRHQENR AFQTVLDLDF
DRIEKAGQLQ ESEVVLVDHN EARGFPGAES IDPIAVIDHH PGGGEGSELS DVRTGYGACA
TIFAEYFETL DWELADGDAT ADDNQIDQQV ATGLLYGIQS DTKQLTKGCS SAEFSAAEYL
YDGIDEDLLD RIANPQVDAE VLDVKARAIT DRQIKNAFAI SDVGAVSNVD AIPQAADELL
RLEGVTAVVV MGRKEDTLHL SGRSRDDRVH MGNVLQTVVD DIPMGSAGGH ARMGGGQLSI
DHMNGIGPGS GVAMTDFKGH LFDAMAGDI
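
The relationship between the gene sequence and the protein sequence can be spot-checked with a short sketch like the one below. It is an illustration under stated assumptions, not the method used to produce this record: the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, the codon table is the standard amino-acid assignment (translation table 11 uses the same assignments and differs only in which codons may act as start codons), and alternative start codons are not handled. Only the first and last rows of the gene sequence are used here; the same functions accept the full sequence.

```python
# Standard codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last rows of the gene sequence as displayed above.
first_row = "ATGCCGATCT GGCAAGTGGG TGGCCTCCAG"
last_row = "CTGTTCGACG CGATGGCCGG CGACATCTGA"

print(translate(first_row))  # MPIWQVGGLQ
print(translate(last_row))   # LFDAMAGDI
```

The two printed fragments match the first ten and last nine residues of the protein sequence, and the last row ends in the stop codon TGA. Passing the complete gene sequence to `gc_percent` gives a value that can be compared with the GC content listed in the Gene Information table.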