Gene Hmuk_0318 details

Gene Information

Locus tag: Hmuk_0318
Symbol:
ID: 8409816
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 306747
End bp: 309767
Gene Length: 3021 bp
Protein Length: 1006 aa
Translation table: 11
GC content: 66%
IMG OID: 645018643
Product: excinuclease ABC, A subunit
Protein accession: YP_003176162
Protein GI: 257386389
COG category: [L] Replication, recombination and repair
COG ID: [COG0178] Excinuclease ATPase subunit
TIGRFAM ID: [TIGR00630] excinuclease ABC, A subunit
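
The start, end, gene length, and protein length values in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a single terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1   # inclusive range
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Coordinates taken from the record above.
print(cds_lengths(306747, 309767))  # (3021, 1006)
```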


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACACGAG AGTACATCGA GGTCCGCGGT GCGGAGGAAC ACAACCTCAA GGACATCGAT 
GTTGAGATCC CGCGAGAGGA GTTGACGGTC GTGACGGGGC TGTCGGGATC GGGCAAGTCC
TCGCTGGCCT TCGAGACGGT GTACGCTGAG GGACAGCGCC GCTACATCGA GTCGCTGTCG
GCCTACGCCA GAAACTTCCT GGGCCAGATG GACAAGCCAC AGGTCGAGAC CGTCGAGGGG
CTGAGCCCGG CGATCTCGAT CGACCAGAAG AACGCCGCCA ACAACCCCCG ATCGACGGTC
GGGACCGTCA CGGAACTCCA CGACTACCTG CGCCTGCTGT ACGCCCGAAT CGGGACGCCA
CACTGTCCGG AGTGTGGCCG CGAGGTCGGC GAGCAGTCCG CCCAGAACAT GGTCCGTCGG
ATGCTGGCCC TGCCCGAGGG GACCCGCGCG AAGATCGCCG CACCGGTCGT ACGCGACCAG
AAGGGTGCCT TCGAGGAGCT GTTCGAGGAG CTGGTGAGCG AGGGGTACGC CCGCGTCGAG
GTCGACGGCG AACCGTACGA CCTGACGATC GACGACCCCG ATCTCGACGA GAACTACGAC
CACACGATCG ACGTGATCGT CGACCGCGTG AGCGTCAGCG CGGACGCCCG CTCGCGGATC
ACCGACTCCG TCGAGACGGC CCTCGAAGAA GGGGCGGGCA TCACGAAGGT GATCCTCCCG
GACCCGCCGG AAGGCGTGTC CCTCGGCGGG GCGACGGCGC GCTCGACGGG CGATCTCGCG
GGGACCGGCG ACAGCGACGA CCACAGCGAC CGCGAGCGGC TCGTCGTCGA GTTCTCCGAG
GAGCTCGCGT GTACCCACTG CGGGATCGAC ATCTCCGAGA TCGAGACCCG CTCGTTCTCG
TTCAACAGCC CCCACGGGGC CTGTCCGGAG TGTGAGGGCC TCGGCGAGAC CAAGGAGGTC
AGCGAGGAGC TCGTCATCCA GGACCGTTCG AAGCCCCTCA AGCACGTCTT CGAACCCTGG
AGCTACGACC GCACCTACTA CTCGCGACAG CTGGACAACG TCGCGGACCA CTTCGGCGTC
TCGCTGGAGA CGCCCTTCGA GGAGCTGGAC GAGTCGATCC AGCACCAGTT CCTCTATGGC
ACGGACGACC TCGTTCACTT CGAGTGGCAG ACCAAGAACG GCACCCGCGA GAAGACCGAG
CGCTTCGAGG GCGTCGTCCC GAACCTGGAA CGTCGCCACG TCGAGACCGA CAGCGACCGC
GCCCGCGAAC ACATCGAGGA GTTCATGGCC GTCACCACCT GTCCCGAGTG TGAGGGCACC
CGCCTGAAAG CGGAGTCCCG TCACGTCCTC GTCGACGGGA GTTCCGCGAG CGATGCGAGC
GGAACGTCGT CGGAGGTTTC CTCCGACGGT ACGTCGATCA CCGAGGTCAA CCGGATGTCG
ATCGGCGACG CACTGGAACA CTTCGAGGGG CTGGAAGCGT CGCTGGACGA ACGCGACACG
AAGATCGCCG AGGAGATCCT CAAGGAGATC CGCGCCCGCC TCGGGTTCAT GGAGGAGGTC
GGGCTGGAGT ATCTCACCCT CGACCGGGAG GCGTCGACGC TGTCCGGCGG CGAGAGCCAG
CGCATCCGGC TGGCGACCCA GATCGGCTCC GGACTGGTCG GCGTGTTGTA CGTGCTCGAC
GAGCCCTCGA TCGGACTCCA CCAGCGGGAC AACGACCGGC TGTTGAACAC GCTGGCTGAG
CTGCGCGACC TCGGAAACAC GCTCCTCGTC GTCGAGCACG ACGAGGAGAC GATGCGCCGG
GCCGACAACA TCATCGACAT GGGTCCCGGC CCCGGCAAAC GGGGCGGCGA AGTCGTCGTC
AACGGCGACA TGGACGAGGT GATCGACTGC GAGGACAGCG TCACCGGCGA GTACCTCTCG
GGCGAGCGGA CGATCCCGGT CCCCGACGAC CGCCGAGCGT CGGACACCCA CCTCACGATC
GAGGGGGCAC GCCAGCACAA CCTCGCAGAT CTGGACGTGG ACCTTCCGAT CGGCTGTTTC
ACCGCGATCA CGGGCGTCTC TGGGTCGGGC AAGTCGACGC TGATGCACGA CGTGCTGTAC
AAGGGCCTCG TCCGGCGGAT GAACGACACC GACGTGAACC CCGGCGAGCA CGACGGCATC
GAGGGGATCG ACGAGATCGA GACCGTCCGC CTCATCGACC AGTCGCCCAT CGGCCGCACG
CCGCGCTCGA ACCCGGCGAC CTACACGAAC GTCTTCGATC ACATCCGCGA GCTGTTCGCC
GAGACGAACC TCTCGAAGCA GCGCGGCTAC GAGAAGGGAC GCTTCTCGTT CAACGTCAAG
GGCGGTCGCT GCGAGGCCTG TGGCGGCCAG GGCAACGTCA AAATCGAGAT GAACTTCCTC
TCTGACGTGT ACGTCCCCTG CGAGGAGTGC GACGGGGCGC GGTACAACGA CGAGACGCTG
GATGTCACCT ACAAGGACGC CACCATCTCC GACGTGCTGG ACATGAGCGT CGCAGAGGCC
TACGACTTCT TCGAGGGCCA CACCGGTATC CGCCGCCGCC TGAAGCTCCT GAAAGACGTG
GGTCTGGACT ACATGCGGCT GGGCCAGCCC TCGACGACGC TCTCCGGCGG AGAGGCCCAG
CGGATCAAGC TCGCCGAGGA ACTGGGGAAG AAAGACTCCG GCGAGACGCT GTACCTGCTC
GACGAGCCCA CGACCGGGCT CCACCCCGAG GACGAGCGCA AGCTGATCGA CGTGCTCCAC
CGCCTGACCG ACGAGGGCAA CACAGTCGTC GTCATCGAGC ACGAACTCGA TCTGGTGAAG
AACGCCGACC ACATCGTCGA CCTCGGACCC GAGGGCGGCG AGCACGGCGG GACCATCGTC
GCCGAAGGCA CGCCCGAGGA CGTGGCCCGA AACGAGGAGT CCTACACGGG GCAGTACCTC
CGCGATCTGC TGCCGAAAGT GGATCTGGAG GGGCCACGCG CCGACCGCGA CGTGGCACAG
CCAGCGGCGG ACGACGACTG A
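
The GC content reported for this gene can in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is pasted as a string in the same 10-base blocks; the function name `gc_fraction` is hypothetical, and the example input is only the first two blocks of the sequence, not the whole gene.

```python
def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string, ignoring whitespace."""
    seq = "".join(dna.split()).upper()  # remove block spacing and newlines
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two 10-base blocks of the gene sequence (illustration only).
print(gc_fraction("ATGACACGAG AGTACATCGA"))  # 0.45
```

Applying the same function to the full 3021 bp sequence should give a value comparable to the GC content listed in the gene information.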
 
Protein sequence
MTREYIEVRG AEEHNLKDID VEIPREELTV VTGLSGSGKS SLAFETVYAE GQRRYIESLS 
AYARNFLGQM DKPQVETVEG LSPAISIDQK NAANNPRSTV GTVTELHDYL RLLYARIGTP
HCPECGREVG EQSAQNMVRR MLALPEGTRA KIAAPVVRDQ KGAFEELFEE LVSEGYARVE
VDGEPYDLTI DDPDLDENYD HTIDVIVDRV SVSADARSRI TDSVETALEE GAGITKVILP
DPPEGVSLGG ATARSTGDLA GTGDSDDHSD RERLVVEFSE ELACTHCGID ISEIETRSFS
FNSPHGACPE CEGLGETKEV SEELVIQDRS KPLKHVFEPW SYDRTYYSRQ LDNVADHFGV
SLETPFEELD ESIQHQFLYG TDDLVHFEWQ TKNGTREKTE RFEGVVPNLE RRHVETDSDR
AREHIEEFMA VTTCPECEGT RLKAESRHVL VDGSSASDAS GTSSEVSSDG TSITEVNRMS
IGDALEHFEG LEASLDERDT KIAEEILKEI RARLGFMEEV GLEYLTLDRE ASTLSGGESQ
RIRLATQIGS GLVGVLYVLD EPSIGLHQRD NDRLLNTLAE LRDLGNTLLV VEHDEETMRR
ADNIIDMGPG PGKRGGEVVV NGDMDEVIDC EDSVTGEYLS GERTIPVPDD RRASDTHLTI
EGARQHNLAD LDVDLPIGCF TAITGVSGSG KSTLMHDVLY KGLVRRMNDT DVNPGEHDGI
EGIDEIETVR LIDQSPIGRT PRSNPATYTN VFDHIRELFA ETNLSKQRGY EKGRFSFNVK
GGRCEACGGQ GNVKIEMNFL SDVYVPCEEC DGARYNDETL DVTYKDATIS DVLDMSVAEA
YDFFEGHTGI RRRLKLLKDV GLDYMRLGQP STTLSGGEAQ RIKLAEELGK KDSGETLYLL
DEPTTGLHPE DERKLIDVLH RLTDEGNTVV VIEHELDLVK NADHIVDLGP EGGEHGGTIV
AEGTPEDVAR NEESYTGQYL RDLLPKVDLE GPRADRDVAQ PAADDD
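
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch, assuming the amino acid assignments of translation table 11 match the standard genetic code (the tables differ in their permitted start codons, which this sketch does not handle; this gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table built in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove block spacing and newlines
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGACACGAG AGTACATCGA GGTCCGCGGT"))  # MTREYIEVRG
# Last line of the gene sequence, including the TGA stop codon.
print(translate("CCAGCGGCGG ACGACGACTG A"))  # PAADDD
```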