Gene Hmuk_3119 details


Gene Information

Locus tag: Hmuk_3119
Symbol:
ID: 8412672
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 2997104
End bp: 2998285
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 68%
IMG OID: 645021466
Product: von Willebrand factor type A
Protein accession: YP_003178931
Protein GI: 257389158
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1240] Mg-chelatase subunit ChlD
TIGRFAM ID:
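
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length = gene_length(2997104, 2998285)
print(length, protein_length(length))  # prints: 1182 393
```

Both results agree with the Gene Length and Protein Length fields of the record.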


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCATCTA TCGAGACGAG CGTCAATCGG CCGAACGTAC CGGCCGACGG CACGACCGTG 
ACCGCCGAGA TCGACGTCGA GCCGGGAGAA CAGGAGACGG ACGTGCGACG CCACATCGCG
CTCTGTATCG ACACGAGCGG GTCGATGGAG GGTGACAACA TCAAACGCGC TCGCGACGGC
GCTGCGTGGG TCTTCGGGCT GTTGGCCGAC GAGGACTACG TGAGTATCGT CGCGTTCGAC
ACCGAGGCGA CGGTGATCCT GCCCGCGACA CGGTGGTCGG ATCTCGACCG CCAGACGGCG
ATGGACCACG TCGAGGAGCT GACTGCCGGC GGCGGCACCG ACATGTACAA CGGGCTCAAG
GCCGCCAAGG AGACGCTGTC GTCCTCCGCG ACCGGGCCCG ACACGGTCAA GCGACTCCTC
TTGCTCTCGG ACGGCAAGGA CAACGAACGC ACGCCCGACG AGTTCGAGGG GCTGGCCGAA
GCCATCGACG ACGCCGGGAT CCGGATCCAG TCGGCCGGGA TCGGGACCGA CTACAACGAG
GCCACGATCC GGACGCTCGG GACGGCCGGG CGCGGGACGT GGACCCACCT CGAAGCGCCC
GGCGACATCG AGGACTTCTT CGGCGAGGCC GTCGAGCAGG CCGGCTCCGT CGTCGCGCCG
GACGCCCACC TCGACCTCGA CGTGGCCCCC GGCGTCGAGG TCAGCGAGGT GTATCGCGCG
CTCCCGCAGG CCCAGGAAGT CTCGCCCGAG TGGGAGGCAA ACGCCACCCG GGTCAAGCTC
CCCGACCTGA TCGAACGGGA GAGCCAGCGG GTCGTCCTCA AGATCCACGC GCCGCCCCGC
GAGCCCGGCA GCGAGGAGGT GCTCGCGGAC GTACAGCTCT CGGCCCGCGG CGACACCGCC
AGCGACCAGA TCGGCGTCGA GTACACGGAC GAACAGGAGA AGCTGGCCGA GCACAACGAG
TCCGTCGACA TCGACCACAA ACAGACCGTC ATCCGGACGG AGCTCGGCAA GGGCAACGTC
GAGGCCGCGG AGACGAAAGT CGAGCAGATG ACAGTGATCC ACGGCGAGGA CGCCGAGGCG
GTCCAGGAGG CCGAGCGCCA GACCGAGATC GTCAAAGAGG GCGGTCGTGC CGAACAGAGC
CAGGCGACCC AGATCGTCGA CAGCGACGAC GGCATCCAGT GA
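
A GC content figure such as the one reported for this gene can be recomputed from the sequence text. This is a minimal sketch, assuming the sequence is pasted as a string in which spaces and line breaks separate 10-base blocks; `gc_content` is a hypothetical helper name.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters of seq.

    Whitespace and any other characters are ignored, so the blocked
    layout used in the record can be passed in unchanged.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return sum(b in "GC" for b in bases) / len(bases)


# First 10-base block of the gene sequence, used only as a small example.
print(gc_content("ATGGCATCTA"))  # prints: 0.4
```

The value for a short fragment will generally differ from the figure for the whole gene; passing the full sequence text gives the gene-wide fraction.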
 
Protein sequence
MASIETSVNR PNVPADGTTV TAEIDVEPGE QETDVRRHIA LCIDTSGSME GDNIKRARDG 
AAWVFGLLAD EDYVSIVAFD TEATVILPAT RWSDLDRQTA MDHVEELTAG GGTDMYNGLK
AAKETLSSSA TGPDTVKRLL LLSDGKDNER TPDEFEGLAE AIDDAGIRIQ SAGIGTDYNE
ATIRTLGTAG RGTWTHLEAP GDIEDFFGEA VEQAGSVVAP DAHLDLDVAP GVEVSEVYRA
LPQAQEVSPE WEANATRVKL PDLIERESQR VVLKIHAPPR EPGSEEVLAD VQLSARGDTA
SDQIGVEYTD EQEKLAEHNE SVDIDHKQTV IRTELGKGNV EAAETKVEQM TVIHGEDAEA
VQEAERQTEI VKEGGRAEQS QATQIVDSDD GIQ
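
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is one way to reproduce that, assuming the codon assignments of NCBI translation table 11, which are the same as those of the standard code (the tables differ in their permitted start codons). The `translate` function is illustrative and not the tool used to build this record.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.upper().split())  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two 10-base blocks and the last twelve bases of the gene sequence.
print(translate("ATGGCATCTA TCGAGACGAG"))  # prints: MASIET
print(translate("GGCATCCAGT GA"))          # prints: GIQ
```

The outputs match the first six and last three residues of the protein sequence above.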