Gene Hmuk_1914 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_1914 
Symbol 
ID8411441 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp1824082 
End bp1825245 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content69% 
IMG OID645020244 
Productphosphoribosylaminoimidazole carboxylase, ATPase subunit 
Protein accessionYP_003177734 
Protein GI257387961 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) 
TIGRFAM ID[TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATCT CGACACCAGG CGTGACGCTG GGCGTCGTCG GCGGCGGTCA GCTCGGACGG 
ATGCTGGCGG AGGCCGCCGC GCCGCTGGGC GTCGATCTCG TCGTGACCGA TCCGACCGAG
GACCCGCCGG CCGGCCCGGT CGCGAGCGAC GCGCTCGTGG GCGATTTCGA CGAGTTCTCG
ACGATCCGGG CAGTCGCAGA GCGCGCCGAC TACCTCACCT TCGAGATCGA GCTGACCGAC
CCCGACGCCC TGGAGCAGGT CGCCGAGGAG ACCGGCGTTC CGGTACACCC GAAGCCGGAC
ACGCTCCGAC TCATCCAGGA CAAGCTCGTC CAGAAACGCC GTCTCGGCGA CGCTGGCGTC
CCGGTCCCGG CCTTCCGGCA GGTCGACGAC GAAGACGATC TCCTGGCGGC GGGCGAGGAA
CTGGGCTATC CCCTGATGCT CAAGGCCCGC GAAGGCGGCT ACGACGGGCG GGGCAACTAC
CCGGTCGAGT CGCCCGACGA CGTGACCGAC GCCCTCGACG CGATCCAGGG GCCGGCGATG
GCCGAGGAGA TGATCGACTT CGAGCGAGAG CTGGCCGTGA TGGGGTGTCT GGGAGCCGAC
GAGCGAGACA CGTTCCCGGT CACCGAGACG ATCCACCGCG AGGAGATCCT CCGAGAGACG
GTCTCCCCTC CGCGAGCCGA CGAGGACGTT CGAGAGCGCG CCCGCTCGAT CGCGCTCGAC
GTACTCGACG CGATGGAGGG GCGGGGCGTC TACGGGATCG AACTGTTCGA GACGAGCGAC
GGCGAGATCC TGCTCAACGA GATCGCGCCC CGCCCCCACA ACTCGGGCCA CTGGACCATC
GAGGGGTGTC ACACCTCGCA GTTCGAACAG CACGTTCGGG CCGTCACCGG GCGACCGCTC
GGGACGACCG AGCGCCGTGG ACCGACGGTT TCGGCAAACG TGCTGGGCGA CGTACCGGAC
CGCCAGCGTG CGACACTCTC CGGCGAAGAC GCGGTCTTCG AGACGCCTCG GGCGCACCTC
CACTGGTACG GAAAGCGAGA GGTGTACCGG CTCCGGAAGA TGGGGCACGT GACCCTCGTC
GGCGACGGCG AGACGATGGA CGAACTACTC GCGGACGTTC GAGCACTGCG AGAGCGACTG
ACCTTCCAGT CCCGCTCGCA GTGA
 
Protein sequence
MTISTPGVTL GVVGGGQLGR MLAEAAAPLG VDLVVTDPTE DPPAGPVASD ALVGDFDEFS 
TIRAVAERAD YLTFEIELTD PDALEQVAEE TGVPVHPKPD TLRLIQDKLV QKRRLGDAGV
PVPAFRQVDD EDDLLAAGEE LGYPLMLKAR EGGYDGRGNY PVESPDDVTD ALDAIQGPAM
AEEMIDFERE LAVMGCLGAD ERDTFPVTET IHREEILRET VSPPRADEDV RERARSIALD
VLDAMEGRGV YGIELFETSD GEILLNEIAP RPHNSGHWTI EGCHTSQFEQ HVRAVTGRPL
GTTERRGPTV SANVLGDVPD RQRATLSGED AVFETPRAHL HWYGKREVYR LRKMGHVTLV
GDGETMDELL ADVRALRERL TFQSRSQ