Gene Hlac_0872 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0872 
Symbol 
ID7401242 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp860477 
End bp861898 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content68% 
IMG OID643707937 
Productisopropylmalate isomerase large subunit 
Protein accessionYP_002565540 
Protein GI222479303 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0065] 3-isopropylmalate dehydratase large subunit 
TIGRFAM ID[TIGR00170] 3-isopropylmalate dehydratase, large subunit 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAGG GGACCCTGTA CGACAAGGTG TGGGACCGGC ACACGGTGAC GAAGCTGCCC 
ACCGGACAGG ACCAGCTGTT CGTCGGGCTC CACCTCGTTC ACGAGGTCAC CAGCCCGCAG
GCGTTCGGCA TGCTGAAAGA GCGCGACCAA GAGGTGGCGT TCCCGGAGCG CACGCACGCG
ACCGTCGACC ACATTGTGCC GACTGGGAAC CGCGATCGGC CCTACCGCGA CGAGGCCGCC
GAGAACATGA TGGCGGAGCT GGAGGCGAAC GTCCGCGGCT CGGGCATCGA CTTTTCCGAT
CCGGACTCCG GCAACCAGGG GATCGTCCAC GTTATCGGGC CGGAGCAGGG GCTCACCCAG
CCGGGAATGA CGATCGTCTG TGGCGACTCG CACACGTCGA CGCACGGCGC GTTCGGCGCG
CTGGCGTTCG GCATCGGCAC CTCGCAGATC CGCGACGTGC TCGCGACGGG CTGTATCGCC
ATGGAGAAAC AGCAGGTCCG CAAGATCGAG GTCACGGGCG AGCTCGGCGA GGGCGTCACC
GCGAAGGACG TCATCCTGAC GATCATCGGG AAGCTCGGGA CCGACGGCGG CGTCGGCTAC
GTCTACGAGT ACGCCGGCGA GGCCATCGAG GACCTCGGGA TGGAAGGGCG GATGTCCATC
TGTAACATGT CGATCGAAGG CGGCGCCCGC GCGGGATACG TCAACCCCGA CGAGACCACC
TACGAGTGGC TCGCGGAGAC GGACGCCTTC GCCGACGACC CCGAGAAGTT CGAGCGGCTG
AAACCCTACT GGGAGTCGAT CCGGAGCGAC GCCGACGCCG AGTACGACGA CGTGGTCACC
ATCGACGGCT CGGCGATCGA ACCGACCGTC ACGTGGGGGA CCACGCCCGG TCAGACCGCG
GGCATCACCG AGCCGATCCC GGATCCCGAC GACCTGCCCG AGGAGGACCG CGACACCGCG
AAGCGGGCAC AGAAACACAT GCGCGTCGAG CCCGGCGACA CGATGGAGGG GTACGACATC
GACGTGGCGT TCCTCGGCTC GTGTACTAAC GCGCGGCTGA AGGACCTCCG CGAGGCCGCG
GCGTTCGTCG AGGGTCGCGA GGTCGACGAC GACGTGCGCG CGATGGTCGT CCCCGGTAGC
CAGCGCGTCC GCGACGCCGC CGAGGCCGAA GGGATAGACG AGATATTCAT CGAGGCCGGC
TTCGACTGGC GCGAGCCCGG CTGTTCGATG TGTCTCGGCA TGAACGACGA CCAGCTGGTG
GGCGACGAGG CGAGCGCCTC CTCGTCGAAC CGGAACTTCG TCGGCCGACA GGGCTCGAAG
GACGGGCGCA CCGTGCTGAT GAGTCCGATC ATGGTCGCGG CCGCGGCGGT GACCGGCGAG
GTCACCGACG TCCGCGAGAT GGAGGAGGTG GCGACCGTAT GA
 
Protein sequence
MSEGTLYDKV WDRHTVTKLP TGQDQLFVGL HLVHEVTSPQ AFGMLKERDQ EVAFPERTHA 
TVDHIVPTGN RDRPYRDEAA ENMMAELEAN VRGSGIDFSD PDSGNQGIVH VIGPEQGLTQ
PGMTIVCGDS HTSTHGAFGA LAFGIGTSQI RDVLATGCIA MEKQQVRKIE VTGELGEGVT
AKDVILTIIG KLGTDGGVGY VYEYAGEAIE DLGMEGRMSI CNMSIEGGAR AGYVNPDETT
YEWLAETDAF ADDPEKFERL KPYWESIRSD ADAEYDDVVT IDGSAIEPTV TWGTTPGQTA
GITEPIPDPD DLPEEDRDTA KRAQKHMRVE PGDTMEGYDI DVAFLGSCTN ARLKDLREAA
AFVEGREVDD DVRAMVVPGS QRVRDAAEAE GIDEIFIEAG FDWREPGCSM CLGMNDDQLV
GDEASASSSN RNFVGRQGSK DGRTVLMSPI MVAAAAVTGE VTDVREMEEV ATV