Gene Hlac_1102 details

Gene Information

Locus tag: Hlac_1102
Symbol:
ID: 7400174
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 1107304
End bp: 1108617
Gene Length: 1314 bp
Protein Length: 437 aa
Translation table: 11
GC content: 67%
IMG OID: 643708168
Product: 3-isopropylmalate dehydratase
Protein accession: YP_002565767
Protein GI: 222479530
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0065] 3-isopropylmalate dehydratase large subunit
TIGRFAM ID: [TIGR01343] homoaconitate hydratase family protein
            [TIGR02086] 3-isopropylmalate dehydratase, large subunit
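
The length fields above are consistent with the listed coordinates. A minimal Python sketch of that arithmetic follows. It assumes the start and end positions are inclusive and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Sketch only: assumes inclusive coordinates and one terminal stop codon.
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end positions are both included."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Number of residues encoded by a CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(1107304, 1108617)
print(length)                     # 1314
print(protein_length_aa(length))  # 437
```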


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGACTGGAA AAACAATCTC GGAAAAGCTG CTGTCGGAGA AATCGGGACG CGACGTACGG 
GCCGGCGACT ACGTGGAAGC CGAGGTCGAC GTGATGATGA CCCACGACGT GACGGGACCA
CTGACCTTCG AGGTCTTTGA GGACGTGACG GGCGACGACC CCGAACTCGT CGATCCGGAC
AACACCGTCA TCACGATCGA CCACCACGCG CCGGCCGACG GCGTCGAGGC CGCGAACAAC
CACAACATCG TCCGGGAGTT CGCCGACACG TACGGCGCCC AGCAGTACGA CGTGGGCGAC
GGCATCTGTC ACCAGGTGCT CGTCGAGGAG GGGTTCGTCT CCCCGGGCGA CCTGGTGATC
GGTGCGGACT CGCACTCGAC CACCTTCGGC GGCGTCGGCG GCTTCGGCAC CGGCGTCGGC
TCCACCGACC TCGGGACGAC GCTCGCGACC GGCGAGCTGT GGTTCCGGGT GCCGGAGACG
CTCCGGTTCG AGGTCGAAGG CGACCTCCCC GACGGCGTGT ACGCGAAGGA CCTCATCCTG
AAGTTCATCG GCGACGTGGG GTTCGACGGC TGCACGTACA AGACCGCCGA GTACGGCGGC
TCGGCGGTCG AGGCGCTGCC GATCCACGAG CGGCTCGTCC TCTCGAACAT GGCGATCGAG
ATGGGCGGAA AGGCCGGCAT CGTCCCGCCC GATGAGCGGA CCCTCGACTT CCTTGAGGCA
CAGACCGGGG AGCGCCCCGA GATACCGGCG TACACCCAGC CGGACGACGA CGCCGACTAC
GAGGCGGTCC ACACCTATCA GGCCGAGTCG CTCTCGCCGC AGGTGTCGAC GCCGTCGAAC
CCGGAGAACG CGGTCCCCGT CGACGAGGTC GTCGGCACGG AGATCGACCA GCTGTTCGTC
GGGACGTGTA CGAACGGACG GTACGAGGAC ATCCGGATCG TCGCGGACAT CATCGAGGGC
GAGCAGCTCG CCCCGGACAC GCGGATGGTC GTCGTCCCGG CCTCGCGGTC GGTGTACAAA
CAGATGATGA ACACCGGCGT CATGAAAACG TTCGTCGACG CGGGCGCGAT CGTTCAGAGC
GCCGGCTGTG GTTCCTGCTT CGGGACCCAC CAAGGCGTGC TGGGCGACGG CGACGTCTGT
CTCGCCACGG CGAACCGCAA CTTCCCGGGC CGAGAGGGGT CGATGAAAAG CGAGGTGTAT
CTCGCCAGCC CCGCCACGGT GGGCGCGTCG GCGCTGTACG GCGAGATCAC CGACCCGCGG
GAGGTCGAAC TGAATCGCTA CGATGACTAC GTCCTCGAGG GGGTGGGCGC GTGA
 
Protein sequence
MTGKTISEKL LSEKSGRDVR AGDYVEAEVD VMMTHDVTGP LTFEVFEDVT GDDPELVDPD 
NTVITIDHHA PADGVEAANN HNIVREFADT YGAQQYDVGD GICHQVLVEE GFVSPGDLVI
GADSHSTTFG GVGGFGTGVG STDLGTTLAT GELWFRVPET LRFEVEGDLP DGVYAKDLIL
KFIGDVGFDG CTYKTAEYGG SAVEALPIHE RLVLSNMAIE MGGKAGIVPP DERTLDFLEA
QTGERPEIPA YTQPDDDADY EAVHTYQAES LSPQVSTPSN PENAVPVDEV VGTEIDQLFV
GTCTNGRYED IRIVADIIEG EQLAPDTRMV VVPASRSVYK QMMNTGVMKT FVDAGAIVQS
AGCGSCFGTH QGVLGDGDVC LATANRNFPG REGSMKSEVY LASPATVGAS ALYGEITDPR
EVELNRYDDY VLEGVGA
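
The protein sequence can be checked against the gene sequence by translating it codon by codon, and the GC content by counting G and C bases. The Python sketch below is a hedged illustration and not the database's own method. It assumes the space-separated 10-base blocks are purely cosmetic, and it uses the standard codon assignments, which translation table 11 shares (table 11 differs in the start codons it permits). All function names are hypothetical.

```python
# Sketch only: helper names are illustrative, not part of any database API.
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the whitespace between sequence blocks and uppercase the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence listed above.
print(translate("ATGACTGGAA AAACAATCTC GGAAAAGCTG"))  # MTGKTISEKL
```

Passing the full gene sequence to `translate` should reproduce the protein sequence above with the spaces removed, and `gc_fraction` on the same input can be compared with the listed GC content.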