Gene Dret_1458 details

Gene Information

Locus tag: Dret_1458
Symbol:
ID: 8419287
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 1686621
End bp: 1687880
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 62%
IMG OID: 645038033
Product: 3-isopropylmalate dehydratase large subunit
Protein accession: YP_003198323
Protein GI: 258405581
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0065] 3-isopropylmalate dehydratase large subunit
TIGRFAM ID: [TIGR00139] homoaconitase
            [TIGR01343] homoaconitate hydratase family protein
            [TIGR02083] 3-isopropylmalate dehydratase, large subunit
            [TIGR02086] 3-isopropylmalate dehydratase, large subunit
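
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 1686621
end_bp = 1687880

# An inclusive interval contains end - start + 1 positions.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases each.
codons, leftover = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and yields no residue.
protein_length = codons - 1

print(gene_length, codons, leftover, protein_length)
```

Under those assumptions the computed values agree with the "Gene Length" (1260 bp) and "Protein Length" (419 aa) fields.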


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.888399
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.260054
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCGCCACA CTGTAGCGGA GAAGATTTTA CAGCAACACA CACAGGATAC GGTCAGCGAG 
CCCGGACAGA TCGTTCGCTG CGAGATTTCC CTGGCCCTGG CGAACGATAT CACCGCGCCC
CTGTCCATCC GGTCTCTGGA GAAAATGGGC GCGACCCGGG TCTTCGACAA AGACCGGGTC
GCCCTGGTCT GTGACCATTT TACGCCCAAC AAGGATATCG ATTCCGCCGA GCAGGTCCGC
ATCGTTCGCG AATTTGCGCG CAAGATGGGC ATTACCCATT ATTACGAAGG CGGTGAGGTC
GGGGTGGAAC ACGCCCTGTT GCCGGAACTC GGCCTGGTCG GTCCCGGGGA TATCGTTGTT
GGTGCGGACA GTCATACGTG CACCTACGGC GGTCTCGGTG CCTTTGCCAC CGGCCTCGGC
AGTACCGACG TCGCCGGGGC AATGGCCCTG GGAGAAACCT GGTTCAAGGT CCCGCCCACC
ATCCGGGTCG ATGTCAACGG CGACCTGGGG CGCTTTGTCG GCGGCAAGGA TATCATTTTG
CATTTGATTG GCACCATTGG AGTGGACGGA GCGTTGTACA AGGCCCTGGA ATTCGGCGGC
GGCACTGTCA GGGATCTCGA TCTCGAGGCC CGGTTGACCA TCGCTAATAT GGCCATTGAG
GCCGGCGGCA AGGTCGGCCT GTTCCCGGCG GATCAGACAA CGCTGGATTA CCTCCAAGCC
CACGGGCGCA GCGGCGATAC CTTTCTGGAA GCCGACGAGG GGGCAGGATA TGAGCGCCGG
GTGGCCATTG ACGCTGCCTC CCTGCAACCC CAGGTCGCCT GCCCCCATTT GCCGGAAAAT
GTCCATCCCG TGCGCGATGT AGGGGATATT CCCCTGGACC AGGTGGTCAT CGGTTCCTGC
ACGAACGGCC GGATCAGCGA TTTGCGCGAG GCCGCGGAAC TGCTCAAGGG CAAAAAGGTG
GCCAAGGGGC TGCGTCTGAT CATTCTTCCA GCTACCCCGG GCATTTATCG CCAAGCTCTG
GAGGAGGGAC TGATGACCAC GTTCATGGAA GCCGGGGCCA TTGTCGGCCC GCCCACCTGC
GGTCCCTGCC TTGGTGGGCA CATGGGGATT TTGGCCCAGG GCGAGCGGGC TCTGGCGACA
ACGAACCGCA ATTTCCGAGG ACGGATGGGC AGTCTGGAAA GCGAAGTCTA CCTGAGCGGG
CCGTCGGTGG CGGCAGCCAG TGCGGTGACT GGACGGATCA GCCATCCCGA AGATCTCTAG
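
The "GC content" field can be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is defined as the fraction of G and C bases over all bases; `gc_content` is a hypothetical helper name, and the spaces that group the sequence into blocks of ten are stripped before counting.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    # Remove the whitespace used to group the sequence into 10-base blocks.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# Example on the first line of the gene sequence only; paste the full
# sequence to compare against the "GC content" field of the record.
first_line = "ATGCGCCACA CTGTAGCGGA GAAGATTTTA CAGCAACACA CACAGGATAC GGTCAGCGAG"
print(f"{gc_content(first_line):.1%}")
```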
 
Protein sequence
MRHTVAEKIL QQHTQDTVSE PGQIVRCEIS LALANDITAP LSIRSLEKMG ATRVFDKDRV 
ALVCDHFTPN KDIDSAEQVR IVREFARKMG ITHYYEGGEV GVEHALLPEL GLVGPGDIVV
GADSHTCTYG GLGAFATGLG STDVAGAMAL GETWFKVPPT IRVDVNGDLG RFVGGKDIIL
HLIGTIGVDG ALYKALEFGG GTVRDLDLEA RLTIANMAIE AGGKVGLFPA DQTTLDYLQA
HGRSGDTFLE ADEGAGYERR VAIDAASLQP QVACPHLPEN VHPVRDVGDI PLDQVVIGSC
TNGRISDLRE AAELLKGKKV AKGLRLIILP ATPGIYRQAL EEGLMTTFME AGAIVGPPTC
GPCLGGHMGI LAQGERALAT TNRNFRGRMG SLESEVYLSG PSVAAASAVT GRISHPEDL
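
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below is one way to reproduce that translation without external libraries. It relies on the fact that table 11 assigns the same amino acid to each codon as the standard code (the two differ in which start codons are permitted); `translate` and `CODON_TABLE` are illustrative names, and alternative start codons are not handled because this gene begins with ATG.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G,
# with the first codon position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the whitespace used to group the sequence into 10-base blocks.
    bases = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First ten codons of the gene sequence.
print(translate("ATGCGCCACA CTGTAGCGGA GAAGATTTTA"))  # MRHTVAEKIL
# Last four codons, ending in the TAG stop codon.
print(translate("GAAGATCTCTAG"))  # EDL
```

Both fragments match the corresponding ends of the protein sequence; passing the full gene sequence to `translate` allows the whole protein to be compared in the same way.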