Gene Dgeo_1033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDgeo_1033 
Symbol 
ID4057993 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDeinococcus geothermalis DSM 11300 
KingdomBacteria 
Replicon accessionNC_008025 
Strand
Start bp1104077 
End bp1105132 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content68% 
IMG OID641230050 
Product3-isopropylmalate dehydrogenase 
Protein accessionYP_604501 
Protein GI94985137 
COG category[C] Energy production and conversion
[E] Amino acid transport and metabolism 
COG ID[COG0473] Isocitrate/isopropylmalate dehydrogenase 
TIGRFAM ID[TIGR00169] 3-isopropylmalate dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.895977 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00144137 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCTAAAG TCGTCACCCT ACCCGGCGAT GGGATCGGCC CCGAAGTCAC CGCTGCCGCC 
GCCGAAGTGC TGCGCGAGGT CGCGCCCGAC GTCCACATTG AGGAACACGC CATCGGTGGC
GCGGCCTACG AACAGTTCGG GGATCCCTTC CCGCAGCGGA CCCGTGACGC CCTAGGGGAC
GCGGACGCGG TGCTGCTGGG GACCGTGGGG GGCGCGCAGA ACAGCCCCTG GAACAGCCTT
CCGCGTCCCT TGCGCCCGGA AAGCGGCCTG CTGGCGCTGC GCCGGGCGCT GGGCTGTTAC
GCCAACCTGC GGCCCGTGCG GGTGCTGCCG GGTCTGGAAC ACCTCTCGCC GCTCAAGCCC
GAACTGGCGC GCGGCGTGGA CATCCTGATT GTGCGTGAGC TGCTGGGCGG CATCTACTTC
GACGGCGACC GCAAGATCGA GGGGGACACG GCTTACAACA CCATGCGCTA CACCACGCCC
GAGGTCGAGC GCGTGGCAAG GGTGGCCTTT TGGGCCGCCG AGCAGCGCCG GGGTCGCGTG
ACGAGCGTGG ACAAGGCCAA CGTGCTGGAG GTGTCTGAGC TGTGGCGCCG CGACGTACAG
GCCCTGCGCG ACCGCGAGTA CCGCAACGTC CACCTCAACC ATGAGTACGT TGATTCGGTC
GCCATGCTGA TTGTTGCCAA TCCCAGCCGC TACGACGTGA TTCTCACCGA GAACCTCTTC
GGGGACATTC TCTCCGACCT GGCCGCTGTG ATTCCTGGTT CGCTGGGCTT GATGCCGAGT
GCCTCGCTGG GCGACGGCCC CGGTCTCTTT GAGCCGATCC ACGGCAGCGC CCCCGACATT
GCCGGGCAGG GCATCGCCAA CCCCGCCGCC GCGATCATGA GCGTAGCGAT GCTGCTGCGC
CACGGCCTCG AGCGTCCCCA GGTGGCCAAC CAGGTCGAGC GGGCGGTGGC CTTGGCCCTG
CGCGAGCATC CCACCCGTGA CCTGGGTGGG CAGGCCGATA CGCGGACCTT CACACACGCT
GTGCTGGACG CAATGGGGAG CCCGAGTGTG GGATAA
 
Protein sequence
MPKVVTLPGD GIGPEVTAAA AEVLREVAPD VHIEEHAIGG AAYEQFGDPF PQRTRDALGD 
ADAVLLGTVG GAQNSPWNSL PRPLRPESGL LALRRALGCY ANLRPVRVLP GLEHLSPLKP
ELARGVDILI VRELLGGIYF DGDRKIEGDT AYNTMRYTTP EVERVARVAF WAAEQRRGRV
TSVDKANVLE VSELWRRDVQ ALRDREYRNV HLNHEYVDSV AMLIVANPSR YDVILTENLF
GDILSDLAAV IPGSLGLMPS ASLGDGPGLF EPIHGSAPDI AGQGIANPAA AIMSVAMLLR
HGLERPQVAN QVERAVALAL REHPTRDLGG QADTRTFTHA VLDAMGSPSV G