Gene Dgeo_0599 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDgeo_0599 
Symbol 
ID4058049 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDeinococcus geothermalis DSM 11300 
KingdomBacteria 
Replicon accessionNC_008025 
Strand
Start bp639140 
End bp640690 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content66% 
IMG OID641229613 
Product2-isopropylmalate synthase 
Protein accessionYP_604070 
Protein GI94984706 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR00973] 2-isopropylmalate synthase, bacterial type 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.541423 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCAGC CCCAAGCCCA GGCCCAGCGC ATCCGCATCT TCGACACCAC CCTGCGTGAC 
GGCGAGCAGT CGCCGGGTGT GGCCCTGAAT CACACCCAAA AGCTGGAGAT CGCGCACCAG
CTCGCTCGGC TGGGCGTCGA CGTGATCGAG GCGGGCTTTC CCATCGCCTC TCCCGGCGAC
CTGGAAGGCG TCTCGCGCAT CGCCCGCGAG GTCCGCGGCC CCATCATCGC TGGGCTGGCT
CGCGCGGGCC GCGCCGACAT CGAGGCGGCA GCCAGGGCGG TTGAGCTGGC GGAAAAGCCC
CGCATCCACA CCTTCATCGC CACCAGCCCC ATTCACATGG CCAAGAAACT GCAACTCGAA
CCGGACGCGG TGATCGAGCG AGCAGTGGAG GCGGTGCGGC TGGCACGGTC CTTTGTGGAC
GACGTGGAAT TCAGCGCAGA GGACGCCACC CGCAGCGACC GTGACTTCCT GGTGCGCATT
TTCCGCGCTG CGGTGGAGGC GGGTGCGACC ACAATCAACG TGCCCGATAC GGTGGGCTAC
ACCACACCGG AAGAGATCCG CGACCTGTTC GCCTACCTGC GCGGCGAGCT GCCGGACCAC
ATTATTCTCT CGGCCCACTG TCACGATGAC CTGGGGATGG CTGTGGCCAA CTCCATCGCC
GCGGCGGAAG GCGGCGCGCG ACAGATTGAG TGCACTGTCA ACGGCATTGG CGAGCGCGCT
GGGAATGCCA GCCTGGAAGA GATTGTGATG GCCTTTCACA CCCGCCGTGA TCACTACGGC
TTCGAGACGG GCATCCGCAC CCGCGAGATC TACCGCACCA GCCGCATGGT GAGTCGCCTG
AGCGGGATGC CCGTCCAGCC CAACAAGGCT GTGGTGGGCG ACAATGCCTT TGCGCACGAG
TCGGGCATCC ACCAGGACGG CGTCATCAAG GCGCGCGAGA CCTACGAGAT CATGAACGCC
GAGCTGGTGG GGCGTGAGGC TGCCGTGCTG GTGATGGGCA AGCACTCGGG CCGTGCCGCC
TTCCGCAAGG CGCTGACGGA TTTGGGCTAC GCGGTGGACG AGGAACGCCT CAAGCAGCTG
TTTGCCCGCT TCAAGGACAT GGCCGACCGC AAGGGACAGA TCTACGCAGA CGACCTGCGC
GCCCTGGTGG AAAGCCGCAG CGACGTGCCG CAGACCTTTA CGCTCGAGGG CTTCCAGATC
ACCTCCGGCA TGAACATGAC ACCGGTCGCC TTTGTGCGTC TGCAGACGCC CGACGGCCCG
GTGGATGCGA CCGCACACGG CGACGGCCCG GTGGAGGCCG CTTTTCAGGC GATCAACAAA
ATCACCGGCA TCACGCCCAC GCTGGAGAGC TACCGCATCC AGGCCGTCAC GGGCGGCGGC
GACGCGCTGG GCGAGGTCAG CATCGGCGCG CGCTACGGCG AGACGACCCT GCACGGAACC
GGCGTGGCGA CCGATGTGGT TGAAGCTTCT GCCCGCGCCT GGATTCGCAT CGTGAATCAG
GTGGTGGCGG GCATGGGCAA GAGCCGGGCG GTGAGTCAGA CAACAGTGTG A
 
Protein sequence
MTQPQAQAQR IRIFDTTLRD GEQSPGVALN HTQKLEIAHQ LARLGVDVIE AGFPIASPGD 
LEGVSRIARE VRGPIIAGLA RAGRADIEAA ARAVELAEKP RIHTFIATSP IHMAKKLQLE
PDAVIERAVE AVRLARSFVD DVEFSAEDAT RSDRDFLVRI FRAAVEAGAT TINVPDTVGY
TTPEEIRDLF AYLRGELPDH IILSAHCHDD LGMAVANSIA AAEGGARQIE CTVNGIGERA
GNASLEEIVM AFHTRRDHYG FETGIRTREI YRTSRMVSRL SGMPVQPNKA VVGDNAFAHE
SGIHQDGVIK ARETYEIMNA ELVGREAAVL VMGKHSGRAA FRKALTDLGY AVDEERLKQL
FARFKDMADR KGQIYADDLR ALVESRSDVP QTFTLEGFQI TSGMNMTPVA FVRLQTPDGP
VDATAHGDGP VEAAFQAINK ITGITPTLES YRIQAVTGGG DALGEVSIGA RYGETTLHGT
GVATDVVEAS ARAWIRIVNQ VVAGMGKSRA VSQTTV