Gene Dret_1100 details

Gene Information

Locus tag: Dret_1100
Symbol:
ID: 8418925
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 1290553
End bp: 1291644
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 60%
IMG OID: 645037672
Product: 3-isopropylmalate dehydrogenase
Protein accession: YP_003197966
Protein GI: 258405224
COG category: [C] Energy production and conversion; [E] Amino acid transport and metabolism
COG ID: [COG0473] Isocitrate/isopropylmalate dehydrogenase
TIGRFAM ID: [TIGR00169] 3-isopropylmalate dehydrogenase
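
The listed gene and protein lengths can be cross-checked against the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record above.
start_bp = 1290553
end_bp = 1291644

# Assumption: 1-based, inclusive coordinates, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon
# and is therefore excluded from the amino-acid count.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.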


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.757445
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.166084
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCACAT TTACTATCTG TTGTCTGCCA GGGGATGGCA TCGGCCCGGA GATCACCACT 
GAAGCCCAAA ACGTTCTGAC TGCGATTGGA AGGCAGTTTG GTCATACTTT TACGATGACC
GACGAGCCCA TCGGTGGCGC GGCGATCGAC ACGCATGGCG TTCCGTTGCC GGAGGCGACT
CTCCAGGCCT GCCGCGAGAG CCACGCAGTG CTCCTGGGTG CGGTCGGCGG GCCGAAGTGG
GATGCACTGG AGACAGCCAT CCGTCCGGAA AAAGGATTGC TGGCCCTGCG CAAGGGATTG
TCCCTGTATG CCAACCTCCG TCCCGCGGTT ATTTTTCCGG AACTCAAAGA GGCGTCTTAC
CTGCGTCCGG ATATTGTGGC CGACGGCGTG GACGTGCTGG TTGTGCGGGA ACTGACCGGG
GGGATTTATT TCGGCGAACC GCGCGGGCGC GAAGGCGAAC CGGGACAACG CCGGGCCATG
AACACCATGG TCTACGATGA GACCGAAGTA CGCCGTATCG GCCGGCTCGC TTTCGAAGCC
GCACAGCAAC GGGACAAACG GCTGTGTTCC GTGGACAAGG CCAATGTTCT TGAAGTCTCG
CAATTATGGC GGGAAGTCAT GAACGAACTG GCTCCCTCCT ATCCGGATGT CACCCTGGAG
CACATGTATG TCGACAATGC AGCCATGCAA CTGGTTCGGG ATCCGAAACA ATTCGATGTT
GTGGTCACTT CCAATCTTTT TGGGGATATC CTCTCCGATG AAGCCGCGAC AATCACCGGA
TCCATCGGCA TGTTGCCTTC GGCCTCCCTC GGCGACGAGA AGCCGGCTCT GTTTGAACCG
ATCCATGGCT CAGCTCCGGA TATCGCCGGT CAGGACAAGG CCAATCCGCT GGCGACCATC
CTTTCCGTGG GCATGTTGCT CCGATTCGGC CTCGGCCTGG AACAGGAGGC CGACGCCGTG
GACGCGGCAG TAGCCGACGT CATTGCCCAA GGTCTGCGTA CCGGGGATAT CGCCGGTCCT
GGCGAAGCTG TGCTCGGATG CCGTGCCATG GGTGCGGCTG TGGTTGACCG TCTCCAGGCC
CGTAAGGACT GA
 
Protein sequence
MATFTICCLP GDGIGPEITT EAQNVLTAIG RQFGHTFTMT DEPIGGAAID THGVPLPEAT 
LQACRESHAV LLGAVGGPKW DALETAIRPE KGLLALRKGL SLYANLRPAV IFPELKEASY
LRPDIVADGV DVLVVRELTG GIYFGEPRGR EGEPGQRRAM NTMVYDETEV RRIGRLAFEA
AQQRDKRLCS VDKANVLEVS QLWREVMNEL APSYPDVTLE HMYVDNAAMQ LVRDPKQFDV
VVTSNLFGDI LSDEAATITG SIGMLPSASL GDEKPALFEP IHGSAPDIAG QDKANPLATI
LSVGMLLRFG LGLEQEADAV DAAVADVIAQ GLRTGDIAGP GEAVLGCRAM GAAVVDRLQA
RKD
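
The relationship between the gene sequence and the protein sequence can be illustrated with a short script. This is a sketch only: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons, which does not matter here because the gene starts with ATG). The two example rows are copied from the gene sequence above; the last row is in frame because the 18 rows before it hold 60 bases each.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of G and C bases in a sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence listed above.
first_row = "ATGGCCACAT TTACTATCTG TTGTCTGCCA GGGGATGGCA TCGGCCCGGA GATCACCACT"
last_row = "CGTAAGGACT GA"

print(translate(first_row))
print(translate(last_row))
```

The first row translates to the first 20 residues of the listed protein sequence, and the last row to its final three residues followed by the stop codon. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field of the record.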