Gene Daro_3775 details


Gene Information

Locus tag: Daro_3775
Symbol:
ID: 3567508
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 4059604
End bp: 4060680
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table: 11
GC content: 60%
IMG OID: 637682250
Product: 3-isopropylmalate dehydrogenase
Protein accession: YP_286974
Protein GI: 71909387
COG category: [C] Energy production and conversion; [E] Amino acid transport and metabolism
COG ID: [COG0473] Isocitrate/isopropylmalate dehydrogenase
TIGRFAM ID: [TIGR00169] 3-isopropylmalate dehydrogenase


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value: 0.00289706
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00333636
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAATCT GTGTAATTCC CGGGGACGGC ATCGGTGTCG AAATTTGTGC CGAGGCCGTC 
AGGGTGATCG ACGCCCTGAA GGATATCCAT GGCCTGAAAA TCGAATATGA GTACGGGCTG
TTGGGTGGCG CCGCCTACGA TCAGACGGGG CGCCCCTTGC CGGTTGAAAC CTTGAGGCTG
GCTGACGAAG CCAACGCCAT CCTGCTCGGT GCCGTCGGCG GCCCGAAGTG GGACAAGCTG
CCGGCCGAGT CGCGCCCCGA ACGCGGCTTG CTGGGGATCA GGAAATATCT CGGGCTCAAT
GCCAATCTGC GTCCGATCAA GGTCTATCCG GAACTGGCCA ATGCCTCGAC GCTGCGCCCG
GAGGTGGTCA GCGGACTCGA CATGATGATC GTCCGGGAAC TGACGGGCGA TATCTATTTT
GGCCAGCCGC GCGGGATACG GACCTCCGGG TTCGAACGCG TCGGTTACAA CACCATGGAG
TACTCCGAAT CCGAGATCGC ACTGATCGCC GAAATGGCGT TCAGGATTGC CCGCCAACGG
AGTGGCAAGG TGATGTCCGT CGACAAGATG AATGTGCTGG AGTGCATGCA GCTCTGGCGC
GACGTGGTGA CCAAGGTCGG CGAACGTTTC CCGGATGTCA CGCTCGATCA CATGCTGGTC
GACAACGCGG CCATGCAACT GGTCAAGAAC CCGAAGCAGT TTGATGTCCT GCTCACCGGC
AACATGTTTG GTGACATTCT CTCGGACGAA GCCGCGATGC TGACCGGTTC GATCGGCATG
CTGCCTTCGG CCTCGCTCAA TGTCGAGGAC AAAGGGATGT ATGAGCCTTG CCATGGTTCA
GCGCCGGACA TTGCAGGGCA GGGCGTTGCC AATCCCTTGG GCATGATTCT GTCGGCAGCG
ATGATGTTCC GCTATAGCTT GGGCCGGCCG GACATGGCCG ATGCGATCGA ATCAGCGGTC
CAGACGGTAT TGACGAATGG CGCGCGGACC AGGGATATTT TCCAGGCCGG TGACCGTTTG
GTTTCCACTT CCGAAATGGG TGGGCTGGTC GAGGCGGCGC TAAGGCGAAT CAGCTAG
 
Protein sequence
MKICVIPGDG IGVEICAEAV RVIDALKDIH GLKIEYEYGL LGGAAYDQTG RPLPVETLRL 
ADEANAILLG AVGGPKWDKL PAESRPERGL LGIRKYLGLN ANLRPIKVYP ELANASTLRP
EVVSGLDMMI VRELTGDIYF GQPRGIRTSG FERVGYNTME YSESEIALIA EMAFRIARQR
SGKVMSVDKM NVLECMQLWR DVVTKVGERF PDVTLDHMLV DNAAMQLVKN PKQFDVLLTG
NMFGDILSDE AAMLTGSIGM LPSASLNVED KGMYEPCHGS APDIAGQGVA NPLGMILSAA
MMFRYSLGRP DMADAIESAV QTVLTNGART RDIFQAGDRL VSTSEMGGLV EAALRRIS
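
Consistency check (illustrative)

The gene length, protein length, translation table and the two sequences in this record are related by simple arithmetic: 1077 bp is 359 codons, and one of those is the stop codon, which leaves 358 aa. The Python sketch below is one way to check that relationship and to translate the gene sequence. It is not part of the original record. The function names (clean, translate, gc_fraction, expected_protein_length) are hypothetical, and the codon table is written out by hand on the assumption that translation table 11 assigns the same amino acids to each codon as the standard code.

```python
from itertools import product

# Codon table for translation table 11. Assumption: its codon-to-amino-acid
# assignments match the standard code, listed here in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used in the record's sequence layout."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def expected_protein_length(gene_length_bp):
    """Number of codons in the gene minus the terminal stop codon."""
    return gene_length_bp // 3 - 1


# First 30 bases of the gene sequence, copied from the record above.
prefix = "ATGAAAATCT GTGTAATTCC CGGGGACGGC"
print(translate(prefix))              # MKICVIPGDG
print(expected_protein_length(1077))  # 358
```

Passing the full gene sequence to translate and gc_fraction in the same way allows a comparison against the protein sequence and GC content listed in the record.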