Gene Emin_0404 details

Gene Information

Locus tag: Emin_0404
Symbol:
ID: 6262470
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 431164
End bp: 432168
Gene Length: 1005 bp
Protein Length: 334 aa
Translation table: 11
GC content: 43%
IMG OID: 642610871
Product: isocitrate dehydrogenase (NAD(+))
Protein accession: YP_001875298
Protein GI: 187250816
COG category: [C] Energy production and conversion
              [E] Amino acid transport and metabolism
COG ID: [COG0473] Isocitrate/isopropylmalate dehydrogenase
TIGRFAM ID: [TIGR00175] isocitrate dehydrogenase, NAD-dependent, mitochondrial type
            [TIGR00183] isocitrate dehydrogenase, NADP-dependent, prokaryotic type
            [TIGR02088] isopropylmalate/isohomocitrate dehydrogenases
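
The coordinate and length fields in this record agree with each other, and a short check makes the arithmetic explicit. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the last codon of the CDS is a stop codon not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 431164, End bp 432168.
length_bp = gene_length(431164, 432168)
print(length_bp, protein_length(length_bp))  # prints: 1005 334
```

Both results match the Gene Length (1005 bp) and Protein Length (334 aa) fields listed in the record.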


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value: 0.638011
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAAAAAAC AAATTGTACT TTTACCGGGC GACGGCATCG GCCCTGAGAT AACAGAATCC 
GCCTTGCAAA TTATAAAATC GGCGGGCATT GATTTAGATT ATGTTGTTAT GCAGGCTGGA
GCGGAAAGCG CAGCCAAAAC TGGCGAAACG CTGCCCTCGG AAGTTGTTGA AAATATAAAA
AAATATAAAG TGGCTTTAAA AGGCCCGATA ACGACTCCTA TAGGCACCGG GTTTAGAAGC
GTAAACGTAG CTTTAAGAAA GGAACTTAAT TTATACGGCG CGGTCAGGCC TTCAAAAAAT
CTTGAAGGCA TAAGGACAAA GTTTGATAAT GTTGATTTAG TTATAGTGCG TGAAAATACG
GAAGACCTTT ACGCCGGAAT TGAGCGTATG GTTGATGATG ACACGGCCGA ATCTATTAAA
CGTATTACAA GAAGCGCCAG CATGAGAATA GCGGAATTCG CTTTTGATTA TGCCGTAAAA
AATAACAGAA AAAAAGTTAC AGTTGTCACA AAAGCTAATA TTTGCAAATT TTCGGACGGG
CTTTTTTTAG AATGCGCCAG GCAGACAGCG CAAAAATACC CGCAGATAGA ATTTAAAGAA
ATATTAATTG ACAACCTTTG CATGCAGCTT GTTGTGCGCC CGCATGAGTT TGACGTGCTT
TTATGCCCCA ATCTATACGG CGACATAGTT TCTGATTTAG CCGCCGGTTT AACGGGGGGG
CTTGGCATAG CGCCGGGCGC TAATTACGGG GAGGACGGCG CCGCTTTGTT TGAGCCTGTG
CACGGCAGTT CACCCGATAT CGCGGGTAAA GGAATAGCCA ACCCGACAGC CTTAATAAGA
AGCGCTGTTT TAATGCTTAA CCATTTAGAT TACGCAAAAG AGGCGGCAAA AATAGACTCT
GCCGTTAACA CCGTTATTAA AGAGGGTAAA TTTACAACGC CCGATATCGG CGGCAACGCG
ACAACAAAGG AGTTTACTTC GGCCGTTATT AATAAATTAA AATAA
 
Protein sequence
MKKQIVLLPG DGIGPEITES ALQIIKSAGI DLDYVVMQAG AESAAKTGET LPSEVVENIK 
KYKVALKGPI TTPIGTGFRS VNVALRKELN LYGAVRPSKN LEGIRTKFDN VDLVIVRENT
EDLYAGIERM VDDDTAESIK RITRSASMRI AEFAFDYAVK NNRKKVTVVT KANICKFSDG
LFLECARQTA QKYPQIEFKE ILIDNLCMQL VVRPHEFDVL LCPNLYGDIV SDLAAGLTGG
LGIAPGANYG EDGAALFEPV HGSSPDIAGK GIANPTALIR SAVLMLNHLD YAKEAAKIDS
AVNTVIKEGK FTTPDIGGNA TTKEFTSAVI NKLK
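
The relationship between the gene sequence and the protein sequence above can be reproduced with a small script. This is a minimal sketch, not the pipeline used to build the record. It assumes that the amino-acid assignments of translation table 11 are the same as those of the standard genetic code (the two tables differ in the start codons they permit), and it strips the spaces that group the sequence into blocks of ten. The names `clean`, `gc_fraction`, and `translate` are hypothetical helpers introduced for illustration.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered with T, C, A, G at each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for display grouping and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    bases = clean(seq)
    return (bases.count("G") + bases.count("C")) / len(bases)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    bases = clean(seq)
    residues = []
    for pos in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First display line of the gene sequence (60 bases).
first_line = ("ATGAAAAAAC AAATTGTACT TTTACCGGGC "
              "GACGGCATCG GCCCTGAGAT AACAGAATCC")
print(translate(first_line))  # prints: MKKQIVLLPGDGIGPEITES
```

The output matches the first 20 residues of the protein sequence. Passing the complete gene sequence to `translate` and `gc_fraction` in the same way would allow the full protein sequence and the GC content field to be checked.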