Gene Emin_0402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0402 
Symbol 
ID6262542 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp429388 
End bp430644 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content47% 
IMG OID642610869 
Product3-isopropylmalate dehydratase large subunit 
Protein accessionYP_001875296 
Protein GI187250814 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0065] 3-isopropylmalate dehydratase large subunit 
TIGRFAM ID[TIGR01343] homoaconitate hydratase family protein
[TIGR02086] 3-isopropylmalate dehydratase, large subunit 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.344506 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTCAGA CAATTGCAGA AAAAATTATT TCAAACCATT CCGGACGTCG CGTAAAAGCG 
GGGGAATTTG TTATAGCAGA CGTTGATTTA ACGGCCGTGC AGGACGGTAC CGGCCCTTTA
ACGGTTGAAG AGCTTAAAAA AGCCGGTTTT ACCAAACTGG CAAATCCCGC AAGAACAATA
TTATTTATTG ACCATGCGGC CCCAAGCCCC AGAAAAGAGC TTTCAAACTC ACAAGTTGTT
TTAAGAAATT TCGCTAAAGA AACGGGCGCG ATACTTTCCG AAATTGGCGA AGGAGTTTGC
CATCAGCTTT TGGCGGAAAA ATACGTAAAC CCCGGCGAAA TTCTAATCGG CGCTGATTCC
CACACCTGTA CTGGCGGGGC GCTTGGCGCG TTTGCCACGG GTATGGGTTC AACGGACGTG
GCCGTCGGCA TGGCTTTAGG TAAAACATGG CTTAAAGCGC CGCAGACTTT TAAAATAGAG
GTTGAAGGCG CGTTTAAAAA AGGTGTAGGC GCTAAGGACC TTATTTTGCA TTTAATAGGC
GTTATCGGCG CGGACGGCGC TACATATAAA GCGCTTGAGT TTCACGGTTC AACAATCAGA
AATATGGAAA TGGCAGACCG CTTTACCTTA GCCAATATGG CTGTGGAAGC GGGCGCGAAA
GCGGGCCTTT TCTTTACTGA TGAAAAAACA AGGGCTTACC TTGCCGAACG CGGCAGGGGG
GATAATTTTA AACTTATTTC CGCCGATGAA GGCGCTGATT ACGAAAAGGT TATTAAAATA
GACGCTTCCT CTTTAGAACC TACCGTTTCC TGCCCGCACA CGGTTGACAA TACAAAAACA
GTAGGCGAAC TTAAAGACAT TAAAGTTAAC CAGGTTTTTA TAGGCACCTG CACAAACGGA
CGTATAGAGG ATTTAAGAAT AGCAGCCGAG ATTTTGAAAG ATAAAAAAGT TAACCCCGGT
ACAAGAACTT TTATAACGCC CGCCTCGCGC GACGTTATGT TAGCCGCCTT AAAAGAAGGG
CTTATAGAAA TTTTTGTTAA GGCGGGCGCC AGCGTGCAAA CGCCTGGCTG CGGGCCTTGC
GTTGGCGTGC ACGGCGGCAT TTTGGGCGAT GGGGAAGTTT GTTTAGCCAC CCAAAACCGC
AATTTCCAGG GTCGCATGGG CAATACAAAA GGTTTTATTT ATCTTTCCTC GCCCGCAGTA
GCCGCTTACA GCGCTTTAAA AGGTTATATT TCCGACCCCA GGGAAATATT AAAATAA
 
Protein sequence
MPQTIAEKII SNHSGRRVKA GEFVIADVDL TAVQDGTGPL TVEELKKAGF TKLANPARTI 
LFIDHAAPSP RKELSNSQVV LRNFAKETGA ILSEIGEGVC HQLLAEKYVN PGEILIGADS
HTCTGGALGA FATGMGSTDV AVGMALGKTW LKAPQTFKIE VEGAFKKGVG AKDLILHLIG
VIGADGATYK ALEFHGSTIR NMEMADRFTL ANMAVEAGAK AGLFFTDEKT RAYLAERGRG
DNFKLISADE GADYEKVIKI DASSLEPTVS CPHTVDNTKT VGELKDIKVN QVFIGTCTNG
RIEDLRIAAE ILKDKKVNPG TRTFITPASR DVMLAALKEG LIEIFVKAGA SVQTPGCGPC
VGVHGGILGD GEVCLATQNR NFQGRMGNTK GFIYLSSPAV AAYSALKGYI SDPREILK