Gene Mlg_0489 details


Gene Information

Locus tag: Mlg_0489
Symbol:
ID: 4268357
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 534659
End bp: 535825
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 71%
IMG OID: 638125229
Product: 3-isopropylmalate dehydrogenase
Protein accession: YP_741333
Protein GI: 114319650
COG category: [C] Energy production and conversion; [E] Amino acid transport and metabolism
COG ID: [COG0473] Isocitrate/isopropylmalate dehydrogenase
TIGRFAM ID:
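
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes the stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the record; helper names are hypothetical.
START_BP = 534659
END_BP = 535825


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS of the given length, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1167 388
```

Under those assumptions the result matches the reported Gene Length (1167 bp) and Protein Length (388 aa).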


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.410888
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value: 0.730231
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGAAAG GCAGGACCTG GCGGGTGGCA GTGTGTCCCG GCGATGGCAT CGGGCCGGAG 
GTGATGGCGC CCACGGTCGC CGCGCTGCGG GCGGTGGCCG GACGCGAGGG GCTGGCGCTG
GAGCTCCAGC ACTACGACTG GCCGTCCCAC GACTGGCACC GTCGGCACGG CGAGATGATG
CCCGGGGACT GGCGCGAGCA GTTGGCCGCT CACGACGCCA TCCTGCTCGG CGCCCTGGGC
GACCCCGGGC CGACCAATGA TCCTGATCGC TACCTGCTCT CAGATGGTGT GTCGCTGGCG
CCGCTGCTGC AATTGCGCAA GGGGTTCGAC CTGTGGGCCT GCGAGCGGCC GGCGGTCCGG
CTGCCCGGTA CGCCTCAGTA CCTGGCCGAC CCCCGCGCCG AGGAACTGGA CATGCTGGTG
ATCCGCGAGA ACAGCGAGGG CGAGTATGTG GCCCAGGGCG GGCGCCTGGC GCCGGGCACG
GCGCGCGAGG TGGCCACTCA GGTGGAGGTG TTCACCCGCC TGGCCACCGA GCGGATCATC
CGCCACGCCT TCGAGCGCGC CCTACAGCGG GCCCACCTGC GCCAGACCGG CGAGCGCCCA
CCGCGCCCTT TTCCGCGGAC CGGCGGTGGC GAGGCCAACG CCCAGGTCTG CCTGATCACC
AAGCGAAACG CCCAGGCCTA CTGGGGCGAG ATGTGGACGG AGATCTTCGC CGAGGTGGCG
CCCGACTACC CCGAGATCGC CACCCACCAT GAACTGGTGG ACGCCGCCTG CATGAAGTTC
GTGACCCGTC CCTGGGTGTT CGACGTGGTG GTGGCCAGCA ATCTCCATGG CGACATCCTC
ACTGACCTGG CCGCGGTGCT CTGCGGCGGT ATGGGGGTTG CCCCCTCCTG TAACATCAAC
CCGCAGGATC GCCGTGTGCC ACCCCTGTTC GAGCCCACCC ACGGCAGCGC CCCGGACATC
GCCGGGCAAG GACTGGCCGG GCCTGAGGCC ATGCTGCTGA CCGCAGCGAT GATGCTGGAC
TGGATGGGCG AGGAGGACCC GGCCGCGGCC CGCGCCGGTG AACGCCTGCG CCTGGCGGTA
GCCGCCGACC TGCAGACCGG TAGCGGCGAG GCGCGGGGCA CCGAGGCAGT GGGGGCGGCC
ATCCTGGACC GTCTGGATCA GCAGTGA
 
Protein sequence
MAKGRTWRVA VCPGDGIGPE VMAPTVAALR AVAGREGLAL ELQHYDWPSH DWHRRHGEMM 
PGDWREQLAA HDAILLGALG DPGPTNDPDR YLLSDGVSLA PLLQLRKGFD LWACERPAVR
LPGTPQYLAD PRAEELDMLV IRENSEGEYV AQGGRLAPGT AREVATQVEV FTRLATERII
RHAFERALQR AHLRQTGERP PRPFPRTGGG EANAQVCLIT KRNAQAYWGE MWTEIFAEVA
PDYPEIATHH ELVDAACMKF VTRPWVFDVV VASNLHGDIL TDLAAVLCGG MGVAPSCNIN
PQDRRVPPLF EPTHGSAPDI AGQGLAGPEA MLLTAAMMLD WMGEEDPAAA RAGERLRLAV
AADLQTGSGE ARGTEAVGAA ILDRLDQQ
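
The sequences above are printed in space-separated blocks of ten, so they need cleaning before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation checked against the protein sequence. It assumes the codon assignments of translation table 11, which are the same as those of the standard code (the two tables differ only in which codons may act as initiators). All function names are illustrative.

```python
# Codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Any incomplete trailing codon is ignored.
    """
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two blocks of the gene sequence as displayed in the record.
print(translate("ATGGCGAAAG GCAGGACCTG"))  # prints: MAKGRT
```

The translated prefix agrees with the first six residues of the protein sequence (MAKGRT). Passing the full gene sequence to `gc_fraction` and `translate` allows the same comparison against the reported GC content and protein sequence.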