Gene Mlg_1230 details

Gene Information

Locus tag: Mlg_1230
Symbol:
ID: 4269761
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 1433777
End bp: 1434862
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 67%
IMG OID: 638125980
Product: 3-isopropylmalate dehydrogenase
Protein accession: YP_742069
Protein GI: 114320386
COG category: [C] Energy production and conversion; [E] Amino acid transport and metabolism
COG ID: [COG0473] Isocitrate/isopropylmalate dehydrogenase
TIGRFAM ID: [TIGR00169] 3-isopropylmalate dehydrogenase
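
The coordinates and lengths listed above can be checked against each other with a short calculation. The Python sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that Protein Length does not count the stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information record above.
start_bp = 1433777
end_bp = 1434862

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the result is 1086 bp and 361 aa, which matches the Gene Length and Protein Length fields.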


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.121054
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGCCA ACATCCTGAT CACCCCGGGC GACGGTATCG GTCCGGAGAT CGTGGCCGAG 
GCGCGCAAGC TGCTGGAGGC CCTGCGTGAC GACTTCGGCT TCGACTGCAC TTTGGAAGAG
GCCCCCATCG GCGGCGCTGG CTACGAGGCG CATGGCAAGC CGCTGCCGGA AGAGACCCTG
GCTCTCGCCC GGGAGGCCGA TGCCATCCTA TTGGGTGCCG TGGGTGGGCC GCGCTGGGAA
CAGCTGGATC GCCCCCTGCG TCCCGAACGC GGCCTCCTGG CCATCCGTGC GGAGTTGGGC
CTGTTCGGCA ACCTGCGTCC AGCCATCCTC TATCCGCAAC TGGCCGAGGC CTCCAGCCTG
CGCCATGAGA TCGTCGCCGG CCTGGACATC ATGATCGTCC GGGAGCTGAC CGGCGGCATC
TACTTCGGTG AGCCCCGGGG CATCCGCAGG CTGGAGAACG GCGAACGCCA GGGTTACAAC
ACCATGGTCT ACAGCGAGTC GGAGATCGAC CGCGTGGGCC GGCTGGCCTT TGACATCGCG
AGCAAGCGCG GCAGCCGAGT CTGCTCCGTG GACAAGGCCA ACGTGCTGGA GGTCTCCGAA
CTCTGGCGTG AGGTGATGGA ACGCGTGGCC CGGGATTACC CCGGTGTCGA GCTGAGCCAC
ATGTACGTGG ACAACGCCGC CATGCAGTTG GTGCGTGCGC CCAAACAGTT CGACGTGGTG
GTCACCAGCA ATCTGTTCGG TGACGTGCTC TCGGACTGTG CCGCCATGCT CACCGGCTCC
ATTGGCATGC TGCCCTCCGC CTCGCTGGAT GTGAACAGCA AGGGGCTGTA TGAGCCGGTG
CACGGTTCCG CGCCGGACAT CGCCGGCAAG GGGCTGGCCA ATCCGCTGGC CACCCTTCTG
TCAGTGGCCA TGATGCTGCG CTACAGTCTG GATCAGGGCG CCCTCGCCGA CCGGGTGCAG
CAGGCGGTGG GTGATGTGCT CAACCAGGGG CTGCGCACGC CGGATATCGC CGCCCGCCAG
TCGCGCACCG TCAGCACCGC CGAGATGGGT GACGCGGTGG TGGCCGCGCT GCGTGCCCGG
GGCTGA
 
Protein sequence
MTANILITPG DGIGPEIVAE ARKLLEALRD DFGFDCTLEE APIGGAGYEA HGKPLPEETL 
ALAREADAIL LGAVGGPRWE QLDRPLRPER GLLAIRAELG LFGNLRPAIL YPQLAEASSL
RHEIVAGLDI MIVRELTGGI YFGEPRGIRR LENGERQGYN TMVYSESEID RVGRLAFDIA
SKRGSRVCSV DKANVLEVSE LWREVMERVA RDYPGVELSH MYVDNAAMQL VRAPKQFDVV
VTSNLFGDVL SDCAAMLTGS IGMLPSASLD VNSKGLYEPV HGSAPDIAGK GLANPLATLL
SVAMMLRYSL DQGALADRVQ QAVGDVLNQG LRTPDIAARQ SRTVSTAEMG DAVVAALRAR
G
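
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The Python sketch below assumes that the listed gene sequence is the coding strand read 5' to 3'. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical and chosen only for illustration. Translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in the permitted start codons), so the standard assignment is used here.

```python
# Amino acids for the 64 codons, enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group the sequence in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGACTGCCA ACATCCTGAT CACCCCGGGC GACGGTATCG GTCCGGAGAT CGTGGCCGAG"
print(translate(first_line))
```

Applied to the first 60 bases, this prints MTANILITPGDGIGPEIVAE, the first 20 residues of the listed protein sequence. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.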