Gene Mlg_0551 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0551 
Symbol 
ID4270306 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp598468 
End bp600021 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content68% 
IMG OID638125292 
Product2-isopropylmalate synthase 
Protein accessionYP_741395 
Protein GI114319712 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR00973] 2-isopropylmalate synthase, bacterial type 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.235647 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.000444348 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCGAGA AAGAAAGACT CATCATCTTC GACACCACGT TGCGCGACGG CGAGCAGAGC 
CCCGGCGCCT CCATGACCCG CGAGGAGAAG GTGCGCATCG GCCGTGCCCT GGAGCGACTG
AAGGTGGATG TCATCGAGGC CGGTTTCCCC GCCGCCAGCG AAGGCGACTT CGAGTCGGTC
CGCGCCGTCG CCCGTGCGGT ACGGGGCAGC CGTATCTGCG GCCTGGCGCG CGCCCGCGAG
GACGATATCC GGCGGGCCGG TGAGGCGCTG CAGGAGGCCG AGGCCGGGCG TATCCACACC
TTCCTGGCCA CCTCGCCCAT CCACATGGAG AAGAAGCTGC GGATGACCCC CGACGAGGTG
GTGGATGCCG CCGTCCGGGC GGTGAAGCTG GCCCGCTCCC TTTGCGACGA TGTGGAGTTC
TCGCCGGAGG ACGCCGGGCG TTCCGATCCG GAGTTCCTCT GCCGGGTCAT TGAGGCGGTG
ATCGACGCTG GCGCCGGCAC CGTGAACATC CCCGATACGG TGGGCTACAA CCTGCCCGAG
CAATTCGGCG GGTTGATCGG GCGGCTGCGC GAGCGGGTGC CCAACTCCGA CAAGGCGGTC
TTCTCGGTGC ACTGCCACAA CGATCTGGGG GTGGCGGTGG CCAACTCCCT GGCCGCGGTG
ATGAACGGGG CCCGACAGGT GGAGTGCACC ATCAACGGCC TGGGCGAGCG GGCCGGCAAC
GCCGCACTGG AGGAGGTGGT CATGGCGGTG CGCACCCGGC AGGACTTCTT CCCCTGCGAC
ACCGGGATCG ATGCCCACGA GATCGTCCCT GCCAGCCGCC TGGTGGCCAA TATCACCGGT
TTCCAGCCCC AGCCGAACAA GGCCATCGTG GGCGCCAACG CCTTCGCCCA CGAGTCGGGC
ATCCACCAGG ACGGGGTGCT CAAGCACCGC GAGACCTACG AGATCATGCG CGCCGAGGAC
GTGGGCTGGC ACACCAACCG CATGGTGCTG GGCAAGCACT CGGGCCGCAA CGCCTTCCGC
GCCCGGCTCA AGGATCTGGG CATCGAGTTT GAGTCGGAGG AGCAGCTTAA CGAGGCCTTT
CAGCGCTTCA AGGGGCTGGC GGACAAGAAG CACGAGATCT TCGACGAGGA CCTGCAGGCG
CTGGTCACCG AGGCCAACCT GGAGCTGGAG AACGAGCGTT ACCGGCTGCT CGCGCTGCGG
GTCTGCTCCG AGACCGGGGA GACCCCGGAG GCGATTGTCA CCCTGGCGGT GGACGGCCAC
GAGCGCAGGG CGGTCTGTCC GGGCAGTGGT CCGGTAGATG CCGCCTTCAA GGCCATTGAG
GAGCTGGTCG GGGCGGCGGA TACCGAGCTG TTGCTCTACT CGGTGAGCAA TATCACCACC
GGGACCGACT CCCAGGGCGA GGTGACCGTG CGCCTGGAAC GGGGTGGGCG GATCGTCAAC
GGGCAAGGCT CGGACACCGA CATCGTCATC GCCTCGGCCA AGGCCTACCT GAACGCCCTG
AACAAGATCG ACCAGGGTGA GCTGCGGCGC CACCCGCAGG CGGCGGACGT TTAG
 
Protein sequence
MSEKERLIIF DTTLRDGEQS PGASMTREEK VRIGRALERL KVDVIEAGFP AASEGDFESV 
RAVARAVRGS RICGLARARE DDIRRAGEAL QEAEAGRIHT FLATSPIHME KKLRMTPDEV
VDAAVRAVKL ARSLCDDVEF SPEDAGRSDP EFLCRVIEAV IDAGAGTVNI PDTVGYNLPE
QFGGLIGRLR ERVPNSDKAV FSVHCHNDLG VAVANSLAAV MNGARQVECT INGLGERAGN
AALEEVVMAV RTRQDFFPCD TGIDAHEIVP ASRLVANITG FQPQPNKAIV GANAFAHESG
IHQDGVLKHR ETYEIMRAED VGWHTNRMVL GKHSGRNAFR ARLKDLGIEF ESEEQLNEAF
QRFKGLADKK HEIFDEDLQA LVTEANLELE NERYRLLALR VCSETGETPE AIVTLAVDGH
ERRAVCPGSG PVDAAFKAIE ELVGAADTEL LLYSVSNITT GTDSQGEVTV RLERGGRIVN
GQGSDTDIVI ASAKAYLNAL NKIDQGELRR HPQAADV