Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0551 |
Symbol | |
ID | 4270306 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 598468 |
End bp | 600021 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638125292 |
Product | 2-isopropylmalate synthase |
Protein accession | YP_741395 |
Protein GI | 114319712 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0119] Isopropylmalate/homocitrate/citramalate synthases |
TIGRFAM ID | [TIGR00973] 2-isopropylmalate synthase, bacterial type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.235647 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.000444348 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCGAGA AAGAAAGACT CATCATCTTC GACACCACGT TGCGCGACGG CGAGCAGAGC CCCGGCGCCT CCATGACCCG CGAGGAGAAG GTGCGCATCG GCCGTGCCCT GGAGCGACTG AAGGTGGATG TCATCGAGGC CGGTTTCCCC GCCGCCAGCG AAGGCGACTT CGAGTCGGTC CGCGCCGTCG CCCGTGCGGT ACGGGGCAGC CGTATCTGCG GCCTGGCGCG CGCCCGCGAG GACGATATCC GGCGGGCCGG TGAGGCGCTG CAGGAGGCCG AGGCCGGGCG TATCCACACC TTCCTGGCCA CCTCGCCCAT CCACATGGAG AAGAAGCTGC GGATGACCCC CGACGAGGTG GTGGATGCCG CCGTCCGGGC GGTGAAGCTG GCCCGCTCCC TTTGCGACGA TGTGGAGTTC TCGCCGGAGG ACGCCGGGCG TTCCGATCCG GAGTTCCTCT GCCGGGTCAT TGAGGCGGTG ATCGACGCTG GCGCCGGCAC CGTGAACATC CCCGATACGG TGGGCTACAA CCTGCCCGAG CAATTCGGCG GGTTGATCGG GCGGCTGCGC GAGCGGGTGC CCAACTCCGA CAAGGCGGTC TTCTCGGTGC ACTGCCACAA CGATCTGGGG GTGGCGGTGG CCAACTCCCT GGCCGCGGTG ATGAACGGGG CCCGACAGGT GGAGTGCACC ATCAACGGCC TGGGCGAGCG GGCCGGCAAC GCCGCACTGG AGGAGGTGGT CATGGCGGTG CGCACCCGGC AGGACTTCTT CCCCTGCGAC ACCGGGATCG ATGCCCACGA GATCGTCCCT GCCAGCCGCC TGGTGGCCAA TATCACCGGT TTCCAGCCCC AGCCGAACAA GGCCATCGTG GGCGCCAACG CCTTCGCCCA CGAGTCGGGC ATCCACCAGG ACGGGGTGCT CAAGCACCGC GAGACCTACG AGATCATGCG CGCCGAGGAC GTGGGCTGGC ACACCAACCG CATGGTGCTG GGCAAGCACT CGGGCCGCAA CGCCTTCCGC GCCCGGCTCA AGGATCTGGG CATCGAGTTT GAGTCGGAGG AGCAGCTTAA CGAGGCCTTT CAGCGCTTCA AGGGGCTGGC GGACAAGAAG CACGAGATCT TCGACGAGGA CCTGCAGGCG CTGGTCACCG AGGCCAACCT GGAGCTGGAG AACGAGCGTT ACCGGCTGCT CGCGCTGCGG GTCTGCTCCG AGACCGGGGA GACCCCGGAG GCGATTGTCA CCCTGGCGGT GGACGGCCAC GAGCGCAGGG CGGTCTGTCC GGGCAGTGGT CCGGTAGATG CCGCCTTCAA GGCCATTGAG GAGCTGGTCG GGGCGGCGGA TACCGAGCTG TTGCTCTACT CGGTGAGCAA TATCACCACC GGGACCGACT CCCAGGGCGA GGTGACCGTG CGCCTGGAAC GGGGTGGGCG GATCGTCAAC GGGCAAGGCT CGGACACCGA CATCGTCATC GCCTCGGCCA AGGCCTACCT GAACGCCCTG AACAAGATCG ACCAGGGTGA GCTGCGGCGC CACCCGCAGG CGGCGGACGT TTAG
|
Protein sequence | MSEKERLIIF DTTLRDGEQS PGASMTREEK VRIGRALERL KVDVIEAGFP AASEGDFESV RAVARAVRGS RICGLARARE DDIRRAGEAL QEAEAGRIHT FLATSPIHME KKLRMTPDEV VDAAVRAVKL ARSLCDDVEF SPEDAGRSDP EFLCRVIEAV IDAGAGTVNI PDTVGYNLPE QFGGLIGRLR ERVPNSDKAV FSVHCHNDLG VAVANSLAAV MNGARQVECT INGLGERAGN AALEEVVMAV RTRQDFFPCD TGIDAHEIVP ASRLVANITG FQPQPNKAIV GANAFAHESG IHQDGVLKHR ETYEIMRAED VGWHTNRMVL GKHSGRNAFR ARLKDLGIEF ESEEQLNEAF QRFKGLADKK HEIFDEDLQA LVTEANLELE NERYRLLALR VCSETGETPE AIVTLAVDGH ERRAVCPGSG PVDAAFKAIE ELVGAADTEL LLYSVSNITT GTDSQGEVTV RLERGGRIVN GQGSDTDIVI ASAKAYLNAL NKIDQGELRR HPQAADV
|
| |