Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_2682 |
Symbol | |
ID | 4269557 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 3037765 |
End bp | 3039360 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638127441 |
Product | isocitrate lyase |
Protein accession | YP_743512 |
Protein GI | 114321829 |
COG category | [C] Energy production and conversion |
COG ID | [COG2224] Isocitrate lyase |
TIGRFAM ID | [TIGR01346] isocitrate lyase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00000000577171 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCACAGT ACAGGAACGA CATCGAGGAA GTTGCCGGCC TGCGCCAGGC GCATGAAGGC ACCTGGGACG CGATCAACCC CGAATACGTG GCCCGCATGC GCGCCCAGAA CCGGTTCAAG ACCGGTCTGG ACATCGCCAA GTACACCGCC AAGATCATGC GCCAGGACAT GGCCGAGTAC GATGCCGATC CGGCCCAGTA CACCCAGTCC TTGGGGTGCT GGCACGGCTT CGTCGGCCAG CAGAAGATGA TCGCCATCAA GCGTCACTTC GGCACCACCA ACAAGCGCTA CCTCTACCTC TCCGGCTGGA TGGTTGCGGC GCTGCGCTCC GAGTTCGGCC CGCTGCCGGA CCAGTCCATG CACGAGAAGA CCGCCGTGCC GGCGCTGATC GAGGAACTCT ACACCTTCCT CCGCCAGGCC GACGCCCGTG AGATGAACCA CCTCTTCCGT GAGCTGGATG CGGCCCGCGA GGCCGGCGAT GAGGCCAAGG CCAACGAGAT TCTCGACAAG ATCGAGAACT TCGAGACCCA CATCGTGCCG ATCATCGCCG ACATCGACGC CGGCTTCGGT AACGAAGAGG CCACCTACCT GCTGGCCAAG AAGATGATCG AGGCGGGTGC CTGCTGCATC CAGATCGAGA ACCAGGTCGC CGACGAGAAG CAGTGCGGCC ACCAGGACGG CAAGGTCACC GTGCCCCACG CCGACTTCCT GGCCAAGGTC CGCGCCGTGC GCTATGCGTT CCTGGAGCTG GGCGTGAACG ACGGCGTGAT CGTTGCCCGC ACCGACTCCC TCGGCGCCGG CCTGACCAAG CAGATCGCCG TCACCGATGA GCCGGGCGAC CTGGGCGACA AGTACAACAG CTTCCTGGAC GGCGATTACA TCGACAGCGC CGAGGACATC AACAATGGCG ATGTGGTGGT CAAGTCCGAA GGCAAGCTGC TCAAGCCCAA GCGCCTCGCC AGCGGCCTCT ACGAGTTCCG CAAGGGCACC GGCTTTGACC GCGTGGTGCT GGACTGCATC ACCAGCCTGA AGAACGGCGC CGACCTGCTG TGGATCGAGA CCGAGAAGCC CCACGTCGGC CAGATCGCCG AGATGGTCAA CGCCATCCGC GAGGAAGTGC CGAACGCCAA GCTGGTCTAC AACAACAGCC CGTCCTTCAA CTGGACGCTG AACTTCCGCC AGCAGGTGTT CGATGCCTGG AAGGAAGAGG GCAAGGACGT CTCCCAGTAC GACCGCGACA ACCTGATGAG CGCCGAGTAC GACGACAGCG AGCTGGCGGC GGAAGCCGAC CGCTGGGTGC AGGAGTTCCA GCGCAAGGCC TCCGCAGAGG CGGGTATCTT CCACCACCTG ATCACCCTGC CGACCTACCA CACGGCCGCC CTGTCCACCG ACAACCTGGC CAAGGGCTAC TTCGGTGACC TGGGCATGCT GGCCTACGTG GATGGCGTGC AGCGTCAGGA GATCCGCCAG GGCGTGCCCA CCGTGAAGCA CCAGGACATG GCCGGCTCCA ACATCGGGGA CGACCACAAG GCCTACTTCT CCGGTGAGGC CGCGCTCAAG GCCGGCGGGA AGGGCAACAC CATGAACCAG TTCTAA
|
Protein sequence | MSQYRNDIEE VAGLRQAHEG TWDAINPEYV ARMRAQNRFK TGLDIAKYTA KIMRQDMAEY DADPAQYTQS LGCWHGFVGQ QKMIAIKRHF GTTNKRYLYL SGWMVAALRS EFGPLPDQSM HEKTAVPALI EELYTFLRQA DAREMNHLFR ELDAAREAGD EAKANEILDK IENFETHIVP IIADIDAGFG NEEATYLLAK KMIEAGACCI QIENQVADEK QCGHQDGKVT VPHADFLAKV RAVRYAFLEL GVNDGVIVAR TDSLGAGLTK QIAVTDEPGD LGDKYNSFLD GDYIDSAEDI NNGDVVVKSE GKLLKPKRLA SGLYEFRKGT GFDRVVLDCI TSLKNGADLL WIETEKPHVG QIAEMVNAIR EEVPNAKLVY NNSPSFNWTL NFRQQVFDAW KEEGKDVSQY DRDNLMSAEY DDSELAAEAD RWVQEFQRKA SAEAGIFHHL ITLPTYHTAA LSTDNLAKGY FGDLGMLAYV DGVQRQEIRQ GVPTVKHQDM AGSNIGDDHK AYFSGEAALK AGGKGNTMNQ F
|
| |