Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0926 |
Symbol | |
ID | 4268213 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 1051178 |
End bp | 1052281 |
Gene Length | 1104 bp |
Protein Length | 367 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 638125678 |
Product | chorismate mutase / prephenate dehydratase |
Protein accession | YP_741770 |
Protein GI | 114320087 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0077] Prephenate dehydratase [COG1605] Chorismate mutase |
TIGRFAM ID | [TIGR01807] chorismate mutase domain of proteobacterial P-protein, clade 2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGACG AACAGACCTC CCCCGACGAT CAGGCCGCGC TGCAGGCCGT GCGTGCGCGC ATCGACGCCC TGGACGACGA GATCCTGCGG CTAATCAGCG AGCGCGCCCG GATGGCCGAA GAGGTGGCCC GGGTCAAGCG TGAGGCCGGG CACAGCAACG ATTTCTACCG CCCCGAGCGC GAGGCGCAGG TGCTGCGCCG GGTCCGCCAG TCCAACCCCG GCCCGCTGGG CGAGGAGGCG GTGACCCGGC TGTTCCGCGA GATCATGTCC GCCTGCCTGG CCATCCAGCT GCCCTTGCAG GTGGCCTTCC TGGGGCCCGA AGGCACCTAT ACCCAGGAGG CGGCACTCAA GCACTTCGGC CACGCCATGG GCACGGCACC GCTGAGCACT ATCGCCGCGG TCTTTCGTGA GGTGGAGTCC GGTGCCGCCC ACTACGGGGT GGTGCCGGTG GAGAACTCCA CCGAGGGGGT GGTCACCCAC ACGGTGGACC GCTTTCTCAA CTCGCCGCTG CAGATCGTCG GCGAGGTGCA GTTACCCATC CACCACGCCC TGGCCAGCCG CGAGCAGGAC TGGAACGCCA TCCGGCGTAT CTACTCCCAC CAGCAGGGAC TGGCCCAGTG CCGGGCCTGG GTCGATACCC ATCTGCCGGG CGTGGAACGG GTGCCGGTCA CCAGCACCGC CGAGGCGGCG CGGCTGGCGG CGGCCGAACG GGGTGCGGCG GCCATCGCCA GTGAGGCGGC CTGCGAGCTC TACGACTTGC CGGTGCTTGC CACCCACATC GAGGACGAGC CGGGCAACAC CACCCGCTTT CTGGTGGTGG GGCCGGAGTC TCCACCACCC AGCGGTGACG ACAAGACCTC CTTGGTGATC AGCCGGGCCA ACCAGCCGGG TGGCCTTTAC CGGCTGCTGG AACCATTAGC CAGGAATGGA GTGAACATGA CCCGGATCGA ATCCCGGCCC GCGCCGCAGG GCGTCTGGGA GTATGTGTTC TTCGTGGACC TGTTGGGTCA CGTGGAGGAC GAACCCGTCC GCCAGGCGTT GGCCGAGATC CGCGAACAGG CCAGCCTGTG CCGCGTCCTG GGCTCGTACC CGCGAGCGCT GTAA
|
Protein sequence | MSDEQTSPDD QAALQAVRAR IDALDDEILR LISERARMAE EVARVKREAG HSNDFYRPER EAQVLRRVRQ SNPGPLGEEA VTRLFREIMS ACLAIQLPLQ VAFLGPEGTY TQEAALKHFG HAMGTAPLST IAAVFREVES GAAHYGVVPV ENSTEGVVTH TVDRFLNSPL QIVGEVQLPI HHALASREQD WNAIRRIYSH QQGLAQCRAW VDTHLPGVER VPVTSTAEAA RLAAAERGAA AIASEAACEL YDLPVLATHI EDEPGNTTRF LVVGPESPPP SGDDKTSLVI SRANQPGGLY RLLEPLARNG VNMTRIESRP APQGVWEYVF FVDLLGHVED EPVRQALAEI REQASLCRVL GSYPRAL
|
| |