Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0929 |
Symbol | |
ID | 4268216 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 1054351 |
End bp | 1055691 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 638125681 |
Product | 3-phosphoshikimate 1-carboxyvinyltransferase |
Protein accession | YP_741773 |
Protein GI | 114320090 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase |
TIGRFAM ID | [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.364376 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAATTTC ATGTGCGACC CGGCGGGGCG CTGCGGGGGC GGCTCCGCGT CCCGGGCGAT AAATCCATCT CTCATCGCGC CATCATGCTC GGCGCCCTGG CCGAGGGCGA GACCCGCATC AGCGGCTTTC TGGAGGGTGC CGATGCCCTG GCCACGCTGC GCACCTTCCG TGCCATGGGG GTGGACATCG ACGGGCCCCA CCAGGGCCGG GTGACGGTGC AGGGGGTCGG TCTGCACGGG CTGCGGGCGC CGGACGGGCC GCTGGACCTG GGCAACTCCG GCACCTCCAT GCGCCTGCTC TGCGGGCTGC TGGCCGGGCA GTCCTTCGAT ACCACCCTCA CCGGCGATGC CTCCCTGTCC CGCCGGCCCA TGCGCCGGGT GATCGACCCG CTCACCGCCA TGGGGGCGGT GATCGAGAGC GGCCAGGGCG GCACGGCGCC GCTGACGGTG CGTGGCGGGC AACCCCTGCA CGGCATCGAC TACGAGCTGC CGGTGGCCAG CGCTCAGGTC AAATCCGCGC TGCTGCTGGC CGGCCTGTAC GCCCGGGGGC GTACCTGTGT CACCGAGCCG GCGCCCACCC GCGATCACAC CGAGCGCATG CTGGCCGGGT TCGGCTACCC GGTGCGGCAG GAGGGCCGCC GGGTCTGCAT CGAGGGCGGA GGGCGGCTGC GCGGCGGCGA GATCGATGTG CCGGCGGACA TCTCCTCGGC GGCCTTTTTT CTGGTCGGTG CGAGCATCGC CGAGGGCTCC GATATCACCC TGGAGCACGT GGGCATGAAC CCGACCCGCA CCGGGGTGGT CGACATCCTC CGCCTGATGG GGGCCGATAT CCAGGTGCAG AACGAGCGGG AAGTGGGGGG CGAGCCGGTG GCCGACCTGC GGGTGCGCAG TGCCCCCCTG AAGGGGGTTG CCATCCCCGA GGCGCTGGTA CCGCTCGCCA TCGATGAATT CCCGGTGCTC TTTGTCGCCG CCGCCTGCGC CGAGGGTGAG ACGCTGCTGA CCGGGGCCGA GGAGCTGCGG GTCAAGGAGA GCGACCGGAT CGCGGTGATG GCCGAGGGCT TGACCACCCT GGGGGTTACC GCCGAGCCGC AACCCGACGG TATGCGCATC GTCGGTCAGC CGGATTGGGG CGGGGGGCGG GTGCACAGCC ACGGGGATCA CCGTATCGCC ATGGCCTTCA CCATGGCCGC CACCCGGGCC CGTGAGCCGA TCGAGATCGA GGACTGCGCC AATGTGAACA CCTCGTTCCC CGGGTTCGTC GAGCTGGCCG GTGACGCAGG GGTGGCACTG ACCCGGGGCG ACGCAGAGGG ACGTGCCCAG CAAAGGAGCG ATTCACCATG A
|
Protein sequence | MEFHVRPGGA LRGRLRVPGD KSISHRAIML GALAEGETRI SGFLEGADAL ATLRTFRAMG VDIDGPHQGR VTVQGVGLHG LRAPDGPLDL GNSGTSMRLL CGLLAGQSFD TTLTGDASLS RRPMRRVIDP LTAMGAVIES GQGGTAPLTV RGGQPLHGID YELPVASAQV KSALLLAGLY ARGRTCVTEP APTRDHTERM LAGFGYPVRQ EGRRVCIEGG GRLRGGEIDV PADISSAAFF LVGASIAEGS DITLEHVGMN PTRTGVVDIL RLMGADIQVQ NEREVGGEPV ADLRVRSAPL KGVAIPEALV PLAIDEFPVL FVAAACAEGE TLLTGAEELR VKESDRIAVM AEGLTTLGVT AEPQPDGMRI VGQPDWGGGR VHSHGDHRIA MAFTMAATRA REPIEIEDCA NVNTSFPGFV ELAGDAGVAL TRGDAEGRAQ QRSDSP
|
| |