Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpal_2606 |
Symbol | |
ID | 7271874 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | - |
Start bp | 2729608 |
End bp | 2730879 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643571202 |
Product | 3-phosphoshikimate 1-carboxyvinyltransferase |
Protein accession | YP_002467599 |
Protein GI | 219853167 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase |
TIGRFAM ID | [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.749464 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATCAGA CACTCCAGCA GCACGGGCCG GTGGACCTGG CCTTCACCGC TCCGCCCTCC AAGTCCTTCA CCCACCGGGC TCTGATCATC GCAGCCCTGG CTGACGGGGA GTCACTGATC CGCGGCCCCC TGATCGCGGA GGACACCCTG TTGACAGTCC GGGCACTCCA GGCACTCGGT GCCGATATCA CCGACACACC GGAAGGCTAC CGGGTGCAGG GGACCGACGG CCGGCCCGAC TGCGCAGAGG GGACCGTCCT GGACCTAAAG AACTCGGGGA CGAGCCTCCG GCTGCTCAGT TCGATCGCCC TCCTCTGCTC CAGCACCGCC GGGGTCACGC TGACCGGCTC CCCCCGGATG CAGCAGCGGC CGATCGGAGA ACTCGGCGAC GCGATCCGAA CCCTCGGCGG CTCGGTCCGA TACCTGGCCG CAGACGGTTA CCCGCCCTGT GTCGTGCAGG GGCCGCTCGT CGGCGGCGAA GCGACCCTCG ACGGCTCGGT CAGCAGCCAG TTCATCTCGT CGCTGCTGCT GGCTGCCCCA TATGCTGTCC GTCCGGTGGA CCTGAAGGTC GCCCGGCAGC CGGTCTCCCG ATCGTACCTG GAGATCACCG GTGCGGTGAT GGCCGCGTTC GGGGTTCCGG TCAGGCGGGT GGGGTACACC CACTTTACCG TCCAGCCGGC CCGGTACCGC GGACGGGAGT ACACGGTGGA GGGAGACTAC TCGTCCGCCT CGTACTTCTT CGCCCTGGCC GCCACCCTCG GCGGGAAGGT GACGGTCAGA AATCTGAATC ATGACTCGGT GCAGGGCGAC CGGCTCTTCG TTGCAGCGCT CAAAGCGATG GGCTGCCGGG TCACCAGGGA GACCGACGGC GTCACCATCG AGCGGACCAA AAACCTCCAC GGGATCTCCA TCGATATGAC CACAGCCCCT GACACGGTCC AGACCCTCGC GGTGGTGGCC GCCCTGGCCG ACTCGCCGAC AACCATCACC GGGGTCGGCC ATCTCCAGTA CAAGGAGAGC GACCGGGTGG CCGTGACCGC TGGGACCCTC AGAGCCCTCG GCTGCACCGT CGACATCAGC GCCGACGCGA TCACCATTCA TCCCGGACCC CTTCATGGCG GGGTGATCGA CCCGCACGAC GACCACCGGA CAGCGATGGC CTTCGCCGTC CTCGGGCTGG CCGTTGGTGA TGTCACGATC GAAGACCCCG CCTGTGTCGG CAAGTCGTTC CCGAAATTCT GGAACGCACT CGCCGCAGGA GGATTATTAT GA
|
Protein sequence | MDQTLQQHGP VDLAFTAPPS KSFTHRALII AALADGESLI RGPLIAEDTL LTVRALQALG ADITDTPEGY RVQGTDGRPD CAEGTVLDLK NSGTSLRLLS SIALLCSSTA GVTLTGSPRM QQRPIGELGD AIRTLGGSVR YLAADGYPPC VVQGPLVGGE ATLDGSVSSQ FISSLLLAAP YAVRPVDLKV ARQPVSRSYL EITGAVMAAF GVPVRRVGYT HFTVQPARYR GREYTVEGDY SSASYFFALA ATLGGKVTVR NLNHDSVQGD RLFVAALKAM GCRVTRETDG VTIERTKNLH GISIDMTTAP DTVQTLAVVA ALADSPTTIT GVGHLQYKES DRVAVTAGTL RALGCTVDIS ADAITIHPGP LHGGVIDPHD DHRTAMAFAV LGLAVGDVTI EDPACVGKSF PKFWNALAAG GLL
|
| |