Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mthe_1029 |
Symbol | |
ID | 4462778 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | - |
Start bp | 1112343 |
End bp | 1113608 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 639700048 |
Product | 3-phosphoshikimate 1-carboxyvinyltransferase |
Protein accession | YP_843454 |
Protein GI | 116754336 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase |
TIGRFAM ID | [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.650861 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGTAT CCGTCTCACG CTCCTCCATC TCGGGAACCG TCTGCGCACC GCCATCGAAG AGCTACACGC ACAGAGCTGT GCTCATCACA GCGCTATCAG ATTCAGGATG CGTGCACAGG CCGCTCATAT CTGCAGACAC AAGAGCCACG ATATCCGCGT GTGATGCCTT CGGGGCCGAT GTGAAGCTGC GGGGGGACTC TCTGGAGATC CAGGGAGTCT CAGGCGCGCC CAGAACCCCT GAGAACGTCA TAGATGTTCT GAACTCCGGG ACGACGCTAA GGTTCATGTC AGCAGTGGCG GCGCTCACAG ATGGCGCGGT CCTCACAGGC GACAGCTCGA TACGCAGCAG GCCGAACGGC CCACTCCTCA AGGCGCTCAA TGAGCTGGGC GCCGAGGCGT TCTCCATACG GGGAAACGAT CGCGCTCCGC TTGTCATAAG AGGACGGCTC AGAGGCGGAT CGACATCTCT GGATGGCAGC GTGAGCTCGC AGTTCCTTTC AGCCCTTCTG ATTGCATGCC CCCTCTCCAG CGGCGAGACG ACGATCTCGA TAAAGGGCGA GCTGAAGTCC AGGCCGTACG CGGAGATGAC CCTGGACATC CTCAGGAAGG CCGGAGCCGA GATATGCACT GATGGAGACA TCTTCCGCAT GCGTGGAGGC CAGAGTTACA GGCTTGCGGA GTACACAGTG CCGGGTGATT TCTCGTCTGC ATCTTATCCG CTGGCTGCAG CCGCGCTCGC GGGATCTGCG ACTGTGGAGG GTCTGTTCCC GTCGAGGCAG GGCGACTCTG CGATAGTCGA TATTCTCAGA GAGATGGGCG CTGAGGTCTC GTGGGACATG GAGAGTGGAG AGGTGAGGGT GTCTGGAGCT GATCTGAGGG GCAGAGAGAT AGATGCGAGC CAGACTCCAG ATCTCGTGCC AACGCTCGCG GTGCTTGGAG CAGTGGCTGA GGGGCGTACG GTCATAAAAA ACGCGGAGCA TGTCAGACAC AAGGAGACCG ACAGGATTCA TGCGATGGCG GTCGAGCTGA AGAAGATGGG CGCGAACATA CGGGAAAGGC CTGATGGCCT GGAGATCGAT GGGGGCGATC TGCACGGTGC GGATCTCCAC GGCTACCACG ACCACAGGAT CGTGATGGCA CTGACCCTAG CAGGGATCGT GGCAGGCGAT ACCAGGATTG ATACAGCGGA ATCGGTCGAT GTATCATATC CGGGATTCTT CGAGGATATG AGAAGGCTTG GAGCGAACGT GCGCGCCTCC ACATAA
|
Protein sequence | MIVSVSRSSI SGTVCAPPSK SYTHRAVLIT ALSDSGCVHR PLISADTRAT ISACDAFGAD VKLRGDSLEI QGVSGAPRTP ENVIDVLNSG TTLRFMSAVA ALTDGAVLTG DSSIRSRPNG PLLKALNELG AEAFSIRGND RAPLVIRGRL RGGSTSLDGS VSSQFLSALL IACPLSSGET TISIKGELKS RPYAEMTLDI LRKAGAEICT DGDIFRMRGG QSYRLAEYTV PGDFSSASYP LAAAALAGSA TVEGLFPSRQ GDSAIVDILR EMGAEVSWDM ESGEVRVSGA DLRGREIDAS QTPDLVPTLA VLGAVAEGRT VIKNAEHVRH KETDRIHAMA VELKKMGANI RERPDGLEID GGDLHGADLH GYHDHRIVMA LTLAGIVAGD TRIDTAESVD VSYPGFFEDM RRLGANVRAS T
|
| |