Gene Mthe_1029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_1029 
Symbol 
ID4462778 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp1112343 
End bp1113608 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content59% 
IMG OID639700048 
Product3-phosphoshikimate 1-carboxyvinyltransferase 
Protein accessionYP_843454 
Protein GI116754336 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0128] 5-enolpyruvylshikimate-3-phosphate synthase 
TIGRFAM ID[TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.650861 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCGTAT CCGTCTCACG CTCCTCCATC TCGGGAACCG TCTGCGCACC GCCATCGAAG 
AGCTACACGC ACAGAGCTGT GCTCATCACA GCGCTATCAG ATTCAGGATG CGTGCACAGG
CCGCTCATAT CTGCAGACAC AAGAGCCACG ATATCCGCGT GTGATGCCTT CGGGGCCGAT
GTGAAGCTGC GGGGGGACTC TCTGGAGATC CAGGGAGTCT CAGGCGCGCC CAGAACCCCT
GAGAACGTCA TAGATGTTCT GAACTCCGGG ACGACGCTAA GGTTCATGTC AGCAGTGGCG
GCGCTCACAG ATGGCGCGGT CCTCACAGGC GACAGCTCGA TACGCAGCAG GCCGAACGGC
CCACTCCTCA AGGCGCTCAA TGAGCTGGGC GCCGAGGCGT TCTCCATACG GGGAAACGAT
CGCGCTCCGC TTGTCATAAG AGGACGGCTC AGAGGCGGAT CGACATCTCT GGATGGCAGC
GTGAGCTCGC AGTTCCTTTC AGCCCTTCTG ATTGCATGCC CCCTCTCCAG CGGCGAGACG
ACGATCTCGA TAAAGGGCGA GCTGAAGTCC AGGCCGTACG CGGAGATGAC CCTGGACATC
CTCAGGAAGG CCGGAGCCGA GATATGCACT GATGGAGACA TCTTCCGCAT GCGTGGAGGC
CAGAGTTACA GGCTTGCGGA GTACACAGTG CCGGGTGATT TCTCGTCTGC ATCTTATCCG
CTGGCTGCAG CCGCGCTCGC GGGATCTGCG ACTGTGGAGG GTCTGTTCCC GTCGAGGCAG
GGCGACTCTG CGATAGTCGA TATTCTCAGA GAGATGGGCG CTGAGGTCTC GTGGGACATG
GAGAGTGGAG AGGTGAGGGT GTCTGGAGCT GATCTGAGGG GCAGAGAGAT AGATGCGAGC
CAGACTCCAG ATCTCGTGCC AACGCTCGCG GTGCTTGGAG CAGTGGCTGA GGGGCGTACG
GTCATAAAAA ACGCGGAGCA TGTCAGACAC AAGGAGACCG ACAGGATTCA TGCGATGGCG
GTCGAGCTGA AGAAGATGGG CGCGAACATA CGGGAAAGGC CTGATGGCCT GGAGATCGAT
GGGGGCGATC TGCACGGTGC GGATCTCCAC GGCTACCACG ACCACAGGAT CGTGATGGCA
CTGACCCTAG CAGGGATCGT GGCAGGCGAT ACCAGGATTG ATACAGCGGA ATCGGTCGAT
GTATCATATC CGGGATTCTT CGAGGATATG AGAAGGCTTG GAGCGAACGT GCGCGCCTCC
ACATAA
 
Protein sequence
MIVSVSRSSI SGTVCAPPSK SYTHRAVLIT ALSDSGCVHR PLISADTRAT ISACDAFGAD 
VKLRGDSLEI QGVSGAPRTP ENVIDVLNSG TTLRFMSAVA ALTDGAVLTG DSSIRSRPNG
PLLKALNELG AEAFSIRGND RAPLVIRGRL RGGSTSLDGS VSSQFLSALL IACPLSSGET
TISIKGELKS RPYAEMTLDI LRKAGAEICT DGDIFRMRGG QSYRLAEYTV PGDFSSASYP
LAAAALAGSA TVEGLFPSRQ GDSAIVDILR EMGAEVSWDM ESGEVRVSGA DLRGREIDAS
QTPDLVPTLA VLGAVAEGRT VIKNAEHVRH KETDRIHAMA VELKKMGANI RERPDGLEID
GGDLHGADLH GYHDHRIVMA LTLAGIVAGD TRIDTAESVD VSYPGFFEDM RRLGANVRAS
T