Gene Mpal_2606 details


Gene Information

Locus tag: Mpal_2606
Symbol:
ID: 7271874
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosphaerula palustris E1-9c
Kingdom: Archaea
Replicon accession: NC_011832
Strand:
Start bp: 2729608
End bp: 2730879
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 67%
IMG OID: 643571202
Product: 3-phosphoshikimate 1-carboxyvinyltransferase
Protein accession: YP_002467599
Protein GI: 219853167
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase
TIGRFAM ID: [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase
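
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, and that the CDS length includes the stop codon; the function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(2729608, 2730879)
print(length)                  # 1272
print(protein_length(length))  # 423
```

Both values agree with the Gene Length and Protein Length fields listed above.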


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.749464
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATCAGA CACTCCAGCA GCACGGGCCG GTGGACCTGG CCTTCACCGC TCCGCCCTCC 
AAGTCCTTCA CCCACCGGGC TCTGATCATC GCAGCCCTGG CTGACGGGGA GTCACTGATC
CGCGGCCCCC TGATCGCGGA GGACACCCTG TTGACAGTCC GGGCACTCCA GGCACTCGGT
GCCGATATCA CCGACACACC GGAAGGCTAC CGGGTGCAGG GGACCGACGG CCGGCCCGAC
TGCGCAGAGG GGACCGTCCT GGACCTAAAG AACTCGGGGA CGAGCCTCCG GCTGCTCAGT
TCGATCGCCC TCCTCTGCTC CAGCACCGCC GGGGTCACGC TGACCGGCTC CCCCCGGATG
CAGCAGCGGC CGATCGGAGA ACTCGGCGAC GCGATCCGAA CCCTCGGCGG CTCGGTCCGA
TACCTGGCCG CAGACGGTTA CCCGCCCTGT GTCGTGCAGG GGCCGCTCGT CGGCGGCGAA
GCGACCCTCG ACGGCTCGGT CAGCAGCCAG TTCATCTCGT CGCTGCTGCT GGCTGCCCCA
TATGCTGTCC GTCCGGTGGA CCTGAAGGTC GCCCGGCAGC CGGTCTCCCG ATCGTACCTG
GAGATCACCG GTGCGGTGAT GGCCGCGTTC GGGGTTCCGG TCAGGCGGGT GGGGTACACC
CACTTTACCG TCCAGCCGGC CCGGTACCGC GGACGGGAGT ACACGGTGGA GGGAGACTAC
TCGTCCGCCT CGTACTTCTT CGCCCTGGCC GCCACCCTCG GCGGGAAGGT GACGGTCAGA
AATCTGAATC ATGACTCGGT GCAGGGCGAC CGGCTCTTCG TTGCAGCGCT CAAAGCGATG
GGCTGCCGGG TCACCAGGGA GACCGACGGC GTCACCATCG AGCGGACCAA AAACCTCCAC
GGGATCTCCA TCGATATGAC CACAGCCCCT GACACGGTCC AGACCCTCGC GGTGGTGGCC
GCCCTGGCCG ACTCGCCGAC AACCATCACC GGGGTCGGCC ATCTCCAGTA CAAGGAGAGC
GACCGGGTGG CCGTGACCGC TGGGACCCTC AGAGCCCTCG GCTGCACCGT CGACATCAGC
GCCGACGCGA TCACCATTCA TCCCGGACCC CTTCATGGCG GGGTGATCGA CCCGCACGAC
GACCACCGGA CAGCGATGGC CTTCGCCGTC CTCGGGCTGG CCGTTGGTGA TGTCACGATC
GAAGACCCCG CCTGTGTCGG CAAGTCGTTC CCGAAATTCT GGAACGCACT CGCCGCAGGA
GGATTATTAT GA
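
A GC-content figure like the one reported for this gene is normally the fraction of G and C bases in the sequence. The following is a minimal sketch of that calculation, assuming the sequence is supplied in the 10-base blocked layout used above; `gc_content` is a hypothetical helper name, and the example input is only the first two blocks of the gene, so its result is not the whole-gene value.

```python
def gc_content(blocked_seq: str) -> float:
    """Fraction of G/C bases, ignoring the spaces and newlines of a blocked layout."""
    seq = "".join(blocked_seq.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two 10-base blocks of the gene sequence above (excerpt only).
print(gc_content("ATGGATCAGA CACTCCAGCA"))  # 0.5
```

Passing the full gene sequence to the same function would give the whole-gene fraction.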
 
Protein sequence
MDQTLQQHGP VDLAFTAPPS KSFTHRALII AALADGESLI RGPLIAEDTL LTVRALQALG 
ADITDTPEGY RVQGTDGRPD CAEGTVLDLK NSGTSLRLLS SIALLCSSTA GVTLTGSPRM
QQRPIGELGD AIRTLGGSVR YLAADGYPPC VVQGPLVGGE ATLDGSVSSQ FISSLLLAAP
YAVRPVDLKV ARQPVSRSYL EITGAVMAAF GVPVRRVGYT HFTVQPARYR GREYTVEGDY
SSASYFFALA ATLGGKVTVR NLNHDSVQGD RLFVAALKAM GCRVTRETDG VTIERTKNLH
GISIDMTTAP DTVQTLAVVA ALADSPTTIT GVGHLQYKES DRVAVTAGTL RALGCTVDIS
ADAITIHPGP LHGGVIDPHD DHRTAMAFAV LGLAVGDVTI EDPACVGKSF PKFWNALAAG
GLL
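
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is a hedged illustration: it builds the standard codon assignments, which translation table 11 shares for sense and stop codons (table 11 differs in its permitted start codons, which this sketch does not model). The `translate` helper is hypothetical and stops at the first stop codon.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(blocked_seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(blocked_seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGATCAGA CACTCCAGCA GCACGGGCCG"))  # MDQTLQQHGP
# Final 12 bases of the gene sequence, ending in the TGA stop codon.
print(translate("GGATTATTAT GA"))  # GLL
```

The two excerpts reproduce the first ten and last three residues of the protein sequence listed above.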