Gene Msed_1668 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1668 
Symboleno 
ID5105314 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1610052 
End bp1611302 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content47% 
IMG OID640507562 
Productphosphopyruvate hydratase 
Protein accessionYP_001191747 
Protein GI146304431 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0148] Enolase 
TIGRFAM ID[TIGR01060] phosphopyruvate hydratase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0010021 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAACG GTTTCTCGAT AGCTAATGTC CAGGGATATG AAATTATTGA TTCTAGAGGA 
AATTTAACTG TTAGAGCCAG GGTTACCTTG GAGTCCGGGA TAAGGGCTAC TGGAGACGCG
CCCTCAGGAG CGTCAAAGGG GACTCGGGAG GCCGTGGAAC TTAGGGACAA GGACGGCTCG
GTTAAGGGAG CCGTGGATTC CATAAACTAC TACATTTCAC CTGCCCTCAT GGGTCTAGAT
GTGAGGGAAC AGGGAAAAAT AGACAGGATA ATGATAGAGT TGGACGGGAC CGAGAATAAG
TCCAGGCTGG GAGCCAATGC AACCATAGCT ACGTCTATCG CAGTCGCCAA AACTGCGTCA
ATTTCCATGG GATTGGAACC TTTCATGTAT ATTGGCGGAG CCAGGACGCA TACCTTGCCT
GTACCCCTCC TCAACATCTT GAATGGGGGT CTGCACGCTG GTAACATGTT AAAGATCCAG
GAGTTCATGG TTATTCCGGT GAAATTTGAC ACTCTAAAGG AGGCCTTAAT TGCTTCTACC
AAGATTTATA AGACCCTCAA GTCGCTCGTT ACGGAGAGAT ACGGAAAGAT ATACACGGCT
TTAGGAGACG AAGGGGGTAT CTCCCCACCG CTCAGCGTTA CTGAGGACGC CCTAAAGCTG
GTGCATGAGG CGATCAAGAG GTCAGGAATG GAGGGGAGGG TCTTCATGGG GATGGACGCT
GCAGCCTCGG ATTTTTACAA CCCTGAGAAA GGGGTGTATG AAATAGATAA CACGAGTAAG
TCCCCGGACG AAATGATAGA GTTTTATGTT GATATAGCTA GCCGTTACCC CTTACTATAT
CTGGAAGATC CTTTCGAGGA GAACGATTTC AGCAGATATT CAGAGTTGCA GAGCAGAATT
AAGAACGTGA TAGTCACTGG AGATGACCTG TTCACCACTA ACGTGAGATA CCTTAGAAAG
GGAATCGAGA TGAAATCCGC TAGAGGAGTT ATTGTTAAGG CTAACCAGAT TGGAACGCTG
ACTGAAACCA TTCAATTCTT CGATCTAGCT AAGGACAACT CCATTAAGAC CGTAGTTAGT
CATAGGAGCG GTGAAACCGA AGACAGCTTC ATTGCCGACC TCGCCGTAGG CCTAAACAGT
GACTTCATAA AAACAGGAGC CCCTTCCAGA GGAGAAAGGA CATCAAAATA TAATAGATTA
CTTGAAATTG AAAATGAGTT CGGACTTGAA TATCTCGGTA GAAGGCTCTA G
 
Protein sequence
MMNGFSIANV QGYEIIDSRG NLTVRARVTL ESGIRATGDA PSGASKGTRE AVELRDKDGS 
VKGAVDSINY YISPALMGLD VREQGKIDRI MIELDGTENK SRLGANATIA TSIAVAKTAS
ISMGLEPFMY IGGARTHTLP VPLLNILNGG LHAGNMLKIQ EFMVIPVKFD TLKEALIAST
KIYKTLKSLV TERYGKIYTA LGDEGGISPP LSVTEDALKL VHEAIKRSGM EGRVFMGMDA
AASDFYNPEK GVYEIDNTSK SPDEMIEFYV DIASRYPLLY LEDPFEENDF SRYSELQSRI
KNVIVTGDDL FTTNVRYLRK GIEMKSARGV IVKANQIGTL TETIQFFDLA KDNSIKTVVS
HRSGETEDSF IADLAVGLNS DFIKTGAPSR GERTSKYNRL LEIENEFGLE YLGRRL