Gene Msed_2013 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_2013 
Symbol 
ID5105235 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1945192 
End bp1946379 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content50% 
IMG OID640507901 
Producthydroxypyruvate reductase 
Protein accessionYP_001192077 
Protein GI146304761 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2379] Putative glycerate kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACCAAG TAATAGACAG GATACTAAGG CTCTCGGATC CGAGGCAGGC GTTAGAGAGG 
AAAGTTCAGG TAAGAAATAA CGAGCTGGTC GTGGAGGGCT CAAGGTTTCA GTTTTCAAAA
CCCCTTGTTA TTTCCGTAGG GAAGGCCTCC GTAAAGATGG CCAACTTCTT CCTGGAGAAG
CTAAGCAACT ACGAGGCAAT AGTTGTTAAA CCAAAGGGGG ACAATCTTAC GGTTGAGGGC
CACGCCCAGA TAATACATTC GTCTCATCCG TATCCAGACG ATGAGAGCTT CAGGGCTGGA
AAATTGGTAA GGGAAGCCCT CATGACGTGG GACTACGACC TAGTTATATT TCTCCTGTCA
GGGGGAGCCT CATCATTGAT GGAGGATCCA ATACCTCCTG CTCAACTCTA TGTGGAGACC
ATGAAGAAAC TGGTGACCTC AGGGCTAGGG ATAGATGAGA TTAACACTGT CAGGAAGCAT
CTTTCCAGGG TTAAGGGAGG CAGACTAGCT AGTTTGGCGA GATCTCAAGT AGTTACCCTG
GTGGTAAGTG ATGTCCCTGG AAACGATTTA TCCGTTGTGG GGAGCGGACC CACAATTCAA
GATCCCTCAA CGGTGGACGA GGCTAGGCAG ATCTTGGAAC AGCTAAATCT AGATTTAGTG
CAGTACCTAG AAGAGACGCC AAAGCAACTA AGCAACTCCT GGGTGTTTCT CATTCAGAGC
GTGAGCGACG TTCTCAAGGA TCTCACTGAT ATGCCAGGGG CGGTAATCCT CTCCAGTGAG
GTTAGGGGAG AGGCTAGGTC CCTGGGAAGC CTACTAGCCT CCATCGTGAA CACAAGGGAA
TTGAGCTTTA GGAGACCCTT CACAATTCTT CTTGGTGGTG AACCTGAGGT TACGGTAAGG
GGTCCTGCAG GCAAGGGCGG AAGAAACGGT GAGGTTTGCC TGTCCTTCCT GGAGTGGGTG
AAGGTCACCA ACGTTACCCT GTACGCCGTA GCTACTGACG GTATTGATGG TAACAGCGAA
TACGCTGGAT GTGTGGTTTC GGGAGGAATG GACGTACCAA AGAGGGAGAT AAGGAAAGCC
TTAGAAACTC ACTCTTCCTA CGAGTTGCTT GAGAGAATTG GGGCAGTAAT TAAAACGGGC
CCCACGGGGA CTAACGTAAA CAACGTTTAT GTCCTTATAG CTCCTTGA
 
Protein sequence
MDQVIDRILR LSDPRQALER KVQVRNNELV VEGSRFQFSK PLVISVGKAS VKMANFFLEK 
LSNYEAIVVK PKGDNLTVEG HAQIIHSSHP YPDDESFRAG KLVREALMTW DYDLVIFLLS
GGASSLMEDP IPPAQLYVET MKKLVTSGLG IDEINTVRKH LSRVKGGRLA SLARSQVVTL
VVSDVPGNDL SVVGSGPTIQ DPSTVDEARQ ILEQLNLDLV QYLEETPKQL SNSWVFLIQS
VSDVLKDLTD MPGAVILSSE VRGEARSLGS LLASIVNTRE LSFRRPFTIL LGGEPEVTVR
GPAGKGGRNG EVCLSFLEWV KVTNVTLYAV ATDGIDGNSE YAGCVVSGGM DVPKREIRKA
LETHSSYELL ERIGAVIKTG PTGTNVNNVY VLIAP