Gene Msed_0958 details

Gene Information

Locus tag: Msed_0958
Symbol:
ID: 5104510
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 884477
End bp: 885676
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 53%
IMG OID: 640506860
Product: hypothetical protein
Protein accession: YP_001191053
Protein GI: 146303737
COG category: [S] Function unknown
COG ID: [COG1602] Uncharacterized conserved protein
TIGRFAM ID:
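
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. Both assumptions are consistent with the values listed, but the record itself does not state them. The function names are hypothetical.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp):
    """Residue count of a CDS, assuming the final codon is an untranslated stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(884477, 885676)
print(length)                     # 1200
print(protein_length_aa(length))  # 399
```

Both results match the Gene Length and Protein Length fields of this record.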


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0243215
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTATGTGG ATCCCCACCT ATGTATTGCC TGTAGGGGAG CCAAGTACCT TTGCGGTCTC 
AGTTATTGTC CTGTCCTAGT TAAGAACCTC TCCATGAAGG TAAAGGTGGG GAAGTTTGTC
GAGGGAGATT CTCCTCCCTC CGTCTTCGTG GGAAGGTTTG GTTATCCCAA GATTACCGTA
TATCCTTCGA CTCCCCCAGA GTTTGGAGAC ACTTCCATGT ACGAGGATCC TAGAGCCTGG
TTAGCCATGG ACATCAACAG GTTCTTGGCC ATGAGGATGT CCGTGGTACG AGGTGGAATT
CAGTTCAAGG TCAGTGAGGC TAGAGCTCCT GGAAGGGAGT TGTACGACGT TCAGGTGGCC
TCCCTCTCTC CTAGGCCTGT GGAAATGGAG CTTGACCTGG AGTCCGTTCC AAGGGGGAGA
GTTCTAAGCG AGACGGTTCC CCCTCTAGGT CCCTCAGCTC CCCTGAAGAG GCTTAGGCTA
GGCGCTCTAC CCCCTCCTGA GAGGGTAGTG GAGAAAGTCT TCCAGGAGAG GGACATGAAG
GCAGGAAAGG CAATAGAGAG GTTATACAGT GACGGTATTC CCGTGGAGAG GATAGCCCGC
CTTCTCAGCG TGGGTAACCT AGGTGTGGAG AGGAAGCTTG TCCCCACCAG GTGGAGTATT
ACGGCAGTGG ACAAGACTCT GTCGGACCTC CTCGTGAGGA AGATCAAGGA GTACCCCTCA
ATTGACCAGA TAGAGGTATA CGTGAGGAAG TTCAGGCTTA ACACCTTCGT GGCAATCCTG
GTTCCTGGTG AGTGGGCATT TGAGTGGGGA GAGGCGTGGT TCCCATCAAC GACGTGGAAC
ATGTGGGGGA GCTCGCCTCA GGTAGAGGTT GACTACGAGG GATATTTTGG GAGGAGAACA
TATCCTGATA TAGGTGGATG TTATTACTCC TCTAGGCTGG CCGTAGCTGA ACACCTGGAA
AGGAGGAGGA GACAGGCAAT TCCGATCCTG TGGAGGGAGA TCTATCCAGG TTTCTACTTC
CCTGTTGGAG TGTGGTTCGT CAGAGAAAAC GTCAGGGAAT TGCTGAGGGG TGAGAGCGTG
AAGTTCGACA CACTGAGCGA AGCGTTGAAG TTCCTTGAGG GAGTACTCAA GGTCAGTCCT
CACGAGTGGG CTAAACATTC TGGATTGATT CCCATGATAA GGTCGAGGTT ATTCCCATGA
 
Protein sequence
MYVDPHLCIA CRGAKYLCGL SYCPVLVKNL SMKVKVGKFV EGDSPPSVFV GRFGYPKITV 
YPSTPPEFGD TSMYEDPRAW LAMDINRFLA MRMSVVRGGI QFKVSEARAP GRELYDVQVA
SLSPRPVEME LDLESVPRGR VLSETVPPLG PSAPLKRLRL GALPPPERVV EKVFQERDMK
AGKAIERLYS DGIPVERIAR LLSVGNLGVE RKLVPTRWSI TAVDKTLSDL LVRKIKEYPS
IDQIEVYVRK FRLNTFVAIL VPGEWAFEWG EAWFPSTTWN MWGSSPQVEV DYEGYFGRRT
YPDIGGCYYS SRLAVAEHLE RRRRQAIPIL WREIYPGFYF PVGVWFVREN VRELLRGESV
KFDTLSEALK FLEGVLKVSP HEWAKHSGLI PMIRSRLFP
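
The gene sequence above is printed in blocks of 10 bases, so the spacing has to be removed before any computation. The sketch below shows one way to do that, then computes GC content and runs a basic reading-frame check. The helper names are hypothetical. The start-codon test is a simplification, since translation table 11 also permits alternative start codons. Only the first line of the gene sequence is used for illustration.

```python
# Stop codons of translation table 11 (same set as the standard code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(seq):
    """Simplified check: whole codons, ATG start, terminal stop codon."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# First line of the gene sequence, copied from the record.
first_line = clean_sequence(
    "ATGTATGTGG ATCCCCACCT ATGTATTGCC TGTAGGGGAG CCAAGTACCT TTGCGGTCTC"
)
print(len(first_line))         # 60
print(gc_percent(first_line))  # GC percentage of this one line only
```

To check the whole gene, pass all twenty lines of the gene sequence to `clean_sequence` as one string, then call `gc_percent` and `looks_like_complete_cds` on the result.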