Gene Msed_0157 details

Gene Information

Locus tag: Msed_0157
Symbol:
ID: 5105010
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 126507
End bp: 128312
Gene Length: 1806 bp
Protein Length: 601 aa
Translation table: 11
GC content: 42%
IMG OID: 640506060
Product: hypothetical protein
Protein accession: YP_001190258
Protein GI: 146302942
COG category: [K] Transcription
COG ID: [COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP
TIGRFAM ID:
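
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_residues(gene_length_bp: int) -> int:
    """Residue count, assuming the last codon of the CDS is a stop codon."""
    return gene_length_bp // 3 - 1


# Values copied from the record above.
print(span_length(126507, 128312))  # 1806, matches Gene Length
print(expected_residues(1806))      # 601, matches Protein Length
```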


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0273927
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTATAA AGTTATCAAG AAAACTTTCA ATGAGCTATA TTGATCTAGT GGCCTGGTTA 
GCTGAGAATA GGGAGAGGTT GGTCGGTTGT AGAATAGATA ACATTTTTGC TACTAATTTA
CCGCATCTGT ACATCTTCGT TATTCATTGC CACAACGGCG ATTCTCAGCT TGTGATCCAG
CCCGGTAAAA GGGTACACTT TACAAAGTTT AACCATGAGA GACTGCTTGA TTCGAAGGCA
AAGATGCTAC GTGAGTTAAT AAGGGGTGAA CTGATAGAAG ACGTTGATGT AGTTAATGGA
GAAAGAATTT TAAGAATGAA ACTAAAGGAC AAAATTGTAT ATATAGAGCT TCTTCCAAAG
GGTACGTTAA TTGTGACGGA TAACGATAAT AGAATCAAGT TTGCACTGGA ACAAAGGGAG
TTCAAGGACA GAACACTGAA GCCTGGGGAG CTCTACGTTC TACCTCCGTC TCCAACAGAG
CTAAAACCTA ACGAGATAGA AAGCTATCTA AAGAAGGGCG CCCTTTCCAG GGTTCTAGGG
GTTCCACAGG AATTTCTTAA TATTCTCTCA ATAAATGCAA ATAATCTAGA TGAATTGGAG
GAAGCTAAGA AAAAACTGGA AAAAGTTATG CAAGATATAC AACACGGCGT AATTCAGCCT
TGCGTGGATT TGGAGAGAAC GGTGTGGCCA GTTCGCTTTC CGGGATGTAC CGAATTACCC
AGTTATAATG AGGCCTTGGA CAACTATTTC ACCTCGCTGG AAAAGGCGGA GCTTGAGAAA
CTGGTTGATG AGGGGGAAGA GAAGAAGCTT GAGGCCACAA TCTCCAAGCT CAAGGAGACT
TTGACTAAGA TGGAGGAGGA AGCTGAGACT TTGAGGAAGA AAGGTAAGGC AATAATGAAT
AATTATCTAG AGGTTGAGGA AAAGATTAAG GAGGGTGCGA AGGAAATAGA GATTGAAGGG
TTAAAGATAG AGATAGATCC CAAGATTTCA GCTTCTAAAA ACGCATCTCA ATATTTTGAA
AAGGCCAAGG AATTAGATGC AAAGATAAGG AGGACAAGGG AGACAATCGA GGAGTTGGAA
AAGAAAAAAC AGGAAATTAA GGCTAAGTCT AAGGAGACCA TTGAAGGAAG CAAGATTCTG
GTAAGAAAGA AGGAGTGGTA TGAAAGGTAT CATTGGACCA TCACATCTAA TGGTTTCATT
GTGATAGCTG GGAGGGATAT TGACCAGAAT GAGAGTATTG TGAGGAAAAT GCTAGAGGAC
AAGGATATCT TTTTGCATGC AGATATCCAG GGGGCTCCAG CCACTGTGAT TAAGAATCCA
GTTGGTATAG GAGAGCAGGA TCTAATGGAC GCTGCAGTGT TGGCAGGTTG TTACTCTAAA
GCGTGGAAAT TGGGGCTAGC TAGCATAGAT GTGTTTTGGG TTTACGGAGA GCAGGTCTCA
AAGTCGCCAC CCTCAGGCGA ATATCTGCCC AAGGGATCCT TCATGATCTA TGGGAAAAAG
AATTACATCA AGAACGTGAA ACTAGAGTTG ACAATTGGGG TAAACGTGGA AAGCGATTTC
AGGATTGAGG TGGGTTCATT TGAAGCTATT TCCAAAAGAT GTAAGGTATT TGTCACCATA
ACTCCAGGAG ATTCTGATCC AGAAAAACTA GGAGATAGAA TCAGCAGGAT ATTCGCTAGG
GAACTGGGTG TGGATGGGGT TAAGGCTCTG AAGGATGAAA TAGTGAGGAT GATCCCAGGG
AAATCCAAAA TTAAGGGCAC AACACACCAG CTGGCTAACT CAACCGGATT GAATCTTAAG
GATTAA
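
The gene sequence above is laid out in blocks of ten bases, so it needs to be cleaned before its length or GC content can be computed. The following is a minimal sketch with hypothetical helper names; it is not how the database itself derives the GC content field, and it uses only the first line of the sequence as demo input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used as a short demo input.
demo = "ATGTCTATAA AGTTATCAAG AAAACTTTCA ATGAGCTATA TTGATCTAGT GGCCTGGTTA"
seq = clean_sequence(demo)
print(len(seq), round(gc_fraction(seq), 2))
```

Pasting the full gene sequence into `demo` should give a length that can be compared with the Gene Length field and a GC fraction that can be compared with the GC content field.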
 
Protein sequence
MSIKLSRKLS MSYIDLVAWL AENRERLVGC RIDNIFATNL PHLYIFVIHC HNGDSQLVIQ 
PGKRVHFTKF NHERLLDSKA KMLRELIRGE LIEDVDVVNG ERILRMKLKD KIVYIELLPK
GTLIVTDNDN RIKFALEQRE FKDRTLKPGE LYVLPPSPTE LKPNEIESYL KKGALSRVLG
VPQEFLNILS INANNLDELE EAKKKLEKVM QDIQHGVIQP CVDLERTVWP VRFPGCTELP
SYNEALDNYF TSLEKAELEK LVDEGEEKKL EATISKLKET LTKMEEEAET LRKKGKAIMN
NYLEVEEKIK EGAKEIEIEG LKIEIDPKIS ASKNASQYFE KAKELDAKIR RTRETIEELE
KKKQEIKAKS KETIEGSKIL VRKKEWYERY HWTITSNGFI VIAGRDIDQN ESIVRKMLED
KDIFLHADIQ GAPATVIKNP VGIGEQDLMD AAVLAGCYSK AWKLGLASID VFWVYGEQVS
KSPPSGEYLP KGSFMIYGKK NYIKNVKLEL TIGVNVESDF RIEVGSFEAI SKRCKVFVTI
TPGDSDPEKL GDRISRIFAR ELGVDGVKAL KDEIVRMIPG KSKIKGTTHQ LANSTGLNLK
D
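
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an assumption-laden illustration rather than the database's own pipeline: the `translate` helper is hypothetical, and it relies on the fact that NCBI translation table 11 assigns amino acids to codons exactly as the standard code does (the two tables differ only in the set of permitted start codons).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence codon by codon; unknown codons become 'X'."""
    seq = "".join(seq.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3))


# First 30 bases of the gene sequence above.
print(translate("ATGTCTATAA AGTTATCAAG AAAACTTTCA"))  # prints MSIKLSRKLS
```

The output matches the first ten residues of the protein sequence; the final six bases of the gene, `GATTAA`, translate to `D*`, which agrees with the terminal D followed by a stop codon.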