Gene Msed_0582 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0582 
Symbol 
ID5103742 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp535982 
End bp537427 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content47% 
IMG OID640506486 
Producthypothetical protein 
Protein accessionYP_001190681 
Protein GI146303365 
COG category[S] Function unknown 
COG ID[COG3372] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.566185 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.500457 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTACCAA GTGAACTTGC CAGGTTCAGG ATTTACGGAG ACGTTATCTA TCCCTCATTC 
GCTGGGGAAA GGGAAGTTGA ACTAGTCAAG GAGATGATTC CCTTCTATAA GGTTGGGATC
ACGTACGGTG AGGTTGAGGA GAACGTAAAA CTCATGGAAA GGATGTACGG TAAGACGACT
CAGGGAGCTA AGCTGGTGAG AGGTCTTCAT AGGGTCATCT CGAGGTACCT TACCCTATCT
GGTGAATCTC CCGTGGATCC CAGGAAGATA AGGGAGGAGC TCTTTGCTAA GGGACCTGCC
CTGACTCCAG AGGAAAGGGA AAGGAGACTA AGGGAGGTAA GCGAAAGGCT AGGGGTAGAC
GCTGAGAGGT TCATGTTTGG GGATATGGAT GAGAGCAAGG TAATCTCCAG GGTTGAGATA
CCGCCTCCGG AGGACATAGT TAAGGAGTAT AACCTTTCCC TACTGCAGAC AATACTTTTC
AGGTCGTACA AGGTAACCTT GACCACTGAG GGGAACTGGA AGGAACTTCT GAGAACTGTA
AAGAGACTTG GGTTGATGTA TACTGCGTAC TCTAACCCCG TGAGGATAGA GGTGATGGGT
CCCTATACCC TTCTGAAGCC CTCAGAGAAA TACGGAAGGA ATCTGGCTAT CCTGGTTCCT
TACGTGATTG GGACTGGAGG ATGGAGCATT GAGGCTGAAA TCATTCTAGG AAAGAGGAAG
AGAAGGGTTT ACCAGATGAA GGTAAGCAAC AACGAATGGA TTGGAGGAAG GCCAGAACAG
GGAAAGCTCT TTGATAGTTC AGTGGAGGAG GACTTCTACT GGAACTTCAG GGGAACAATC
AAGGATTGGA AACTGGAAAG GGAACCTGGA CCGCTCGTGG TCAACGGCAG GATATTCCTC
CCCGATTTCC TAGCTATCAA GGACGAGATA AGGGTCTACC TTGAGGTTGT GGGCTTTTGG
ACAGAGGAAT ACCTGAGGGA GAAGGTTAAG AAGCTTCAGG GAACTCAAGC TCTCGTTATA
CCCATTGTGA GTGAGGAACT TGGATCCGGG AAGATAGGCG ATTTACCAGT TATTACTTTT
AAGAGAAAGA TTGATCCAAC CAAAGTTTAC CGTGTTTTAA GGGAAATTGA GCAGTCGCTC
CCAACTAAGA AGGTTGAATA TGAGCTAGAT GGGAGTGACG TAATATCCAT AAAGGAGCTG
GCGATGAAGT ATGGTATCTC TGAGAATCTC CTCCGAAAAA ATCTGAGGGA ATTTCCAGGT
TACGTTCTTC TCAAAAACTA CTACGTAAGC CAGAAGCTTA TGGAACAACT CTCCAGGGAG
AATTTTTCAG GTAGAAAACT TCAGGAAGTC GTGAAGGAGA GAGGAGATTT CATAACCGAG
GTCTTGGATA AGCTAGGATA TAAGATAAAG TGGATAAACA TTGCAGACGC GGTGATCACA
AAGTGA
 
Protein sequence
MLPSELARFR IYGDVIYPSF AGEREVELVK EMIPFYKVGI TYGEVEENVK LMERMYGKTT 
QGAKLVRGLH RVISRYLTLS GESPVDPRKI REELFAKGPA LTPEERERRL REVSERLGVD
AERFMFGDMD ESKVISRVEI PPPEDIVKEY NLSLLQTILF RSYKVTLTTE GNWKELLRTV
KRLGLMYTAY SNPVRIEVMG PYTLLKPSEK YGRNLAILVP YVIGTGGWSI EAEIILGKRK
RRVYQMKVSN NEWIGGRPEQ GKLFDSSVEE DFYWNFRGTI KDWKLEREPG PLVVNGRIFL
PDFLAIKDEI RVYLEVVGFW TEEYLREKVK KLQGTQALVI PIVSEELGSG KIGDLPVITF
KRKIDPTKVY RVLREIEQSL PTKKVEYELD GSDVISIKEL AMKYGISENL LRKNLREFPG
YVLLKNYYVS QKLMEQLSRE NFSGRKLQEV VKERGDFITE VLDKLGYKIK WINIADAVIT
K