Gene Msed_0967 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0967 
Symbol 
ID5104518 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp893902 
End bp895290 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content46% 
IMG OID640506868 
Productselenium-binding protein 
Protein accessionYP_001191061 
Protein GI146303745 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0835459 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.405151 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAATAC TGCCACCATT CAGGAGGGAT CCAACGTTCT ATCCATCCCC AAGGATGGCC 
ATGAACTCTC CCCCAGAGGA TTTGGCATAC GTGGCATCAC TTTACACGGG TACTGGTATT
AACAGGCCAG ATTTTCTAGC TGTAGTAGAT GTTAATCCGA AATCCGAAAC TTACTCTAGG
ATTGTGGGTA AAGTGGAAAT GCCTAATCTC AACGATGAGC TCCACCATTT CGGCTGGAAC
GCGTGTAGTT CCTCTCTCTG TCCCAACGGC AGGACAGACT TGGAAAGAAG GTTCCTAATA
GTGCCAGGTC TGAGATCGTC AAGAATTCAT GTCATAGACA CAAAGGATAA TCCCAGACAG
CCCAAGATAG TAAAGGTTGT GGAACCGTCA GAGGTTAGTA AAGTAACAGG CTACACGAGG
CTTCACACCG TACACTGCGG TCCAGATGGT ATTTACATAA GCGCATTCGG GAACGAGATA
GGTGAAGGGC CTGGAGGAAT ACTTTTACTA GATCATTTCA CCTTTGAACC CCTGGGAAAG
TGGGAGATAA ACAGAGGAGA CCAATACTTC GCGTATGATT TTTGGTGGAA TCTCCCCAAT
GAGGTCATGG TCACCAGCGA ATGGGCAGTT CCGAACACCA TCGAGGATGG CCTAAGGTTG
GAACATCTGG AGAAGGGCTA CGGAAACAGA ATTCACTTCT GGGACTTACG GAGAAGGAAA
AGGGTAAGTT CCATAACCTT GGGGGAAGAA AACAGAATGG CATTGGAACT AAGACCTCTC
CACGATCCCA CAAAGTTGAT GGGTTTCATT AACATGGTTG TAAGCCTTAA GGATTTAAGT
AGTTCCATAT GGCTTTGGTA TTTTGAAGAT GGAAAGTGGA ACGCCGAGAA GGTGATAGAG
ATTCCCGCTG AACCTGGTGA AGGATTACCC GAGATAATTA AGCAGTTTAA GGTTGTACCG
CCTCTCGTGA CCGATATCGA TCTATCCCTT GACGATAAGT TCCTTTACGT AAGTATGTGG
GGCATAGGAG AGGTTAGGCA ATATGACGTT AGTGACCCAT TTAGACCTAG ACTCGCAGGA
AAAGTTAGGC TAGGGGGAAT TCTTCATAGG GCCGATCATC CCTCAGGTTT CAGGTTAACT
GGTGGACCGC AGATGCTCGA GGTAAGTAGG GATGGAAGAA GGATATACGT GACTAATTCC
CTATATAGCA CATGGGACAA TCAGTTCTAC CCTGAAGGGT TGAAGGGCTG GATGGTAAAG
ATAAATTCAA GTCAAGACGG TGGACTAGAA GTTGACAAGG AATTTTTAGT GGATTTCGGA
GAAGCTAGGG CACACCAGGT GAGGCTTAAG GGTGGAGACG CTTCCTCTGA TTCCTATTGT
TATTCTTAG
 
Protein sequence
MGILPPFRRD PTFYPSPRMA MNSPPEDLAY VASLYTGTGI NRPDFLAVVD VNPKSETYSR 
IVGKVEMPNL NDELHHFGWN ACSSSLCPNG RTDLERRFLI VPGLRSSRIH VIDTKDNPRQ
PKIVKVVEPS EVSKVTGYTR LHTVHCGPDG IYISAFGNEI GEGPGGILLL DHFTFEPLGK
WEINRGDQYF AYDFWWNLPN EVMVTSEWAV PNTIEDGLRL EHLEKGYGNR IHFWDLRRRK
RVSSITLGEE NRMALELRPL HDPTKLMGFI NMVVSLKDLS SSIWLWYFED GKWNAEKVIE
IPAEPGEGLP EIIKQFKVVP PLVTDIDLSL DDKFLYVSMW GIGEVRQYDV SDPFRPRLAG
KVRLGGILHR ADHPSGFRLT GGPQMLEVSR DGRRIYVTNS LYSTWDNQFY PEGLKGWMVK
INSSQDGGLE VDKEFLVDFG EARAHQVRLK GGDASSDSYC YS