Gene Information |
Locus tag | Msed_0967 |
Symbol | |
ID | 5104518 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 893902 |
End bp | 895290 |
Gene Length | 1389 bp |
Protein Length | 462 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640506868 |
Product | selenium-binding protein |
Protein accession | YP_001191061 |
Protein GI | 146303745 |
COG category | |
COG ID | |
TIGRFAM ID | |
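The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS length includes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 893902
END_BP = 895290


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count, assuming the CDS ends with one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the stop codon


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))
```

Under these assumptions the computed values agree with the Gene Length (1389 bp) and Protein Length (462 aa) fields in the record.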
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0835459 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.405151 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGGAATAC TGCCACCATT CAGGAGGGAT CCAACGTTCT ATCCATCCCC AAGGATGGCC ATGAACTCTC CCCCAGAGGA TTTGGCATAC GTGGCATCAC TTTACACGGG TACTGGTATT AACAGGCCAG ATTTTCTAGC TGTAGTAGAT GTTAATCCGA AATCCGAAAC TTACTCTAGG ATTGTGGGTA AAGTGGAAAT GCCTAATCTC AACGATGAGC TCCACCATTT CGGCTGGAAC GCGTGTAGTT CCTCTCTCTG TCCCAACGGC AGGACAGACT TGGAAAGAAG GTTCCTAATA GTGCCAGGTC TGAGATCGTC AAGAATTCAT GTCATAGACA CAAAGGATAA TCCCAGACAG CCCAAGATAG TAAAGGTTGT GGAACCGTCA GAGGTTAGTA AAGTAACAGG CTACACGAGG CTTCACACCG TACACTGCGG TCCAGATGGT ATTTACATAA GCGCATTCGG GAACGAGATA GGTGAAGGGC CTGGAGGAAT ACTTTTACTA GATCATTTCA CCTTTGAACC CCTGGGAAAG TGGGAGATAA ACAGAGGAGA CCAATACTTC GCGTATGATT TTTGGTGGAA TCTCCCCAAT GAGGTCATGG TCACCAGCGA ATGGGCAGTT CCGAACACCA TCGAGGATGG CCTAAGGTTG GAACATCTGG AGAAGGGCTA CGGAAACAGA ATTCACTTCT GGGACTTACG GAGAAGGAAA AGGGTAAGTT CCATAACCTT GGGGGAAGAA AACAGAATGG CATTGGAACT AAGACCTCTC CACGATCCCA CAAAGTTGAT GGGTTTCATT AACATGGTTG TAAGCCTTAA GGATTTAAGT AGTTCCATAT GGCTTTGGTA TTTTGAAGAT GGAAAGTGGA ACGCCGAGAA GGTGATAGAG ATTCCCGCTG AACCTGGTGA AGGATTACCC GAGATAATTA AGCAGTTTAA GGTTGTACCG CCTCTCGTGA CCGATATCGA TCTATCCCTT GACGATAAGT TCCTTTACGT AAGTATGTGG GGCATAGGAG AGGTTAGGCA ATATGACGTT AGTGACCCAT TTAGACCTAG ACTCGCAGGA AAAGTTAGGC TAGGGGGAAT TCTTCATAGG GCCGATCATC CCTCAGGTTT CAGGTTAACT GGTGGACCGC AGATGCTCGA GGTAAGTAGG GATGGAAGAA GGATATACGT GACTAATTCC CTATATAGCA CATGGGACAA TCAGTTCTAC CCTGAAGGGT TGAAGGGCTG GATGGTAAAG ATAAATTCAA GTCAAGACGG TGGACTAGAA GTTGACAAGG AATTTTTAGT GGATTTCGGA GAAGCTAGGG CACACCAGGT GAGGCTTAAG GGTGGAGACG CTTCCTCTGA TTCCTATTGT TATTCTTAG
Protein sequence | MGILPPFRRD PTFYPSPRMA MNSPPEDLAY VASLYTGTGI NRPDFLAVVD VNPKSETYSR IVGKVEMPNL NDELHHFGWN ACSSSLCPNG RTDLERRFLI VPGLRSSRIH VIDTKDNPRQ PKIVKVVEPS EVSKVTGYTR LHTVHCGPDG IYISAFGNEI GEGPGGILLL DHFTFEPLGK WEINRGDQYF AYDFWWNLPN EVMVTSEWAV PNTIEDGLRL EHLEKGYGNR IHFWDLRRRK RVSSITLGEE NRMALELRPL HDPTKLMGFI NMVVSLKDLS SSIWLWYFED GKWNAEKVIE IPAEPGEGLP EIIKQFKVVP PLVTDIDLSL DDKFLYVSMW GIGEVRQYDV SDPFRPRLAG KVRLGGILHR ADHPSGFRLT GGPQMLEVSR DGRRIYVTNS LYSTWDNQFY PEGLKGWMVK INSSQDGGLE VDKEFLVDFG EARAHQVRLK GGDASSDSYC YS
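The sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how such a field might be normalized and how a GC fraction like the one in the GC content field could be computed, the helpers below strip the whitespace and count G and C bases. The function names are hypothetical, and the example runs only on the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence shown above (illustration only).
first_blocks = "ATGGGAATAC TGCCACCATT"
seq = clean_sequence(first_blocks)
print(len(seq), gc_fraction(seq))
```

Passing the whole Gene sequence field through `clean_sequence` and then `gc_fraction` would give a value to compare with the reported GC content.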