Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1529 |
Symbol | |
ID | 5104057 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1490387 |
End bp | 1492987 |
Gene Length | 2601 bp |
Protein Length | 866 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640507416 |
Product | hypothetical protein |
Protein accession | YP_001191609 |
Protein GI | 146304293 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1361] S-layer domain |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.114366 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAATT TCATAGTTTT ACTACTTATA ATGATATTCT TATCACCGCT CTCTTTTCTT TTGGCTACTG CTACTCCTGC CTCTCCAACA GTTTCTGTCT TGGGATATGG TTGGGGTAGT CCCAGTTCGC CCACGAGCGC CTATCCAGGA TATACTGACT TCCCATTCTA TGTGGAAATA GGTGCAGTTG GTTCTGCGAC TCCACTCAGC GCATCCATTT CCTTCCCTTC AGGATCTCCG TTTATTACAG TGGGAGGGAA TCAAGCAGGC GTTATCCAAG GAAGCGGATA TTACCTCGCC ATCTTCTACT TAACCATTCC TAGCTCAGTA ACCCCCGGTT ATTACTCTGC AAAGGTCACA GTGGACTATA GCGTTCCCAT AGCAGACCAG GTTTGCGTTG AAAGTAAAAC GTTCAATATT CAGATTCCAG TGTCAGCTGT CGACTTTCCA ATACCTGTGG GAATTCAGGG AGGAACAAGT GGGGTTTCGC AACCCTTGGT TACAGGAGAG GGAGCCTCGC CCTTAACCAT CTCGGTCTCA AATCCCTCCA ACAACATAGT GGACAACGTT CTGGTTAACC TGACGCTCCC TTCAGGTTTG ATGAGCGAGA CGGGAAAGAA GTTCCTTATA TTTACGATTC CCTCAATCCC TCCTCAGGAG ACCATTCAAT CCACTCAGTT AGTTAACGTT ACGCAAAATG CTATCCCTGG TACCTATTCT CTCAACTATT CAGTAACATT TACCAACTAC CTCGGATACA GGTATTACGC AACTAACTCC AATATAACTT CAGGCAACGC CAACGTAACT CTGGTTAACA TACCCTTAAC CCTTACTATC TATCCGAAAA CGCCAATAAC CTTCTACGTA ACCTCTACCT CAGCTACTCC CTCCTCGCTT GTCTCTATTA CTTTACGGGC CAACTCGTCA TACAACGCAT TCATAGAATC CGTAACTCCA CAGACTAGTT TAACCCTTCT CAGTTCTAAC TTCACGCCCA CGACATTTCA GGGAACTGCC AACTTCAACT ACACATTTGA AGTCCCGCAA ACGCTAGCGC CAGGAGCTTA TCCCATAACC TTCACAATCA CGTATGAGAT CTTCGGACAA CAGCAAGAAA CCGCCGTAAC CACCTTCGTA CAGGTTAACT ATTACAACGC AATGCCTACC CTATCTGATC CTACCTGGTC CACGCTAGCT TCTCCTGGGA CTGCGGGGGT TACGCTAACG TTTCTGCTTA CCAATCCACT TCCTTATCCC ATTTCAGACG TTAATGTAAC AATTCTTCCA CCAGCGGGAA TGAGTTCAAC CTATATCTCA TACGTCGTTC CCACGCTCTC AGCCTCAGGA CAGGGCATAA ACTATGCCCA AGTGCCCTTC ACCATAACCT TAGCACAGAA CGTAACACCT GGGTATCATG AGATACCTTT CGTGGTAAGC TACTTCAGTA GCTACGGTTT TCATAAAGTG AGATCCCAGA TACCCGTTTA TGTTTACCCG CAGAGCCAGC TTCTTGCCTT AGTTAGTAAC GTCACAGTTT ATCAGGGAAC TCAGGCTGAA TTACCCATTG TCTTGGTGAA TTATGATCCC GTACCAGTAT CTTCAGTTTC TGCAGAGTTG AGGCTTACAG GTCTTAGCGT TGTGGGGTTC TCAAATCAGA CTTTTAACAT GGGTCCCAAT TCTAACGCAA CAGTAGTTTT CACCATTTCC GCTCAGGGAG TCGGTGCAGG GAGCTATCCA GCTACATTGA CCCTCGTGTA TAATTACGAG GGCGTTACCA AGACCGTAAC CTATACGATT CCAATTAACG TGCTTCCAGC TCAAAATATA GTGGACGTCT CCATTACTCC AACTGAGGTT TACTATGGTA CGATCAATAA TGTTACCGTA AGGCTAATCG ACACGGCTCA AACTCCGCTT AATAATGTCG TACTAAAGTT GTTTGGACCA TCCTCTGAAT TTTCTCTATC CCAGAACACG GTGGATATTG GGACCCTCTC TCCAGGTAAG AGTTATACCG TAACCTTGAA CCTGTTACCA ACTGTTTCCT CAACTACACC ACTGCCCCTT TCTGTAGAGG TACAGTATCT CTTACCCGGA AGTGGCGTCA TTACGCAGAG TTATAACTTC TCGTTGATAG CCACCGGACT AGTGGATCTG GTTCTACAGC AACCCACAAT ATCGTTCTCA AACGGCACGC TTACAGTTAC CGGAGTTCTA AATAATTTCG GCACAGCGTC AGCTAACTTC GTTACAGTAT ATGTGGATGG CAATTCCACT TACATAGGAA GCGTACCGCC CAATAGCCCT ACACCCTTCT CGACAACGTT AGTTATACCC TTCGCGGGTG GAAATACCTC TACAGCGAAA CCTCACCAAG TTAAGATAGT TGTGTCCTAC GAGGACTCCA TATATCAAAC CCATAACTTG ACTTACGTTC TAAGCTTCTC ACCATCTGCG ACCCATTTTA ATACCACGTT CACCAATTTC AGGCACTTCA GAAGTAGCAA TTCCCTTCTA CCAGAAATCG TGATAGCTAT CCTCCTTGTA ATAGTTATAG TTCTGGCTAT CCTCTTAGTC CTTAGGAAGG GGAAAAAGTG A
|
Protein sequence | MKNFIVLLLI MIFLSPLSFL LATATPASPT VSVLGYGWGS PSSPTSAYPG YTDFPFYVEI GAVGSATPLS ASISFPSGSP FITVGGNQAG VIQGSGYYLA IFYLTIPSSV TPGYYSAKVT VDYSVPIADQ VCVESKTFNI QIPVSAVDFP IPVGIQGGTS GVSQPLVTGE GASPLTISVS NPSNNIVDNV LVNLTLPSGL MSETGKKFLI FTIPSIPPQE TIQSTQLVNV TQNAIPGTYS LNYSVTFTNY LGYRYYATNS NITSGNANVT LVNIPLTLTI YPKTPITFYV TSTSATPSSL VSITLRANSS YNAFIESVTP QTSLTLLSSN FTPTTFQGTA NFNYTFEVPQ TLAPGAYPIT FTITYEIFGQ QQETAVTTFV QVNYYNAMPT LSDPTWSTLA SPGTAGVTLT FLLTNPLPYP ISDVNVTILP PAGMSSTYIS YVVPTLSASG QGINYAQVPF TITLAQNVTP GYHEIPFVVS YFSSYGFHKV RSQIPVYVYP QSQLLALVSN VTVYQGTQAE LPIVLVNYDP VPVSSVSAEL RLTGLSVVGF SNQTFNMGPN SNATVVFTIS AQGVGAGSYP ATLTLVYNYE GVTKTVTYTI PINVLPAQNI VDVSITPTEV YYGTINNVTV RLIDTAQTPL NNVVLKLFGP SSEFSLSQNT VDIGTLSPGK SYTVTLNLLP TVSSTTPLPL SVEVQYLLPG SGVITQSYNF SLIATGLVDL VLQQPTISFS NGTLTVTGVL NNFGTASANF VTVYVDGNST YIGSVPPNSP TPFSTTLVIP FAGGNTSTAK PHQVKIVVSY EDSIYQTHNL TYVLSFSPSA THFNTTFTNF RHFRSSNSLL PEIVIAILLV IVIVLAILLV LRKGKK
|
| |