Gene Msed_1529 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1529 
Symbol 
ID5104057 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1490387 
End bp1492987 
Gene Length2601 bp 
Protein Length866 aa 
Translation table11 
GC content46% 
IMG OID640507416 
Producthypothetical protein 
Protein accessionYP_001191609 
Protein GI146304293 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1361] S-layer domain 
TIGRFAM ID[TIGR01451] conserved repeat domain 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.114366 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAATT TCATAGTTTT ACTACTTATA ATGATATTCT TATCACCGCT CTCTTTTCTT 
TTGGCTACTG CTACTCCTGC CTCTCCAACA GTTTCTGTCT TGGGATATGG TTGGGGTAGT
CCCAGTTCGC CCACGAGCGC CTATCCAGGA TATACTGACT TCCCATTCTA TGTGGAAATA
GGTGCAGTTG GTTCTGCGAC TCCACTCAGC GCATCCATTT CCTTCCCTTC AGGATCTCCG
TTTATTACAG TGGGAGGGAA TCAAGCAGGC GTTATCCAAG GAAGCGGATA TTACCTCGCC
ATCTTCTACT TAACCATTCC TAGCTCAGTA ACCCCCGGTT ATTACTCTGC AAAGGTCACA
GTGGACTATA GCGTTCCCAT AGCAGACCAG GTTTGCGTTG AAAGTAAAAC GTTCAATATT
CAGATTCCAG TGTCAGCTGT CGACTTTCCA ATACCTGTGG GAATTCAGGG AGGAACAAGT
GGGGTTTCGC AACCCTTGGT TACAGGAGAG GGAGCCTCGC CCTTAACCAT CTCGGTCTCA
AATCCCTCCA ACAACATAGT GGACAACGTT CTGGTTAACC TGACGCTCCC TTCAGGTTTG
ATGAGCGAGA CGGGAAAGAA GTTCCTTATA TTTACGATTC CCTCAATCCC TCCTCAGGAG
ACCATTCAAT CCACTCAGTT AGTTAACGTT ACGCAAAATG CTATCCCTGG TACCTATTCT
CTCAACTATT CAGTAACATT TACCAACTAC CTCGGATACA GGTATTACGC AACTAACTCC
AATATAACTT CAGGCAACGC CAACGTAACT CTGGTTAACA TACCCTTAAC CCTTACTATC
TATCCGAAAA CGCCAATAAC CTTCTACGTA ACCTCTACCT CAGCTACTCC CTCCTCGCTT
GTCTCTATTA CTTTACGGGC CAACTCGTCA TACAACGCAT TCATAGAATC CGTAACTCCA
CAGACTAGTT TAACCCTTCT CAGTTCTAAC TTCACGCCCA CGACATTTCA GGGAACTGCC
AACTTCAACT ACACATTTGA AGTCCCGCAA ACGCTAGCGC CAGGAGCTTA TCCCATAACC
TTCACAATCA CGTATGAGAT CTTCGGACAA CAGCAAGAAA CCGCCGTAAC CACCTTCGTA
CAGGTTAACT ATTACAACGC AATGCCTACC CTATCTGATC CTACCTGGTC CACGCTAGCT
TCTCCTGGGA CTGCGGGGGT TACGCTAACG TTTCTGCTTA CCAATCCACT TCCTTATCCC
ATTTCAGACG TTAATGTAAC AATTCTTCCA CCAGCGGGAA TGAGTTCAAC CTATATCTCA
TACGTCGTTC CCACGCTCTC AGCCTCAGGA CAGGGCATAA ACTATGCCCA AGTGCCCTTC
ACCATAACCT TAGCACAGAA CGTAACACCT GGGTATCATG AGATACCTTT CGTGGTAAGC
TACTTCAGTA GCTACGGTTT TCATAAAGTG AGATCCCAGA TACCCGTTTA TGTTTACCCG
CAGAGCCAGC TTCTTGCCTT AGTTAGTAAC GTCACAGTTT ATCAGGGAAC TCAGGCTGAA
TTACCCATTG TCTTGGTGAA TTATGATCCC GTACCAGTAT CTTCAGTTTC TGCAGAGTTG
AGGCTTACAG GTCTTAGCGT TGTGGGGTTC TCAAATCAGA CTTTTAACAT GGGTCCCAAT
TCTAACGCAA CAGTAGTTTT CACCATTTCC GCTCAGGGAG TCGGTGCAGG GAGCTATCCA
GCTACATTGA CCCTCGTGTA TAATTACGAG GGCGTTACCA AGACCGTAAC CTATACGATT
CCAATTAACG TGCTTCCAGC TCAAAATATA GTGGACGTCT CCATTACTCC AACTGAGGTT
TACTATGGTA CGATCAATAA TGTTACCGTA AGGCTAATCG ACACGGCTCA AACTCCGCTT
AATAATGTCG TACTAAAGTT GTTTGGACCA TCCTCTGAAT TTTCTCTATC CCAGAACACG
GTGGATATTG GGACCCTCTC TCCAGGTAAG AGTTATACCG TAACCTTGAA CCTGTTACCA
ACTGTTTCCT CAACTACACC ACTGCCCCTT TCTGTAGAGG TACAGTATCT CTTACCCGGA
AGTGGCGTCA TTACGCAGAG TTATAACTTC TCGTTGATAG CCACCGGACT AGTGGATCTG
GTTCTACAGC AACCCACAAT ATCGTTCTCA AACGGCACGC TTACAGTTAC CGGAGTTCTA
AATAATTTCG GCACAGCGTC AGCTAACTTC GTTACAGTAT ATGTGGATGG CAATTCCACT
TACATAGGAA GCGTACCGCC CAATAGCCCT ACACCCTTCT CGACAACGTT AGTTATACCC
TTCGCGGGTG GAAATACCTC TACAGCGAAA CCTCACCAAG TTAAGATAGT TGTGTCCTAC
GAGGACTCCA TATATCAAAC CCATAACTTG ACTTACGTTC TAAGCTTCTC ACCATCTGCG
ACCCATTTTA ATACCACGTT CACCAATTTC AGGCACTTCA GAAGTAGCAA TTCCCTTCTA
CCAGAAATCG TGATAGCTAT CCTCCTTGTA ATAGTTATAG TTCTGGCTAT CCTCTTAGTC
CTTAGGAAGG GGAAAAAGTG A
 
Protein sequence
MKNFIVLLLI MIFLSPLSFL LATATPASPT VSVLGYGWGS PSSPTSAYPG YTDFPFYVEI 
GAVGSATPLS ASISFPSGSP FITVGGNQAG VIQGSGYYLA IFYLTIPSSV TPGYYSAKVT
VDYSVPIADQ VCVESKTFNI QIPVSAVDFP IPVGIQGGTS GVSQPLVTGE GASPLTISVS
NPSNNIVDNV LVNLTLPSGL MSETGKKFLI FTIPSIPPQE TIQSTQLVNV TQNAIPGTYS
LNYSVTFTNY LGYRYYATNS NITSGNANVT LVNIPLTLTI YPKTPITFYV TSTSATPSSL
VSITLRANSS YNAFIESVTP QTSLTLLSSN FTPTTFQGTA NFNYTFEVPQ TLAPGAYPIT
FTITYEIFGQ QQETAVTTFV QVNYYNAMPT LSDPTWSTLA SPGTAGVTLT FLLTNPLPYP
ISDVNVTILP PAGMSSTYIS YVVPTLSASG QGINYAQVPF TITLAQNVTP GYHEIPFVVS
YFSSYGFHKV RSQIPVYVYP QSQLLALVSN VTVYQGTQAE LPIVLVNYDP VPVSSVSAEL
RLTGLSVVGF SNQTFNMGPN SNATVVFTIS AQGVGAGSYP ATLTLVYNYE GVTKTVTYTI
PINVLPAQNI VDVSITPTEV YYGTINNVTV RLIDTAQTPL NNVVLKLFGP SSEFSLSQNT
VDIGTLSPGK SYTVTLNLLP TVSSTTPLPL SVEVQYLLPG SGVITQSYNF SLIATGLVDL
VLQQPTISFS NGTLTVTGVL NNFGTASANF VTVYVDGNST YIGSVPPNSP TPFSTTLVIP
FAGGNTSTAK PHQVKIVVSY EDSIYQTHNL TYVLSFSPSA THFNTTFTNF RHFRSSNSLL
PEIVIAILLV IVIVLAILLV LRKGKK