Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0458 |
Symbol | |
ID | 5105454 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 411401 |
End bp | 412876 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640506364 |
Product | hypothetical protein |
Protein accession | YP_001190559 |
Protein GI | 146303243 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTGGGA TCATGAGACA AATCTTAGGC TCAACATTGT TAATTTTACT ACTCGGATCT TTCATAGGAT TAATTGCCGG CTCTATTGGC ACTGGAGCCT CTACCACTCA AACTTACAAC GTCACCTTTA TGGAGCATGG GTTGCCCAGC GGAACCATGT GGTCCGTTAC CTTTAACGGA CAGACAAAGA ACTCCACGAG CAATGAGATA GTGTTTCAAG TTCAGGGAGC AAGTTACTCA TCCTTCTCAA TTCCAAACGT GGGCAACTAT ATTCCAACAC CTTCTAATGG ACAAGTGTTC GTGAATTCCT CGCTTAGTAT AAACGTGACG TTCGCTCTGC CATTCGTTAA GCTAGTGATT GTTAAGTTAG TGGTTTTACA GCAAAGTACC GGAGCCTCAG TCACGGAGCT ACAGCCTGGC ACATCATATG TTGTTGGTGT GGAGGTTCAG AATCAGGGTA ACGTTAATGC GCTCACAGAG GTGAATGAGA CAGTCCTCTA CAACGGAAAA GTCGTTACGT CTGACGTTCC CATAGCTAGC ATAGCTCCTG GGGCCTCGGA AACACTAAGC TTTATCTGGA CCCCATCCAC TGCAGGTATA TATACGTTCC TTGTTAACGT CAAGGCCAAT CCCAACATCT CAGTAAGTGA GACGTATCCA CTTTACGTGG GAGTATCGCC AGTGAACGTA TACAACGTGA GCTTTGTGCA GACAGGCCTA CCCGCAGGGA CACAATGGTC CGTTACCCTC AACGGCACAA CTAAATCATC AACCTCTAAC ATGATAACCT TCCAGGTTCC AGCCGGAACC TACACCTATT CGGTGCAGAA CGTCACAGGT TATCTAAGCA AGGATGTTAC AGGTGAAGTT ACGGTGAAAA ACAGCTCGGT AACGGTACAA ATCACTTTCC TCCCCTTGGT GTTTAAGCCA GTGGCAACGC TATTAGTTTC CTATAACGGT CAGGAAGTTA CCCAGTTACA AACCAACATT ACGTATGACT TGATAGTCAC TGTAAAGAAT GAAGGGAACA CTTCAGGTCA GGGTTATGTT CTCGTCATAG CATCTCAGGG TTCGACGACC GTCCTTAACA AGGCCTTGAA CTATACCTTA AAGCCAGGGC AGGCTGAGAA TTTTACCCTG CTCTTTAACC TCAACTCGAC GCAACCCCTT TCCATCAAGG TAAGCACATA CTCTTTGACC CCCAAGGGAG AAGTTCCAGT CTACAACTCA TCCTCACAGT TCACAGTTGT TCAACAGCCT ACCACGACGA AGACAACAAC CACTAACACA TCTACGTCAA CGACAAATAC GACGAAGACA ACAACCACTA ACACATCTAC GTCAACCACC ACTAACACAA CGACTCCTTC ACCTAAACCA TCCTCAGGCT CGTCTAATAC CTTACTAATT GTCGGAATAG TTGTAGTTGT TGTCGTTATT ATAGCGGTGG CAGTGATTTT CCTAAAACGG AAATAA
|
Protein sequence | MFGIMRQILG STLLILLLGS FIGLIAGSIG TGASTTQTYN VTFMEHGLPS GTMWSVTFNG QTKNSTSNEI VFQVQGASYS SFSIPNVGNY IPTPSNGQVF VNSSLSINVT FALPFVKLVI VKLVVLQQST GASVTELQPG TSYVVGVEVQ NQGNVNALTE VNETVLYNGK VVTSDVPIAS IAPGASETLS FIWTPSTAGI YTFLVNVKAN PNISVSETYP LYVGVSPVNV YNVSFVQTGL PAGTQWSVTL NGTTKSSTSN MITFQVPAGT YTYSVQNVTG YLSKDVTGEV TVKNSSVTVQ ITFLPLVFKP VATLLVSYNG QEVTQLQTNI TYDLIVTVKN EGNTSGQGYV LVIASQGSTT VLNKALNYTL KPGQAENFTL LFNLNSTQPL SIKVSTYSLT PKGEVPVYNS SSQFTVVQQP TTTKTTTTNT STSTTNTTKT TTTNTSTSTT TNTTTPSPKP SSGSSNTLLI VGIVVVVVVI IAVAVIFLKR K
|
| |