Gene Information |
Locus tag | Mthe_0149 |
Symbol | |
ID | 4462823 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | + |
Start bp | 136744 |
End bp | 140085 |
Gene Length | 3342 bp |
Protein Length | 1113 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 639699157 |
Product | S-layer-like domain-containing protein |
Protein accession | YP_842589 |
Protein GI | 116753471 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01567] S-layer-related duplication domain |
|
|
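The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon (neither convention is stated in the record itself). The function names `gene_length` and `expected_protein_length` are hypothetical helpers introduced for illustration.

```python
# Values copied from the Gene Information table above.
START_BP = 136744
END_BP = 140085


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Protein length in aa for an unspliced CDS.

    Assumes the CDS length is a multiple of 3 and that the final
    codon is a stop codon that is not counted in the protein.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 3342
print(expected_protein_length(length_bp))  # 1113
```

Under these assumptions the computed values agree with the listed Gene Length (3342 bp) and Protein Length (1113 aa).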
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00285183 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACTGCA AAAGTTTGAT TGTCATCGCG CTCATCGTAA TATCAGCTGG ATTTTTATCG GGAGGTGTGG CGGAGAATGC AGATAATAGG TCGGCTGAGC AGTCAGTAAC AAGTGCGGCT GCTCTACAGG ATGATTTCTT CTCCACAGAC ACCTCCTCCA GCGGTATAAC TGTGACATTC AAGAGGCTGG GAAATAACAC AGCAAAAACC CCGGTCTCTC TGGGATCCAG AAACACCACC GTGAGCACGG TGGATCGTGA TGCGGAAAAA AACGCAACAT CCGGTGCAAA TGCCACGACG GAGGGGAGGC CTGCAGGCGC TTCAGAGAAT GTGAGCTCTG CAGTGGAGCA GAACCGCACC GCTCAGAGCG CAGCGCCAAG CATGGCCATA GCTTCTGGCG CAGCTCCTGC ATCTAATGTG TCTGCAGTGG CCACCGAGGG GCTCACTGAG GGGAACCTGA CATCAAATGC GACGTCTGAG AGCGGCGCAT CAAACGCGAC TGCCGCTCCG GGCATATCCA CAAACCTCAC AGCAGTCGTG ACCCCAGAGG GTCTCGCTCC CGAGGGGAAT CTGACATCAA ATGTGACATC AGCAGTTGCA GAGAACGCAA GCCTGGCAGG CAACCAGAGC TCTAACCTGA CCGCGGTTCT CGTACCTGCA AACATCTCTG CGGAGAACCT GACCGCGGAG AACGTAACAG AAGAGGAGAA TGTAACTGCA GCTGAGAATG TGACCGAGGA AGTCACAGAG GAGGCTCCGG AGGAGGTTGG CGAGGAGGTC GCTGAGGAGG AGTTCACGGA CAGGATCTGG AGAGAGGGAA TGCCAGAGAC ATACACATGG ACGCCACAGA CGTTCTCAGG ATTCTTCTAT GATCTCGACG ATATGGTTGG CACCGAGAAG CTGACCGTCA GCCTGAGTCG CTCAGGCGGC GGCTACAACA GGGCCATCGA CACAGGGAAT ATAAGATACA CCAGTGATGT CCAGGACATA AGCTTTGAGT TCGACGACTG GGGGAAGTAT CAGGTACTCG GCTTCATGGC GGAGAAGTAC TTCGCCGGAT ATTCTGGAAC TGAAGTGGTT GATGATGTGA GCCTGATCAA CGAGAACCAG CTCAGGCGCG TGCTGATAGA CAGTGATGAT GAGAAGACCA TAACATCAGG CTCCGTGCTC CCGCTTGAAG AGGGATACGA GCTCAGGATC AAGGAGATAG ATATCAACGG CAACAAGGTC CATCTCGCAC TCGCAAAAGA TGGCGATGAG ATCGACAGCA AGGTGATATC TCCTGACGAT CTGAAGAGCG CCACGTACAT GTACGAAGAG GAGATCGGCG GCAAGGATGT TCCTCTCATA ATGGCTCACG TCTCGAATGT CTTCGCGGGC GCGGAGTCGA GTCTGGTCAC AATAGACGGA CTCTTCCAGA TATCAGATAC ATACGCCTCT GTCGAGGAGG GCGACAAATA CGACAAGATG GAGGTGGTGT CGGTCTCTGA CTCCGGAATC GAGCTGGAGA ATGAGGACTC AGTGACTCTC AGAAAGGGCA GAACGATCCA GCTGATGGGC GGTGTTGGTC TGCAGGTGGC CGACTCTGAT GTGCTCAGAT TTGCGCCTGT TGTCGAGAGA ACCGGATCCT ACGAAGTCAG AGGAACTGTG GTAAATCCAA ACAAGGTGGA TAGCTTCACC TGGACCCCCT ACAACTTCGA GGGCTTCTAC TACGATATCG ACGAGGATAT CGGGACGGAG AAGCTTGTCG CGAGGTTCTC TGGAAGCAAG ATCGATGATG GTGATCTGAA GTACGAGACC 
TCTCCGCAGC CTGTGGAGTT CGAGTTCAAT GGCTGGGGTA AGTATGATGT CATCGGGTTC ATGGCAGACA AGTACTTCGC TGGCTACAAC AATGAGACAC TGTTCACGGA TGAGTTCAGC ATCATAAACG ATGGCGAGTT GAGAAAAGTC CTGATAGATA GCGATGAGGA GAGCACGATA TCATCGGGAT CTGTTCTGCC TCTCGAGGAC GGCTACGAGC TCCAGATAAA AGAGGTGGAT CTCGACGGTA ACAAGGTCTG GCTCTCCCTG ACGAAGGATG GGGATGAGGT GGACAGCAAG GTCGTCACAC CGGTATCAGG AGACCTCGAG GCCTCAACAT ACACCTACAA GGTGCGTATC GGCTCCGAGG ATGTCCCGAT AATAGCAGCC CACATAAGCA ATGTCTTCCG CGGCAGAGAG GCGGATCTGG CGACAGTCGA CGGGATATTC CAGGTCTCGG ACACGCCTGA GTCTGTGGAG GAGGGCGACA AGCACGGCAA GATGGAGGTG GAATCGCTCT CCGATGATGG CATAACGATG AAGAACGACG GCTCGATAAG CCTCGGGAGG GGCAAGGATG TTGAGATCAT GGGCAACCTG AGGCTCAGGG TAGCCGACAA TCCGGAAAGA AACCTCTGCC CGATAGCCCT GCGTGTGGGC AAGACAGAGC CACTCAGGCT CAACCTGACA GAGGCGATCG TTGGTAAGCC AATAATGATA CAGGTCACAT CCGGCGGGCA GGCTGTGAGC GGAGCAAAGG TGCTTGTTGA CGGCAGGGAG ATCGGCACGA CAGATGCAGG CGGAATGATC AGATACACGC CGGAGAGAGC TGGAAGCGTT CAGGTCCAGG CGAAGCTATC TGGATACGAG GACGCGAGCG GGACGCTCCT GGTGAGAACT GAGGCCGAGT TGAGGAGGAT TGTGATAACA GCACCCCCAG AGGTCATGAG GGGCGAGACC TTCGTGGTGA CTGTCAGGGG AGGCGCAAAT GCCACCCAGG CGATCGCGGG AGCTAATGTG AGCATAGACA ACATGCCTGC TGGAGTGACC GACAGCAAGG GATCTGTCTC CGTATCGATA AACGATACTG GGGACCACAC GATATCTGTG GAGGCAGCAG GCTACGACAG GGCGACGAAG AGCGTGAAGG TGCTCTCTCC GATCAGCATC GTCGGTATAA ACGTCACGGG CGATGCGATC GCTGGTAAGC CGCTGAAGAT CGTCGCCGAG GTCCAGAACA CAGGAAAGGC GCCCGACTCC AGGCAGTTGC AGCTTCTGGT GAACAAGAAC GTCACTGGCA ACAAGAGCAT CACTGTTGCG CCTGGAGAGA CCGAGAAGGT CACATTTGAG TACAGGCCGA AGGAACCTGG CGTATACACC TTCGAGGTGG ATGGCATCCA GAAGACTGTA TCTGTCGAGG AGGCGAAGGG CGGATGGCTC GTGTGGGCCG TTGCCCTGCT GATAATCCTG CTCGCCGGAA TCGGTGTGTA TCTCTACAGG ACCGGTGAGC TGAAGGAGCT GAAGAAGCGC CTCAAGATGT GA
|
Protein sequence | MYCKSLIVIA LIVISAGFLS GGVAENADNR SAEQSVTSAA ALQDDFFSTD TSSSGITVTF KRLGNNTAKT PVSLGSRNTT VSTVDRDAEK NATSGANATT EGRPAGASEN VSSAVEQNRT AQSAAPSMAI ASGAAPASNV SAVATEGLTE GNLTSNATSE SGASNATAAP GISTNLTAVV TPEGLAPEGN LTSNVTSAVA ENASLAGNQS SNLTAVLVPA NISAENLTAE NVTEEENVTA AENVTEEVTE EAPEEVGEEV AEEEFTDRIW REGMPETYTW TPQTFSGFFY DLDDMVGTEK LTVSLSRSGG GYNRAIDTGN IRYTSDVQDI SFEFDDWGKY QVLGFMAEKY FAGYSGTEVV DDVSLINENQ LRRVLIDSDD EKTITSGSVL PLEEGYELRI KEIDINGNKV HLALAKDGDE IDSKVISPDD LKSATYMYEE EIGGKDVPLI MAHVSNVFAG AESSLVTIDG LFQISDTYAS VEEGDKYDKM EVVSVSDSGI ELENEDSVTL RKGRTIQLMG GVGLQVADSD VLRFAPVVER TGSYEVRGTV VNPNKVDSFT WTPYNFEGFY YDIDEDIGTE KLVARFSGSK IDDGDLKYET SPQPVEFEFN GWGKYDVIGF MADKYFAGYN NETLFTDEFS IINDGELRKV LIDSDEESTI SSGSVLPLED GYELQIKEVD LDGNKVWLSL TKDGDEVDSK VVTPVSGDLE ASTYTYKVRI GSEDVPIIAA HISNVFRGRE ADLATVDGIF QVSDTPESVE EGDKHGKMEV ESLSDDGITM KNDGSISLGR GKDVEIMGNL RLRVADNPER NLCPIALRVG KTEPLRLNLT EAIVGKPIMI QVTSGGQAVS GAKVLVDGRE IGTTDAGGMI RYTPERAGSV QVQAKLSGYE DASGTLLVRT EAELRRIVIT APPEVMRGET FVVTVRGGAN ATQAIAGANV SIDNMPAGVT DSKGSVSVSI NDTGDHTISV EAAGYDRATK SVKVLSPISI VGINVTGDAI AGKPLKIVAE VQNTGKAPDS RQLQLLVNKN VTGNKSITVA PGETEKVTFE YRPKEPGVYT FEVDGIQKTV SVEEAKGGWL VWAVALLIIL LAGIGVYLYR TGELKELKKR LKM