Gene Mthe_0149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_0149 
Symbol 
ID4462823 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp136744 
End bp140085 
Gene Length3342 bp 
Protein Length1113 aa 
Translation table11 
GC content55% 
IMG OID639699157 
ProductS-layer-like domain-containing protein 
Protein accessionYP_842589 
Protein GI116753471 
COG category 
COG ID 
TIGRFAM ID[TIGR01567] S-layer-related duplication domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00285183 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACTGCA AAAGTTTGAT TGTCATCGCG CTCATCGTAA TATCAGCTGG ATTTTTATCG 
GGAGGTGTGG CGGAGAATGC AGATAATAGG TCGGCTGAGC AGTCAGTAAC AAGTGCGGCT
GCTCTACAGG ATGATTTCTT CTCCACAGAC ACCTCCTCCA GCGGTATAAC TGTGACATTC
AAGAGGCTGG GAAATAACAC AGCAAAAACC CCGGTCTCTC TGGGATCCAG AAACACCACC
GTGAGCACGG TGGATCGTGA TGCGGAAAAA AACGCAACAT CCGGTGCAAA TGCCACGACG
GAGGGGAGGC CTGCAGGCGC TTCAGAGAAT GTGAGCTCTG CAGTGGAGCA GAACCGCACC
GCTCAGAGCG CAGCGCCAAG CATGGCCATA GCTTCTGGCG CAGCTCCTGC ATCTAATGTG
TCTGCAGTGG CCACCGAGGG GCTCACTGAG GGGAACCTGA CATCAAATGC GACGTCTGAG
AGCGGCGCAT CAAACGCGAC TGCCGCTCCG GGCATATCCA CAAACCTCAC AGCAGTCGTG
ACCCCAGAGG GTCTCGCTCC CGAGGGGAAT CTGACATCAA ATGTGACATC AGCAGTTGCA
GAGAACGCAA GCCTGGCAGG CAACCAGAGC TCTAACCTGA CCGCGGTTCT CGTACCTGCA
AACATCTCTG CGGAGAACCT GACCGCGGAG AACGTAACAG AAGAGGAGAA TGTAACTGCA
GCTGAGAATG TGACCGAGGA AGTCACAGAG GAGGCTCCGG AGGAGGTTGG CGAGGAGGTC
GCTGAGGAGG AGTTCACGGA CAGGATCTGG AGAGAGGGAA TGCCAGAGAC ATACACATGG
ACGCCACAGA CGTTCTCAGG ATTCTTCTAT GATCTCGACG ATATGGTTGG CACCGAGAAG
CTGACCGTCA GCCTGAGTCG CTCAGGCGGC GGCTACAACA GGGCCATCGA CACAGGGAAT
ATAAGATACA CCAGTGATGT CCAGGACATA AGCTTTGAGT TCGACGACTG GGGGAAGTAT
CAGGTACTCG GCTTCATGGC GGAGAAGTAC TTCGCCGGAT ATTCTGGAAC TGAAGTGGTT
GATGATGTGA GCCTGATCAA CGAGAACCAG CTCAGGCGCG TGCTGATAGA CAGTGATGAT
GAGAAGACCA TAACATCAGG CTCCGTGCTC CCGCTTGAAG AGGGATACGA GCTCAGGATC
AAGGAGATAG ATATCAACGG CAACAAGGTC CATCTCGCAC TCGCAAAAGA TGGCGATGAG
ATCGACAGCA AGGTGATATC TCCTGACGAT CTGAAGAGCG CCACGTACAT GTACGAAGAG
GAGATCGGCG GCAAGGATGT TCCTCTCATA ATGGCTCACG TCTCGAATGT CTTCGCGGGC
GCGGAGTCGA GTCTGGTCAC AATAGACGGA CTCTTCCAGA TATCAGATAC ATACGCCTCT
GTCGAGGAGG GCGACAAATA CGACAAGATG GAGGTGGTGT CGGTCTCTGA CTCCGGAATC
GAGCTGGAGA ATGAGGACTC AGTGACTCTC AGAAAGGGCA GAACGATCCA GCTGATGGGC
GGTGTTGGTC TGCAGGTGGC CGACTCTGAT GTGCTCAGAT TTGCGCCTGT TGTCGAGAGA
ACCGGATCCT ACGAAGTCAG AGGAACTGTG GTAAATCCAA ACAAGGTGGA TAGCTTCACC
TGGACCCCCT ACAACTTCGA GGGCTTCTAC TACGATATCG ACGAGGATAT CGGGACGGAG
AAGCTTGTCG CGAGGTTCTC TGGAAGCAAG ATCGATGATG GTGATCTGAA GTACGAGACC
TCTCCGCAGC CTGTGGAGTT CGAGTTCAAT GGCTGGGGTA AGTATGATGT CATCGGGTTC
ATGGCAGACA AGTACTTCGC TGGCTACAAC AATGAGACAC TGTTCACGGA TGAGTTCAGC
ATCATAAACG ATGGCGAGTT GAGAAAAGTC CTGATAGATA GCGATGAGGA GAGCACGATA
TCATCGGGAT CTGTTCTGCC TCTCGAGGAC GGCTACGAGC TCCAGATAAA AGAGGTGGAT
CTCGACGGTA ACAAGGTCTG GCTCTCCCTG ACGAAGGATG GGGATGAGGT GGACAGCAAG
GTCGTCACAC CGGTATCAGG AGACCTCGAG GCCTCAACAT ACACCTACAA GGTGCGTATC
GGCTCCGAGG ATGTCCCGAT AATAGCAGCC CACATAAGCA ATGTCTTCCG CGGCAGAGAG
GCGGATCTGG CGACAGTCGA CGGGATATTC CAGGTCTCGG ACACGCCTGA GTCTGTGGAG
GAGGGCGACA AGCACGGCAA GATGGAGGTG GAATCGCTCT CCGATGATGG CATAACGATG
AAGAACGACG GCTCGATAAG CCTCGGGAGG GGCAAGGATG TTGAGATCAT GGGCAACCTG
AGGCTCAGGG TAGCCGACAA TCCGGAAAGA AACCTCTGCC CGATAGCCCT GCGTGTGGGC
AAGACAGAGC CACTCAGGCT CAACCTGACA GAGGCGATCG TTGGTAAGCC AATAATGATA
CAGGTCACAT CCGGCGGGCA GGCTGTGAGC GGAGCAAAGG TGCTTGTTGA CGGCAGGGAG
ATCGGCACGA CAGATGCAGG CGGAATGATC AGATACACGC CGGAGAGAGC TGGAAGCGTT
CAGGTCCAGG CGAAGCTATC TGGATACGAG GACGCGAGCG GGACGCTCCT GGTGAGAACT
GAGGCCGAGT TGAGGAGGAT TGTGATAACA GCACCCCCAG AGGTCATGAG GGGCGAGACC
TTCGTGGTGA CTGTCAGGGG AGGCGCAAAT GCCACCCAGG CGATCGCGGG AGCTAATGTG
AGCATAGACA ACATGCCTGC TGGAGTGACC GACAGCAAGG GATCTGTCTC CGTATCGATA
AACGATACTG GGGACCACAC GATATCTGTG GAGGCAGCAG GCTACGACAG GGCGACGAAG
AGCGTGAAGG TGCTCTCTCC GATCAGCATC GTCGGTATAA ACGTCACGGG CGATGCGATC
GCTGGTAAGC CGCTGAAGAT CGTCGCCGAG GTCCAGAACA CAGGAAAGGC GCCCGACTCC
AGGCAGTTGC AGCTTCTGGT GAACAAGAAC GTCACTGGCA ACAAGAGCAT CACTGTTGCG
CCTGGAGAGA CCGAGAAGGT CACATTTGAG TACAGGCCGA AGGAACCTGG CGTATACACC
TTCGAGGTGG ATGGCATCCA GAAGACTGTA TCTGTCGAGG AGGCGAAGGG CGGATGGCTC
GTGTGGGCCG TTGCCCTGCT GATAATCCTG CTCGCCGGAA TCGGTGTGTA TCTCTACAGG
ACCGGTGAGC TGAAGGAGCT GAAGAAGCGC CTCAAGATGT GA
 
Protein sequence
MYCKSLIVIA LIVISAGFLS GGVAENADNR SAEQSVTSAA ALQDDFFSTD TSSSGITVTF 
KRLGNNTAKT PVSLGSRNTT VSTVDRDAEK NATSGANATT EGRPAGASEN VSSAVEQNRT
AQSAAPSMAI ASGAAPASNV SAVATEGLTE GNLTSNATSE SGASNATAAP GISTNLTAVV
TPEGLAPEGN LTSNVTSAVA ENASLAGNQS SNLTAVLVPA NISAENLTAE NVTEEENVTA
AENVTEEVTE EAPEEVGEEV AEEEFTDRIW REGMPETYTW TPQTFSGFFY DLDDMVGTEK
LTVSLSRSGG GYNRAIDTGN IRYTSDVQDI SFEFDDWGKY QVLGFMAEKY FAGYSGTEVV
DDVSLINENQ LRRVLIDSDD EKTITSGSVL PLEEGYELRI KEIDINGNKV HLALAKDGDE
IDSKVISPDD LKSATYMYEE EIGGKDVPLI MAHVSNVFAG AESSLVTIDG LFQISDTYAS
VEEGDKYDKM EVVSVSDSGI ELENEDSVTL RKGRTIQLMG GVGLQVADSD VLRFAPVVER
TGSYEVRGTV VNPNKVDSFT WTPYNFEGFY YDIDEDIGTE KLVARFSGSK IDDGDLKYET
SPQPVEFEFN GWGKYDVIGF MADKYFAGYN NETLFTDEFS IINDGELRKV LIDSDEESTI
SSGSVLPLED GYELQIKEVD LDGNKVWLSL TKDGDEVDSK VVTPVSGDLE ASTYTYKVRI
GSEDVPIIAA HISNVFRGRE ADLATVDGIF QVSDTPESVE EGDKHGKMEV ESLSDDGITM
KNDGSISLGR GKDVEIMGNL RLRVADNPER NLCPIALRVG KTEPLRLNLT EAIVGKPIMI
QVTSGGQAVS GAKVLVDGRE IGTTDAGGMI RYTPERAGSV QVQAKLSGYE DASGTLLVRT
EAELRRIVIT APPEVMRGET FVVTVRGGAN ATQAIAGANV SIDNMPAGVT DSKGSVSVSI
NDTGDHTISV EAAGYDRATK SVKVLSPISI VGINVTGDAI AGKPLKIVAE VQNTGKAPDS
RQLQLLVNKN VTGNKSITVA PGETEKVTFE YRPKEPGVYT FEVDGIQKTV SVEEAKGGWL
VWAVALLIIL LAGIGVYLYR TGELKELKKR LKM