Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mthe_0815 |
Symbol | |
ID | 4462106 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | + |
Start bp | 875293 |
End bp | 878667 |
Gene Length | 3375 bp |
Protein Length | 1124 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 639699831 |
Product | hypothetical protein |
Protein accession | YP_843244 |
Protein GI | 116754126 |
COG category | [S] Function unknown |
COG ID | [COG4743] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.676426 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGTTC AGGCAGGCCA AGGCGTGGTT GTGTTTGCGC TTCAGGCGCT TCTGCTCTAT GCACTCATAT CAGCGGTATT CCCTGAGAAA GAGCCTCTTG GGTCTATTTT TCATAGGATC ATTCTCAGCA TCGGCGTGAC GCTGGTGGTG TCTCTGGCGG AGGTCGCACT GGGTATGGGG GAATTTCCCG CGCTGATTGT CGCCATCCTG GTTTTGATTC TCGTGGTCCT TGCGCACATC CGTAGGGGTG CGGTCCCGAG GAGGAGAAGG TTCTCGGTAC ACACCTACAG ATACGCTCTC AGGAGCAATC GAAGAAGCCT TCCGCAGAAG ATTGTGCCCG CGATCATCGT TCTTCTGATT GTTGCTGCAA TCTCTTTCGG CTACGTGACG GTGAAAAGCG ATAGGCAGGA GGGTTTCACA GAGTTCTACG TGACTGGTTT GAGCATAAAT AGTAATGGAA ACGCGACAGC GACTGTTGGC GTCATAAACC ACGAGGGAGT TGAGGCGAAC TACACGATCG ATGTTGAGAT GAATGGAGTG AGGGCAGCAC AGAGGTATGC ACATCTGGCC GATGGAGGAG CATATGAGGA GACGCTGGAG TGGGGCATAC CCATGCACAC GGGCGAGCTT TGCACGCTCA GAATTCTTCT TTACAGAAAT GGCGAGATGC TGAAGGATTA CCAGAAAACC CTGAGGATCA GCGATTACAT CACAGAGGCT CCTGTGGCGA ATATGTCAGA AAGCGGGGAT GCGCAGCAAG TGAACACAAC GTCGAATGTC ACGGCAGTCG CGAAGAACAT ATCACAGGAC ACATCTCTGC GCAGCACACC GGTGGCTTCT GCCACGAGAA GGGTCTCATC AGGGTCTGGA GGTTCCGGAT CATCCAGCAG CTCCGGCTCA TCATCCTCAT CAGGAATATC AAATGATGTG AGCGCTGCTT CGAAGATGCC AGGGAACGAG AGCGCGTTAG AGGGCAGCAC GACTGATATG AATCAATCAG CTGCTGCAGC AATGCCTGAC GAAAACGCAA CTGCAGTCGC GAACGCCACG CCGGCCAGTG TCTCTCTCAT GGCCGCTTTA ACAAGCGTTT CCGCCATCAT AAATCAGACG GGGGAGGGCG CTGATTTAAG TGATGCATCG GTGAATGCAT CCACAGCCCC AGCAGATCTT GAGAATAGGA GCACAGCTCC TTTGCCTGCG GTGTCGCTGA ACAGAAGCCC GAGCAGGAGC ACAAGCGCCG TGGAGAACAG ATCTCAGGAG ATTGTGCATC TGAAGGCGCG CCCTGAGATC GAGCGGCTCG AGCCGGATAA GGAGAGCCCT CAGAAGGTCG GAACCACGGT TGTCTGGAGT GTCAGAGCCT CAACGCATGC GGATGGTCTG AGCTACAGGT TCTTTTTGAA TGGCAGGCCG ATCACCAGCT GGAGCAGCAG CCCGAGCTGG ACCTGGTACA CATCGGGAAT TCCAGCTGGA GAATACAACG TGAGTGCGTG TGTGAGGGGT TCGAATAGCA CAGACTGCGA GGATACGGCA TGGTCGCTGT ACACACTGCT GCCGATGAAC CTGCCGCCGG ATCTGACTGC CGTGATATCA GATCCTGAGG GTCCGCAGCC ACAGGGCACA CAGATAAGAT GGTCCGCAGC CGCATCTGAC GAGGAGAACG ATACGCTGCT GTACAGGTTC TCCGTTGATG GTCTCACCGT TAGAGATTGG TCGAGATCCG GCATCTTCGA GATGAACACA TCCGCTCTCG CTCCGGGAAA TCATACGGTC CGCGTTGATG TGAGGGATGG ACTCCACTCA GCCGAATATG ATGATGCCAT TGAGCAGGTC GTGGAGATCG TGGCGGCAAA CGCCCCTCCT AAAATCAAGG GACTTGTGCC TAACAGGGAG AGCCCGCAGC CTGTTGGATC TGTGATTACA TGGACTGCAT CTGCGGATGA TCCCGACAAC GACACGCTGC TGTACAGATT CTTTGTTGAT GATGTGGCTG TGAGCGGATG GTCGACGAAT CCGCAGTTTG TATGGAACAC GAGCGGCTAC AGCGATGGCG AACACAGGGT GAGCGTTCTG GTTAAAGATA AACATCATGA GGATCACGAC AGGAGGGATG TGAGCTTCTC TCTCGAAAAG AAGAACTCTC CACCAAGGGC GCTGAGCCTC CTGCCTGACA GGGGGAGCCC GCAGCCTGTG GGCTCGAGGA TCACCTGGAG CGCACTTGCA GAGGATCCCG ATAACGATCC AATCCAGTAC AGGTTCTTTG TTGATGGTGT GGCTGTTAGC GACTGGTCGT CATCCGCGGT ATTCGAGCTC GATACATCCG GGCTCTCGCC GGGCAACCAC ACGCTCAGCG TTCATGTGAG AGATGGCATG CACGCGGATC ACGATGACGA GAAGGAGAAG TTCTTCGAGC TCAGCGTTCC CAACAGGCCG CCAGCGGGGC TGAAGCTCAT GCCATCTTTA GAAAGCCCGC AGCCTGTGGG ATCGGTGATA ACATGGACAG CCTCTGCAGA TGATCCCGAC AACGACACGC TGCTGTACAG ATTCTCGCTG AACGGGAGGG CGGTGACCGA CTGGTCGCCG TCAGGCAGCT GGTCTTGGAA CACATCCGGC CTCTCCGCCG GAGAGCACAC GGTGGGAGTC TGGGTCAGGG ATGGCCGTCA TGCAGGTGCG GTAGGCTTTG ATTCTGCTCA GACATCGAAG TTCATACTGA CACCGGAGAA CCGCCCACCG GAGATGATAT CGCTCAGACC AGACAGGGCC GGACCCTACG AGCCCGGGGC CGAGGTGACG TGGATCGCGG AGGCGAGGGA TCCCGAGGGC GATGCGCTCC TGTACAGGTT CTTTGTTGAT GGTGCTGCTG TGAGCGACTG GTCCGGCACA GAGCGCTGGA CTTTGAGCAC TTCAGAGGAG GGCAGGCACA GCATCACAGC AGCTGTGAGA GACACGTCTC ATGAGACCGT CCAGAGCATC ACATCTGAGT TCGCGGTGGA ATCAAAGATG ACTGTGAACC AGCCACCTGT GATGGAGGAT CTCGTGCCCG ATCTGAGCAG CCCGCAGCAT GCCGGCATAT CTGTAATATG GACTGCGAGG GCACATGACC CTGAGAACGA TCCTCTCATG TACAGATTCC TGGTCGACGG CTCTCCGGCC ACGGACTGGT CGTCGTCCAG CAAGTTCACC TGGAACACTG TGGGCGTGGC AGCGGGCGAT CACAACATCA CCGCGCAGGT CATGGATGGA AGCAGCATCA TCAGCATGTC ACGCAACTAT ACCATAAGAT CCATAGTGGA TGAGGCCCTC AGCGGCATCG AATCTTCGAG TAGCAGCGCT GCTCTCGGGA GCAAGAACAT AACATCCGTC AGGGTTGGAA GATAG
|
Protein sequence | MNVQAGQGVV VFALQALLLY ALISAVFPEK EPLGSIFHRI ILSIGVTLVV SLAEVALGMG EFPALIVAIL VLILVVLAHI RRGAVPRRRR FSVHTYRYAL RSNRRSLPQK IVPAIIVLLI VAAISFGYVT VKSDRQEGFT EFYVTGLSIN SNGNATATVG VINHEGVEAN YTIDVEMNGV RAAQRYAHLA DGGAYEETLE WGIPMHTGEL CTLRILLYRN GEMLKDYQKT LRISDYITEA PVANMSESGD AQQVNTTSNV TAVAKNISQD TSLRSTPVAS ATRRVSSGSG GSGSSSSSGS SSSSGISNDV SAASKMPGNE SALEGSTTDM NQSAAAAMPD ENATAVANAT PASVSLMAAL TSVSAIINQT GEGADLSDAS VNASTAPADL ENRSTAPLPA VSLNRSPSRS TSAVENRSQE IVHLKARPEI ERLEPDKESP QKVGTTVVWS VRASTHADGL SYRFFLNGRP ITSWSSSPSW TWYTSGIPAG EYNVSACVRG SNSTDCEDTA WSLYTLLPMN LPPDLTAVIS DPEGPQPQGT QIRWSAAASD EENDTLLYRF SVDGLTVRDW SRSGIFEMNT SALAPGNHTV RVDVRDGLHS AEYDDAIEQV VEIVAANAPP KIKGLVPNRE SPQPVGSVIT WTASADDPDN DTLLYRFFVD DVAVSGWSTN PQFVWNTSGY SDGEHRVSVL VKDKHHEDHD RRDVSFSLEK KNSPPRALSL LPDRGSPQPV GSRITWSALA EDPDNDPIQY RFFVDGVAVS DWSSSAVFEL DTSGLSPGNH TLSVHVRDGM HADHDDEKEK FFELSVPNRP PAGLKLMPSL ESPQPVGSVI TWTASADDPD NDTLLYRFSL NGRAVTDWSP SGSWSWNTSG LSAGEHTVGV WVRDGRHAGA VGFDSAQTSK FILTPENRPP EMISLRPDRA GPYEPGAEVT WIAEARDPEG DALLYRFFVD GAAVSDWSGT ERWTLSTSEE GRHSITAAVR DTSHETVQSI TSEFAVESKM TVNQPPVMED LVPDLSSPQH AGISVIWTAR AHDPENDPLM YRFLVDGSPA TDWSSSSKFT WNTVGVAAGD HNITAQVMDG SSIISMSRNY TIRSIVDEAL SGIESSSSSA ALGSKNITSV RVGR
|
| |