Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mthe_0198 |
Symbol | |
ID | 4462765 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | + |
Start bp | 194168 |
End bp | 196756 |
Gene Length | 2589 bp |
Protein Length | 862 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 639699206 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_842637 |
Protein GI | 116753519 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.726788 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGTTAC ATCTCCTGTT TGCTGTTCTT CTCATACTGC TCATAACAGC GGGTGGTGCA CCTGCGAAAG AGGTGCTTCT TCTCAACTCC TACAATCCGG GGATGTCCTG GACAGATGAT GTGATCGGCG GTGTGCGCCT CAGACTGGCG ATCGACGCTC CGAATGCGAA TCTCACTGTG GAGTACATGG ACACAAAAAA GGTGCTGCTG AACGAGTCCA GAATGGAGTT TCTGAAGAGG CTGTACTCCG AGAGATATGG TGAAAGAAAA TTCGATGTGA TCATATCTTC AGACGATGAC GCGTTCAGGT TTCTTCTGAC AAACAGAGAT GAGCTGTTCC CCGGAGTTCC TGTGGTCTTC TGCGGTGTGA AGGACTTCAG GCCTGAAATG CTGAGCAATG TCAGTGGATT CACAGGAGTT CTCCTTAACG TGAGCATCGA GGATACGATC GACCTCATGC TCAGGCTTCA TCCAGATACG AACAAGATAG TGGTTGTGAA CGACAACACC ACCACAGGGA TGGCAAACAG GAGGATACTC GAGGGCGTCA TTCCGAAGTT CAACATCACA TTTGATGTTC TTGATAATGT GACTGTGGAT GAGCTGCGTG AGAATGTATC CAGGCTTGGC CCTGGTGTGC TCGTCCTCCT ACTGACATTC AACAGAGACC GCGCTGGAGA GGTATTCACG TATGAGGAGA GCGCTGAGAT TCTCAGGCAG GTCAGCCGCG TCCCTGTCTA CGGCGTGTGG GAGATGTGCC TGGGGCATGG AATAGTCGGA GGATATCTCA GCAGCGGAGA TGCGCAGGGA ATGAAGGCCG CCGAGATCGC GGCGCGCATC CTCCACGGCG CGGATCCAGA GAGCATACCT ATCGTCAGCC ACAGCCCGAA TGTATACATG TTCGATATGC TCGAGCTCCG CAGATTCAAC ATCTCCCGCG GATCTCTCCC CGCGGAGAGC GAGATAATAA ACCGGCCGTA TCATGACAGG GCTGATCTGA GCCACATGAA CCTCAGCTGG CACGACCTGA GTGGTGCGAG CCTGAACCAG ACGTACCTGA ACGGCTCTGA CCTGAGCAAC GCCAATCTTA CAGGCGCCTA TCTGAGGTAC AGCATGATCT ACGATGCGAA CCTGAGTCTC GCAGACCTCT CCGGTGCAGA CATTGAGGGC GCTGATATCC ATAACACAGA TCTACGCGAG GCCAGGCTGA GAGGCGCAAA GCTCATCGGT GTGGACCTGA CCAGAAGCGA TCTCAGCCGC GCGGATCTCA CAGGAGCTCA CATGGAGATC GCCAGGCTCA GCGGCGCGTT GCTGACTGGC ACGATGATGG ACGGAGCGGA TCTGAATGGC ACCAAGATGG ACGGATGTAA TCTGAGCGGA GCGTATGTCA GGAGCGCATT CGTCTACCGT GCCAATCTCA GAGACGCGAA TCTCTCAGGA GCGAACATGA GCGGCTCGGA TCTCTCGGGC GTGGATCTCA CACGCGCAGC TCTCATCTAC TCAGACCTGA GAAACGCATC GATGCAGGAC TCTGTAATCA GAGATGCGAA TCTGACCGGT TCACAGCTTA CAGGCGCCAT TATGATGCAA TCAAACATCT CAGGCGCGAA CCTCTCATTC ACGGATCTCT CAAACACTGA TATGAGAAGA TGCTGTATGC TATTTACAGA TCTCGTAGGC GCGAGGTTGA ACAACGCCAG GCTCGACTCG TCCATGCTCT TCAGGGCAAA CCTCTCCCGG GCATCTCTGG TCTCCGCCAG CCTGCAGGGG GTGGATCTAT CAGGATCGGA TCTCTCGGAG GCGGATCTGA GGGGTGCTGA CATGACAAAT GCAAAGTTGA CGGAGACCGT GCTGGAGGGT GCGGATATGA GCGGCGCCAG GCTCCTCGGC GCGGATCTGA CCCAGGCGAG GATGCATGAT TTGATCTTAA CAAGAGCAAA CATGCTCGGC GCCAGGGCGA ACTGGGTGGA TCTGAGCGGC GCCAGATTAT CGAGAGCTCT GCTCACGAGA GCCGAGCTGT TCGGTGCGGA TCTTAGCGGC ACGGATCTGA GCGGTGCGGA TCTCGTAAAG GCATATGCCC TGAGGGCAAA CCTCTCGGGC GCGGACCTCA CAGATGCAAA GCTAGATGAC GCAGACTTCA GCGGGGCGAT TCTCAGAGGT GCGAAAATGC CGGAGCTCGT GATTCGCAGC GTCAACTTCG GGCAGGCGGA TCTCAGCGAT GCCGATATGT CAGGATGCCG TTTTGAGGCG CTCTACGTAT CAAACGCTGT GATGAGATCT GCGAATATGA GAAATGCCAT TTTCAGAGGG GTGATGTTCG AGAACTGCGA TCTGAGCATG GCGGATCTGA AGAGAATAAA GGCGACGGGT GTGTATCTCA CCAACACAAG CCTCTCCGGA GCTGATCTGC GGGACTCTGA GCTTTACTCG GTAGGATTTA CAAATGTCGA TCTGCGCGGA GCCAGGCTCG ATGGCATAAG GTACGACAGA CCGACGCTCG AGAGCCTGGC GCAGCAGAAC CTCGACGGGG TGAGCATGAG CGATGATCTC AGGAGGGATA TCGAGAGGGT CAGGAACGAA GCATCCTGA
|
Protein sequence | MRLHLLFAVL LILLITAGGA PAKEVLLLNS YNPGMSWTDD VIGGVRLRLA IDAPNANLTV EYMDTKKVLL NESRMEFLKR LYSERYGERK FDVIISSDDD AFRFLLTNRD ELFPGVPVVF CGVKDFRPEM LSNVSGFTGV LLNVSIEDTI DLMLRLHPDT NKIVVVNDNT TTGMANRRIL EGVIPKFNIT FDVLDNVTVD ELRENVSRLG PGVLVLLLTF NRDRAGEVFT YEESAEILRQ VSRVPVYGVW EMCLGHGIVG GYLSSGDAQG MKAAEIAARI LHGADPESIP IVSHSPNVYM FDMLELRRFN ISRGSLPAES EIINRPYHDR ADLSHMNLSW HDLSGASLNQ TYLNGSDLSN ANLTGAYLRY SMIYDANLSL ADLSGADIEG ADIHNTDLRE ARLRGAKLIG VDLTRSDLSR ADLTGAHMEI ARLSGALLTG TMMDGADLNG TKMDGCNLSG AYVRSAFVYR ANLRDANLSG ANMSGSDLSG VDLTRAALIY SDLRNASMQD SVIRDANLTG SQLTGAIMMQ SNISGANLSF TDLSNTDMRR CCMLFTDLVG ARLNNARLDS SMLFRANLSR ASLVSASLQG VDLSGSDLSE ADLRGADMTN AKLTETVLEG ADMSGARLLG ADLTQARMHD LILTRANMLG ARANWVDLSG ARLSRALLTR AELFGADLSG TDLSGADLVK AYALRANLSG ADLTDAKLDD ADFSGAILRG AKMPELVIRS VNFGQADLSD ADMSGCRFEA LYVSNAVMRS ANMRNAIFRG VMFENCDLSM ADLKRIKATG VYLTNTSLSG ADLRDSELYS VGFTNVDLRG ARLDGIRYDR PTLESLAQQN LDGVSMSDDL RRDIERVRNE AS
|
| |