Gene Mthe_0198 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_0198 
Symbol 
ID4462765 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp194168 
End bp196756 
Gene Length2589 bp 
Protein Length862 aa 
Translation table11 
GC content55% 
IMG OID639699206 
Productpentapeptide repeat-containing protein 
Protein accessionYP_842637 
Protein GI116753519 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.726788 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGTTAC ATCTCCTGTT TGCTGTTCTT CTCATACTGC TCATAACAGC GGGTGGTGCA 
CCTGCGAAAG AGGTGCTTCT TCTCAACTCC TACAATCCGG GGATGTCCTG GACAGATGAT
GTGATCGGCG GTGTGCGCCT CAGACTGGCG ATCGACGCTC CGAATGCGAA TCTCACTGTG
GAGTACATGG ACACAAAAAA GGTGCTGCTG AACGAGTCCA GAATGGAGTT TCTGAAGAGG
CTGTACTCCG AGAGATATGG TGAAAGAAAA TTCGATGTGA TCATATCTTC AGACGATGAC
GCGTTCAGGT TTCTTCTGAC AAACAGAGAT GAGCTGTTCC CCGGAGTTCC TGTGGTCTTC
TGCGGTGTGA AGGACTTCAG GCCTGAAATG CTGAGCAATG TCAGTGGATT CACAGGAGTT
CTCCTTAACG TGAGCATCGA GGATACGATC GACCTCATGC TCAGGCTTCA TCCAGATACG
AACAAGATAG TGGTTGTGAA CGACAACACC ACCACAGGGA TGGCAAACAG GAGGATACTC
GAGGGCGTCA TTCCGAAGTT CAACATCACA TTTGATGTTC TTGATAATGT GACTGTGGAT
GAGCTGCGTG AGAATGTATC CAGGCTTGGC CCTGGTGTGC TCGTCCTCCT ACTGACATTC
AACAGAGACC GCGCTGGAGA GGTATTCACG TATGAGGAGA GCGCTGAGAT TCTCAGGCAG
GTCAGCCGCG TCCCTGTCTA CGGCGTGTGG GAGATGTGCC TGGGGCATGG AATAGTCGGA
GGATATCTCA GCAGCGGAGA TGCGCAGGGA ATGAAGGCCG CCGAGATCGC GGCGCGCATC
CTCCACGGCG CGGATCCAGA GAGCATACCT ATCGTCAGCC ACAGCCCGAA TGTATACATG
TTCGATATGC TCGAGCTCCG CAGATTCAAC ATCTCCCGCG GATCTCTCCC CGCGGAGAGC
GAGATAATAA ACCGGCCGTA TCATGACAGG GCTGATCTGA GCCACATGAA CCTCAGCTGG
CACGACCTGA GTGGTGCGAG CCTGAACCAG ACGTACCTGA ACGGCTCTGA CCTGAGCAAC
GCCAATCTTA CAGGCGCCTA TCTGAGGTAC AGCATGATCT ACGATGCGAA CCTGAGTCTC
GCAGACCTCT CCGGTGCAGA CATTGAGGGC GCTGATATCC ATAACACAGA TCTACGCGAG
GCCAGGCTGA GAGGCGCAAA GCTCATCGGT GTGGACCTGA CCAGAAGCGA TCTCAGCCGC
GCGGATCTCA CAGGAGCTCA CATGGAGATC GCCAGGCTCA GCGGCGCGTT GCTGACTGGC
ACGATGATGG ACGGAGCGGA TCTGAATGGC ACCAAGATGG ACGGATGTAA TCTGAGCGGA
GCGTATGTCA GGAGCGCATT CGTCTACCGT GCCAATCTCA GAGACGCGAA TCTCTCAGGA
GCGAACATGA GCGGCTCGGA TCTCTCGGGC GTGGATCTCA CACGCGCAGC TCTCATCTAC
TCAGACCTGA GAAACGCATC GATGCAGGAC TCTGTAATCA GAGATGCGAA TCTGACCGGT
TCACAGCTTA CAGGCGCCAT TATGATGCAA TCAAACATCT CAGGCGCGAA CCTCTCATTC
ACGGATCTCT CAAACACTGA TATGAGAAGA TGCTGTATGC TATTTACAGA TCTCGTAGGC
GCGAGGTTGA ACAACGCCAG GCTCGACTCG TCCATGCTCT TCAGGGCAAA CCTCTCCCGG
GCATCTCTGG TCTCCGCCAG CCTGCAGGGG GTGGATCTAT CAGGATCGGA TCTCTCGGAG
GCGGATCTGA GGGGTGCTGA CATGACAAAT GCAAAGTTGA CGGAGACCGT GCTGGAGGGT
GCGGATATGA GCGGCGCCAG GCTCCTCGGC GCGGATCTGA CCCAGGCGAG GATGCATGAT
TTGATCTTAA CAAGAGCAAA CATGCTCGGC GCCAGGGCGA ACTGGGTGGA TCTGAGCGGC
GCCAGATTAT CGAGAGCTCT GCTCACGAGA GCCGAGCTGT TCGGTGCGGA TCTTAGCGGC
ACGGATCTGA GCGGTGCGGA TCTCGTAAAG GCATATGCCC TGAGGGCAAA CCTCTCGGGC
GCGGACCTCA CAGATGCAAA GCTAGATGAC GCAGACTTCA GCGGGGCGAT TCTCAGAGGT
GCGAAAATGC CGGAGCTCGT GATTCGCAGC GTCAACTTCG GGCAGGCGGA TCTCAGCGAT
GCCGATATGT CAGGATGCCG TTTTGAGGCG CTCTACGTAT CAAACGCTGT GATGAGATCT
GCGAATATGA GAAATGCCAT TTTCAGAGGG GTGATGTTCG AGAACTGCGA TCTGAGCATG
GCGGATCTGA AGAGAATAAA GGCGACGGGT GTGTATCTCA CCAACACAAG CCTCTCCGGA
GCTGATCTGC GGGACTCTGA GCTTTACTCG GTAGGATTTA CAAATGTCGA TCTGCGCGGA
GCCAGGCTCG ATGGCATAAG GTACGACAGA CCGACGCTCG AGAGCCTGGC GCAGCAGAAC
CTCGACGGGG TGAGCATGAG CGATGATCTC AGGAGGGATA TCGAGAGGGT CAGGAACGAA
GCATCCTGA
 
Protein sequence
MRLHLLFAVL LILLITAGGA PAKEVLLLNS YNPGMSWTDD VIGGVRLRLA IDAPNANLTV 
EYMDTKKVLL NESRMEFLKR LYSERYGERK FDVIISSDDD AFRFLLTNRD ELFPGVPVVF
CGVKDFRPEM LSNVSGFTGV LLNVSIEDTI DLMLRLHPDT NKIVVVNDNT TTGMANRRIL
EGVIPKFNIT FDVLDNVTVD ELRENVSRLG PGVLVLLLTF NRDRAGEVFT YEESAEILRQ
VSRVPVYGVW EMCLGHGIVG GYLSSGDAQG MKAAEIAARI LHGADPESIP IVSHSPNVYM
FDMLELRRFN ISRGSLPAES EIINRPYHDR ADLSHMNLSW HDLSGASLNQ TYLNGSDLSN
ANLTGAYLRY SMIYDANLSL ADLSGADIEG ADIHNTDLRE ARLRGAKLIG VDLTRSDLSR
ADLTGAHMEI ARLSGALLTG TMMDGADLNG TKMDGCNLSG AYVRSAFVYR ANLRDANLSG
ANMSGSDLSG VDLTRAALIY SDLRNASMQD SVIRDANLTG SQLTGAIMMQ SNISGANLSF
TDLSNTDMRR CCMLFTDLVG ARLNNARLDS SMLFRANLSR ASLVSASLQG VDLSGSDLSE
ADLRGADMTN AKLTETVLEG ADMSGARLLG ADLTQARMHD LILTRANMLG ARANWVDLSG
ARLSRALLTR AELFGADLSG TDLSGADLVK AYALRANLSG ADLTDAKLDD ADFSGAILRG
AKMPELVIRS VNFGQADLSD ADMSGCRFEA LYVSNAVMRS ANMRNAIFRG VMFENCDLSM
ADLKRIKATG VYLTNTSLSG ADLRDSELYS VGFTNVDLRG ARLDGIRYDR PTLESLAQQN
LDGVSMSDDL RRDIERVRNE AS