Gene Mthe_0939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_0939 
Symbol 
ID4463332 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp1024616 
End bp1026760 
Gene Length2145 bp 
Protein Length714 aa 
Translation table11 
GC content58% 
IMG OID639699959 
ProductTPR repeat-containing protein 
Protein accessionYP_843367 
Protein GI116754249 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACCAAT TAGTAGTGCT TCATGAGATA TTCGAGTCGT TCTCCAGGGG CACCGGCGCC 
CTCCGCGATA TCGAGCCCAG GGCGGTTGAG GGCAGGTGTG CCAGTCTTGT GGTCAAGCTC
CTGCAGAGCT GTCTCGATGG TGATATCGAT GAGGAGGTGT GTCGCGAGCT CTCCGAGTGC
CTGAGCAATA TCTATGAGCT CAAGAGGCTG GGCGACATAT GCCGGAGGGC AGGCATGCCC
GAGATCGCGA TGAAATGCTA CTCGAAGGCG CTCTCCCAGG CACAGGAGCC GCATGTCCGC
TCTGTGCTGC TGAACAACCT CGGCCAGGTC CATGCGCAGA AGGGCGACCT GAGCAAGGCG
ATTGTATACT ATAAAAAAGC CCTGGAGGGA TTCGAGTATG CAGGAGATCC GAGCAGCGCT
GCCCATGTCA TGGGGAACCT AGCATCTGCG TACAGGCGCG CGTACCAGTG GGATCGAGCC
GTGGAGTGCT ACTTCAAGGG TCTGAAGGGA TTTGAGAAGC TCAACGACAG CTTCGGCGTC
GCCCAGATGA CCGGATCCCT GGGAAGGGTG TACGCAGAGA TGGGCGAGAG GGAGCTTGCT
GTCTTGTATT ATGAGAAGAG CCTGAGGATG TTCGAGCAGC TCGGGGATCG CAGGAGCGCG
GCCTGGCTTC TGAGCCGCCT TGGAAGGGTG TACGCAGAGA TGGGAAGGTG GGACGACTCG
AAGAGATGTT TTGATAGAAG TTTGTCCATC TTCGAGGATC TCGGTCAGAG CCAGAACGCG
GGCATAGTCC TCTCAAACCT CGGCCGCTTC TACCTGGAGA AGGGGGATCC TGATTCAGCC
AGGGTTTCCC TGGAGCGCGC GCTGAAGCTG TTGAGAAAGG AGATGCTGCC TGTTTACCCG
AACACGGTTG CGGCGCTCGC GGCCGCTTAC AGCCTGCTCG CGAGCAGGTA CGCTGCTGAG
AGAAATGCGA AACAGTCGTC TCAGCTTTAC TCGAATGCAT CTGACTGCTT CAGCGAGCTT
GCGCTCCATC CCAGGATTCT CATATCGGAG CTGAAGGCGG CTGCTGGACA GGCGAGATCG
CTTTCCTATC TTGTGAAGCT CCGCGCAGAA CCCCGCGGGG ATGAGGCCGT CGCCTTGTGC
GAGAGAGCGA TATCTGCGCT CGAGAGCACG ATCGCCAGCA CCACATCGAA GGACAGGGAG
AAGGTCGCAT CACTCGTCCG GTGTCTGGAG GGGATAAAGG AGCTCTGGAG CATAGATCTC
TACGCAGCGG AGCCGTGGCG GATACTCAGC ATGGTCTCCG TGGCCTGTGA GTACCTGATG
GGTGGCATTC GGGAGCTTGC TAAGACCCTG GGACCGTACA AGGAGATGTA TGAAGCTCTT
GGTGCCATCA ACGGGGCGCT AGATGACGAG AGGCAGCGCA GAGACTCTTC GGCGAACCTG
AAGAGAGCTG CTGAGCATCT TCGGGCATCA CAGACTGAAA GCTTCGAGAA GATCGCCGAA
ACGCTCGAGA TGGCTGGAAG ATCCGAGCTT GTTCAGGGAA TGAGTCCGTC TGACATACTG
AACTATGGAG CTCACAGAAA GGCTCTCATG AGCATCGGCT GGGCTGCAGC GCAGAGCCTG
CTATCGGAGA TCGACAGGAC TGGATGGATC TACGCATGGG ACGAGTCGAT GAATCTCTCA
GAGATGGGGC CGGTTGGAAG GACCGAGCGG ATAAGGAGAA AGGCGGAGAG CATCGCGGAG
ATGAAAGAGG TGCCTGTGGT AGAGGTGGGT TCTGTGAACA TCAGGGGCTC TGCAGTTGAT
GTGCCGGTGG AGAGCGCTCT CGGGAACACA TCCATTGTGC CCACACAGAC CGTGCTTGTG
TCTGCTGCAA CATCGCTGCC TGCTCGTCCT GAGACCGTGC GCCCTGAGAT TACGCTTGAG
GGGCCATACG TCATCGAGAG CTTCGAGGAT TATGAGCAGG AGGGTGCATT TTACGATGAG
CCTCACAGAG ACAATCCCAC TGCTGGTACC GGTTGCGCAG AGTCTGCCTA TGTGGCTCAA
AGAGGCCATC TGCGGGACTC GATCGCCGCG AGGATCATCG TGGGGCTGAT AGCGCTGGCA
GGGACGGCAT ACGCGATAAT GCATGTGATT CTGGGGGTTC TGTGA
 
Protein sequence
MDQLVVLHEI FESFSRGTGA LRDIEPRAVE GRCASLVVKL LQSCLDGDID EEVCRELSEC 
LSNIYELKRL GDICRRAGMP EIAMKCYSKA LSQAQEPHVR SVLLNNLGQV HAQKGDLSKA
IVYYKKALEG FEYAGDPSSA AHVMGNLASA YRRAYQWDRA VECYFKGLKG FEKLNDSFGV
AQMTGSLGRV YAEMGERELA VLYYEKSLRM FEQLGDRRSA AWLLSRLGRV YAEMGRWDDS
KRCFDRSLSI FEDLGQSQNA GIVLSNLGRF YLEKGDPDSA RVSLERALKL LRKEMLPVYP
NTVAALAAAY SLLASRYAAE RNAKQSSQLY SNASDCFSEL ALHPRILISE LKAAAGQARS
LSYLVKLRAE PRGDEAVALC ERAISALEST IASTTSKDRE KVASLVRCLE GIKELWSIDL
YAAEPWRILS MVSVACEYLM GGIRELAKTL GPYKEMYEAL GAINGALDDE RQRRDSSANL
KRAAEHLRAS QTESFEKIAE TLEMAGRSEL VQGMSPSDIL NYGAHRKALM SIGWAAAQSL
LSEIDRTGWI YAWDESMNLS EMGPVGRTER IRRKAESIAE MKEVPVVEVG SVNIRGSAVD
VPVESALGNT SIVPTQTVLV SAATSLPARP ETVRPEITLE GPYVIESFED YEQEGAFYDE
PHRDNPTAGT GCAESAYVAQ RGHLRDSIAA RIIVGLIALA GTAYAIMHVI LGVL