Gene Mthe_1280 details


Gene Information

Locus tag: Mthe_1280
Symbol:
ID: 4461948
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosaeta thermophila PT
Kingdom: Archaea
Replicon accession: NC_008553
Strand:
Start bp: 1388538
End bp: 1390052
Gene Length: 1515 bp
Protein Length: 504 aa
Translation table: 11
GC content: 53%
IMG OID: 639700297
Product: NHL repeat-containing protein
Protein accession: YP_843698
Protein GI: 116754580
COG category: [C] Energy production and conversion; [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0526] Thiol-disulfide isomerase and thioredoxins
TIGRFAM ID:
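
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
start_bp, end_bp = 1388538, 1390052
gene_length_bp, protein_length_aa = 1515, 504

print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under those assumptions both checks agree with the listed values: 1390052 - 1388538 + 1 = 1515 bp, and 1515 / 3 - 1 = 504 aa.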


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.164299
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGCAGA TTTGTAAGAC TGCGATGAAT CCTCCTGCGA CCACAAGAGT GATCTGCACA 
CAATCCAGAG CAAACCCCAT GATAGCCCCT GAGTTCCCAG AGGATCTGGT ATGGCTCAAC
ACCGACCGCA GGTACACACT GAGAGACCTC CGCGGCAGGT TTGTCCTCAT TGACTTCTGG
ACATACTGCT GCATAAACTG CATGCATGTC GTTACCGACC TAAAAATGCT TGAGGAGAGA
TATCCTGAGC TTGTGGTGAT AGGGGTTCAC ACCGCCAAGT TCGAGAACGA GATGAGAGTG
GAGAACATCG AGAAGGCGAT AATGCGATAT GGAATCGAGC ATCCAGTCAT CGTGGATAGC
GATCGAACCC TGTGGCGCGC ATACGGTATA AGGGCCTGGC CGTCTTTCGT CCTCATAGCG
CCGGACGGGG AGATCCTCGG GAGGACCTCT GGAGAGGGGA TATTCTCGAT CCTTATGCCA
ATAATGGAGC AGCTCATTCC GGAGTACGAG AAGCGCGGAA GCCTCCATCA TGGAAAGCCG
GCGCCCAGAG CGACACATAA GGGCGTTTCC GGAGCACTCT CCTTCCCCGG AAAGGTGATC
TCTGGTGGAG ACAATATCTT CATCGCGGAT TCGAACAATA ACAGAATACT GATCGTCTCT
CCTGACGGTG ATCTGATGGA CGTTATCGGC TCCGGAGAGA GGGGATACAG TGATGGAGAT
TTCAGCGAGG CACGGCTCTT CAGACCCCAG GGGATCGCTA TCGTCGGGGA TGTTGTTTAC
ATCGCAGATA CAGGCAACCA CATGGTCAGG GCGGCGGATC TGAGAAGAAG AACTCTTGTG
AGGATGGCAG GAACGGGAAA GTCACGGCAT CCTGGCCTTG GGGGCAGAGG CGCTGAAGTA
TCACTGAGCT CCCCGTGGGA TCTCGTTTTC GTTCAAGATC ATCTCTACAT TGCGATGGCA
GGATCACATC AGATATGGAG GATGGATCTT GAGGGGATGG TAGAGCCTTA TGCAGGATCA
GGTATCGAGG GGCTCGCTGA TGGACCTCTG GAGCAGGCTC GCCTGGCCCA GCCGTCCGGG
CTGACGACCG ATGGGAATAG GATATACTTC GTCGACAGCG AATCTTCATC ACTCAGGGTA
ATAGATGGCG ATGTGAGAAC GCTCATCGGA AGGGATCTCT TTTACTTTGG GGACATCGAT
GGTGATTTTG GGAGGGCCAG GCTTCAGCAT CCACTGGGGC TTTTTTACAA AGAGGGATCC
ATTTATGTCG CGGATACCTA CAACCACAGG ATCAAGAAAG CTGACCTCTC GAGCGGATCC
ATTCACACCA CCGCCGGAAC TGGGAGTCCC GGTTTCGCAG ATGGTCCTGG TGCTCAGGCT
GCGTTTAATG AGCCCTCCGG CCTCACCTTT CTGGGGGATT CGTTATTCAT AGCGGATACC
AACAATCACG CCGTTAGGAT ATACGATCAG AGATCAGGGG ATGTCTCCAC GATGAGAATC
GATACGAAGA AATAA
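
A GC content figure like the one listed for this gene can be recomputed from the sequence text. The following is a minimal sketch, assuming the sequence is pasted in the same space-separated 10-base layout shown above; `gc_fraction` is a hypothetical helper name, and only the first row of the sequence is used here as sample input.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First row of the gene sequence above; paste the full block to
# compute the value for the whole gene.
first_row = "ATGATGCAGA TTTGTAAGAC TGCGATGAAT CCTCCTGCGA CCACAAGAGT GATCTGCACA"
print(f"{gc_fraction(first_row):.1%}")
```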
 
Protein sequence
MMQICKTAMN PPATTRVICT QSRANPMIAP EFPEDLVWLN TDRRYTLRDL RGRFVLIDFW 
TYCCINCMHV VTDLKMLEER YPELVVIGVH TAKFENEMRV ENIEKAIMRY GIEHPVIVDS
DRTLWRAYGI RAWPSFVLIA PDGEILGRTS GEGIFSILMP IMEQLIPEYE KRGSLHHGKP
APRATHKGVS GALSFPGKVI SGGDNIFIAD SNNNRILIVS PDGDLMDVIG SGERGYSDGD
FSEARLFRPQ GIAIVGDVVY IADTGNHMVR AADLRRRTLV RMAGTGKSRH PGLGGRGAEV
SLSSPWDLVF VQDHLYIAMA GSHQIWRMDL EGMVEPYAGS GIEGLADGPL EQARLAQPSG
LTTDGNRIYF VDSESSSLRV IDGDVRTLIG RDLFYFGDID GDFGRARLQH PLGLFYKEGS
IYVADTYNHR IKKADLSSGS IHTTAGTGSP GFADGPGAQA AFNEPSGLTF LGDSLFIADT
NNHAVRIYDQ RSGDVSTMRI DTKK
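
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below builds the codon table from the compact amino-acid string used for NCBI translation table 11, which assigns the same amino acids as the standard code. It is an illustration only: it assumes the reading frame starts at the first base, and it does not handle the alternative start codons that table 11 allows (not needed here, since this gene begins with ATG). `translate` is a hypothetical helper name.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGATGCAGA TTTGTAAGAC TGCGATGAAT"))  # MMQICKTAMN
```

The result matches the first ten residues of the protein sequence, and translating the final row of the gene sequence (`GATACGAAGA AATAA`) gives `DTKK` followed by the TAA stop codon, matching the C-terminus.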