Gene Mthe_0046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_0046 
Symbol 
ID4462738 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp41252 
End bp43309 
Gene Length2058 bp 
Protein Length685 aa 
Translation table11 
GC content53% 
IMG OID639699055 
Productcarbohydrate-binding and sugar hydrolysis 
Protein accessionYP_842489 
Protein GI116753371 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3420] Nitrous oxidase accessory protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0357051 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGGCCTGT GGCGGGCTGT GCTGATAATG TCTCTGGTGG TCATGATAGT ATCCGCATCC 
AGCGCGAGGA TTCTGGTTGT GGATCAGGAG GCGACGGGCG CGATCAGAAG CGTGAATGCA
GCGCTCGCAA ACGCGCAGAA CGGTGACGAG ATACTTGTAA TGGCGGGGTA TTATAGGGAG
AGCGTGGATG TCAATAAGTT GGTCTCCATA AAAGGATATG GTGCGATCAT CGACGGCATG
GGCCGTACGA CCTTCCAGAT GAGATCATAC GGGAGCAGCA TATCCAACCT CACGATCCTC
GGGAGCGGCA GGGATCCGGC AGTCGTGGTA GAGGGATATG CATCTGTGAC CGGCTGCAAC
ATCAAAAACT CCTCCATTGG AATCGCGGCA TACAAGGGAG TGGTATCGAA CAACACTATT
TCAGCTACCG CATATGGGAT CAGGCTGAAC GGCTCCGCGA TTGTAGAGAA CAACACGATA
TCAGGACCGC TGGGGGCCGG CATAGAGGTG AGGTGCAACA GCTCCAGGAT CCTCAGTAAC
ATTGTGACCT CTGCAGGAAA TGCGATAGAT GTGGTCGGAA ACAACAACAC CGTTCTTTTC
AACCACCTCT CCTCATCGAA CATAGGCATA CGGCTGAAGT CATCTTCTGA TAACATAATG
ATGAATAACA CCTGCGAGAA GAACAGGATC GCTGGAGTTT ATCTGGAGGA TTCCAGGAAC
AACAGCGTAT CCTCGAACAG GTTTCTCCGC AACGGCAACG GTATCCTCCT GAAATCCTCT
TCGGGTAATA AGATAATCGG AAATATCGTA GAGCTGAACG AGTACGGGAT ATCGATGAAG
GGCTCTGGTG GTAACCTGCT CAGAGAGAAC GTGCTGATAT CAAATATATA CAGCTTGAGG
ATAGAGGCAG GCAACCTAGG AGATCTCAGG CTCATATCAG ATCCCAGGGC CAGGATGGAC
GACAACTCAT TCAACCAGAG CATCGATGAG TCGAACACGA TAGATGGCAG GCCGGTTCTG
TATCTGGTAG GTGCTCGCAA CGTTTCGATA GATAAGAGCT ACGGATTTGT GGGGCTCATA
GACTGCGAGA ACATCACGAT GAGAAACCAG AGCATATCGA ACAGCAGCGC GGCCATCATG
CTTGTGGGAG CAATCGACTC CAGGATCTCG GACTGCAGGC TCTCCAGAAG CGAGATAGGG
TTATCGATCC TCGACTCGAA AGGCTGTGTG GTGGAGAACA CAACCGCAGA GTCCTGCGGG
ATCGGCTTCT GGTTCGGTCG GTCGCAGGAT ATCTTCGTGA GAAAGAGCGC GGCGCTGAAC
TGCACAGAGA GCGGATTCAG GCTCGAGGGT GACAGGAGAT CGAGGATAAC GGACTCCCGG
GCAGAGAACT GCACTGCCGG GATGCACCTC CTTGACGCCC TCTCCTCAGA GATCGTGCGT
TCCAGGATAT CCTCAAGCAG TGAGGATGGC ATACGGCTGG TCAGGTCGCA CAGATCAACG
GTGAAGGAGA ATGAAGTGAC AGGAAACGAC AACGGCATAG CGATCTCCGG CTCAAATCAG
TGCGTTCTTG CGATGAACAA CGCGAGCGCC AATGGGATAG GGATAAGGAT CGAGCAGCTC
TCCGGAGGAT CTGCTGCTGA TAACATCGCT TTCCGGAACA GGGAGGGGAT CTTCGTGAAC
GGGGTGAAGG AATTCCAGTT CATCGGCAAC AACATCAGCA TGAACGAAAG ATTCGGAATG
CGAATGGGAA GCAGCTCAGG CTGCAATATA AGCGATAATA GATTCGTAGG CAACGGCATG
CTCGGCCTGA GCCTGACAGA CTGCAGCGAC AACAGAATCT ACCACAACAG TTTCATCGAG
AATGGTTCAA TGTTTGGACA GAACGCCGTG GATAACGGAA GCAACCTCTG GGATATGGGG
CCGGTGATCG GAGGAAACTA CTGGTCGGAT CATCAGGTTA GTGGAAATCC CGGAGATACT
CCAAAGAATG TGCCATCAAA GGGCGTGGAC AGATATCCAT TCGAGAGGGA TAATGGATGG
AGGTCACCCG CTCTGTGA
 
Protein sequence
MGLWRAVLIM SLVVMIVSAS SARILVVDQE ATGAIRSVNA ALANAQNGDE ILVMAGYYRE 
SVDVNKLVSI KGYGAIIDGM GRTTFQMRSY GSSISNLTIL GSGRDPAVVV EGYASVTGCN
IKNSSIGIAA YKGVVSNNTI SATAYGIRLN GSAIVENNTI SGPLGAGIEV RCNSSRILSN
IVTSAGNAID VVGNNNTVLF NHLSSSNIGI RLKSSSDNIM MNNTCEKNRI AGVYLEDSRN
NSVSSNRFLR NGNGILLKSS SGNKIIGNIV ELNEYGISMK GSGGNLLREN VLISNIYSLR
IEAGNLGDLR LISDPRARMD DNSFNQSIDE SNTIDGRPVL YLVGARNVSI DKSYGFVGLI
DCENITMRNQ SISNSSAAIM LVGAIDSRIS DCRLSRSEIG LSILDSKGCV VENTTAESCG
IGFWFGRSQD IFVRKSAALN CTESGFRLEG DRRSRITDSR AENCTAGMHL LDALSSEIVR
SRISSSSEDG IRLVRSHRST VKENEVTGND NGIAISGSNQ CVLAMNNASA NGIGIRIEQL
SGGSAADNIA FRNREGIFVN GVKEFQFIGN NISMNERFGM RMGSSSGCNI SDNRFVGNGM
LGLSLTDCSD NRIYHNSFIE NGSMFGQNAV DNGSNLWDMG PVIGGNYWSD HQVSGNPGDT
PKNVPSKGVD RYPFERDNGW RSPAL