Gene Mthe_0401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_0401 
Symbol 
ID4462595 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp407855 
End bp411496 
Gene Length3642 bp 
Protein Length1213 aa 
Translation table11 
GC content54% 
IMG OID639699405 
Producthypothetical protein 
Protein accessionYP_842834 
Protein GI116753716 
COG category 
COG ID 
TIGRFAM ID[TIGR01451] conserved repeat domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.051931 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGGAC TGGGCATTGT TTGTAGCCCG ACTACGCGAA GCTGTATCCG GTTTTCAGCT 
TGGCTTAGCC TCGGTCTCCT TTTCGTATCC CTCCTGCCAT GTACGATACC CTCAGCATGC
GCCTGGCCCT TCTGGCTGCC CCTCCCTACC TCCTCCAGCC CAGAACTGGA GATGACAGTC
AGAACCGATT TTACGGATGT CGCCGCCGGC GATGTACTGG ATTACACCTG CATCCTCAGA
AACACCGGAG ATCGCACACT CTCAAATATA TCTGTTACAG ATGGCACCAG AGAGATAGGG
TTTCGCGACA CCCTGCCACC AGGGGACGTC TTCAGCATAA GAGCGAAGAC GCCACCTATC
TTATCACCAT CACGGTTCAA ACTCACAGCA AGCTCAGATG GGAGAGAGTT GGCATCCCAA
GAGTTCAGCA TCGGGGTTGT GGAGAAGGAG GAGCGCAGTA CACCATACCC TGCTCTCAGG
GCAACTGTTG AGAGCAGCAG GATCGGAGGA GGTTCTGAGC ACAGGATACG GATCGCGAAT
ACTGGGAAGA CCGAGCTGAG ACGCGTTAAG ATAGTCGACC GCAGGAACAG CGTCCTCGGC
ATAATACCTG CGCTTGCCCC AGGGGAGAGC CATACGCTTG TGATAGGCTC AGTGGATGTT
ACAGGTCTTA GAGTCCTGGC TTTAAACGAG GCCGGAGAGC ATCTTGCCGG TGATGTCAGA
TACACTGGGT CACGCCCGGG ATCCGCAATC GCTGCTGAAT CCTCAGAGAC CACCCTCCGG
ACTATGGAGT ACAGGCCATC CAGCAGCCTG GATCTGATCC TAGAGACGAG CACTACCAGA
GCACGAGCTG GGGAGAATAT CAGCTACCGA TGTACTGTCA TAAACTCTTG CAGCGACGTT
ATCTACAACG TGGATCTGCT CTGCCATGAG AAGAGAATGA CAACCCAGTT CATCCTTCCC
AAAAGATCTG TATGCATAGA GGGTAACTTC ACCATCAACT CCACGACAGA GATCAGAGCG
AACGTATCCG GGAGCGACAG GAATGGTGCT CTAGTCACGA ATGAGACATC TGTGCTGATA
TGGGTCGTCT CACCTGAGCT GAATATAAGT GCCAGCGCCG ACCCCCCAAA GATAATATCG
GGAGGGGTGA CAAATATCAC CGTCAAACTG GAGAATTCCG GATCTGATTT GCTCAGGGAT
ATCGTTGTGG AGGACAGCTT CGGCCATATA GGGGAGATAA AAGAGCTGAA GCCAGGTGAG
GTGAGCCATG TAAGCAGGAA GCGACAGATC ACCTCATCGA TAGATGATAT TGTAATGGCG
TCTGCCAGGG ACTCAACTGG CAAGGAGATT TATGCGTTCT CACAGATAGC TGTGCATGTC
TACCGAGCCG GTATGAACCT CTCGCTGGAT CCATCAGAGC TTTTTCTCTA TCCGGGAGAG
GAGACAGAGG TCCTGTGCAC AGTGGAGAAC ACCGGGGAGG TATCTCTCAG GAACGTGACG
CTTGCTGGAA TCAGGAGCGC CAGGATCGGG GAGATCCCGC CAGGAACTGA GACGAGAATA
GCTGCTTCCA TCTCATCCGA TATCTCAAAA GATGTAACGA TCACCGCAAC CGGATACGGG
GATGGCGATC AGGTCGTCTC CGACACTGCG GTTCTATCGA TGAAGGTAAT CAGGCCGAAC
ATAACGCTGT CTGTGATGCC GACTCGCGTT GTGACCCCTG CTGGAGATGA ATTCAACGTG
AGCTGTCTCG TCTCCAACTC CGGCACGGAT CTCCTGCATG ATGTGCAGAT CTCGGAATCC
GAGATCGGGA CTGTTGCATC CCTTGGAGAC CTGGCGCCTG GCGAGTTCAG AACAGTCACT
CCTTCAATTT CCGTTAACAA GAACCGCACG CTGAGATTCA CGGCCGCCGG AAGAGACGCC
CGCGGCAGGA TCTGGAGCGC AGATGCATCC TCTGAGGCGG TTGTGGTCGA GGCAGCTTTG
AATCTCAAGA TCAGCGTCTC GCCAGAGGCG GCACGCTTTG GGGATAAGGT GAGGATCTCA
TGCACCGTTG AGAACGTGGG CGGTGTGCCG CTGCAGGAGA TATTCGTAAC GAGCAGGGTC
ATAGGGCCCA TGGGAAGCAT AGATTTCCTG GCCCCGAAGC AGAGGAAAAC TGTGACCATG
ACGATACCGC TCACAATGGA CATCTCGGAT ACGGTTACAG CTGAGGGGCT GACCCCATCG
AGAGGGCATG TTCTGGATAG GGAGCCCCTG AGCATAAGGC TTCTCAAGGG GCTCGAGCCG
AGGCCATCGG GAAAATCGCA GAACAGCGCT CCGGCTGCAT CAAAAGCAGG CGTGTATGTT
AACGATTCTT CAGTAAAAGA GATCGGCGCG AATGCATTAA ACGAGAGCGA ATCGATCGCA
TCGCTCGCCA AAACAGTCAC ATCAAACACT ACCAGTTCCT TCACAGATGG TGCAGCTGGG
AATTCATCCT CCGGAAACGC GGTCGCATCT GAGAACGTGA GCAGAGCTGT AAAGGAGAGG
GCATCTGATG TGCTCAGCTC AGATGGACAC CGCGAAAGCA TTCCTACATC GAAAGATCAG
TCTGCTCTCT CAATCCTCAG AGATGTGATC AGATACATCC AGGAGTTGAT AACGAGAAAG
CAGATCGATG TCTCGAGACA GAACGCCACT GGCAGAGATG TGTCATCTGA ATCTGGCAAC
ATTTCCCTGG AAGCATACGA TCCATCCCTG AACACAACAC TGGGGATAGA GAGCGTGAGG
GATATTCGGG TTCCGGTCCG CATTCTGGAC GTCACTGCTA TACCGCCAGA ACCTGTTGCA
GGTGCTCCTG TCAGGGTTGT CGCCCACATC AGAGGCGACT CAGATGTTGA TACTGTCGTC
CTGGAGTACG GGGTGGCGGA CCAGAGCATA GCCAGAGCCG ATATGGTAAA CATGAAGAGG
ACCTCCCAGA TCAAGATGCT CCTTGAAAGC GGTAGCAAGA GTGACGGCTA TTGGAGCTGC
TTGATCCCCG GGCAGAGCGC GGGGACAATT CTGGGGATAT CGGTCAGGGC CGAATGCGGC
CAGAGCAGCG CAGAGGATGG GCCGTACATG CTGCAGTGGG TCTCCCAGGC TCCATCAAGA
AGACCGAGCC CTCTGCCTGA GACGAAAACA AAGGGAGACG GAATGCTCTA CATCGAGTCC
ACGACTGTTA GCGGAAGAGG GGAGGTCTCG ATAAGAGACA CGTTCATGGA GAACTCTATG
TCATATGAGG AAGACCTGAA GGGAAGCGGG AGCCTGGACA TGGAGTCTGT CAGGTGCATG
GAGAAGTCGA ACCCCACGGT CAACTTCACC GAGATCAAGG ATATCTCCTT CGATGGCGGC
GTTCTGAAGG GCTTTAAGCG GTTCGACTCG CCTGTCTTCC ACGGCGGGAT GGGCGCAAGC
ATCACAGAGC GATTCAACAT GAGCCAGCTC GATAAGAGCG AGCGAGGGAT GATTCGATCT
ATAAATTACA GAAACAACAC GGTGGCGTTC TCCACAGAGC AGGCGTTCGA GGGAATGTGG
AATACCAGAA CCGAGTACTC CAAGTTCAGC AAGAAGATGA AGGCTGATCA GAGACTCAAT
GGAAGCTTCC AGATACAGAA GAACATAAAG TTCCAGGACT AG
 
Protein sequence
MKGLGIVCSP TTRSCIRFSA WLSLGLLFVS LLPCTIPSAC AWPFWLPLPT SSSPELEMTV 
RTDFTDVAAG DVLDYTCILR NTGDRTLSNI SVTDGTREIG FRDTLPPGDV FSIRAKTPPI
LSPSRFKLTA SSDGRELASQ EFSIGVVEKE ERSTPYPALR ATVESSRIGG GSEHRIRIAN
TGKTELRRVK IVDRRNSVLG IIPALAPGES HTLVIGSVDV TGLRVLALNE AGEHLAGDVR
YTGSRPGSAI AAESSETTLR TMEYRPSSSL DLILETSTTR ARAGENISYR CTVINSCSDV
IYNVDLLCHE KRMTTQFILP KRSVCIEGNF TINSTTEIRA NVSGSDRNGA LVTNETSVLI
WVVSPELNIS ASADPPKIIS GGVTNITVKL ENSGSDLLRD IVVEDSFGHI GEIKELKPGE
VSHVSRKRQI TSSIDDIVMA SARDSTGKEI YAFSQIAVHV YRAGMNLSLD PSELFLYPGE
ETEVLCTVEN TGEVSLRNVT LAGIRSARIG EIPPGTETRI AASISSDISK DVTITATGYG
DGDQVVSDTA VLSMKVIRPN ITLSVMPTRV VTPAGDEFNV SCLVSNSGTD LLHDVQISES
EIGTVASLGD LAPGEFRTVT PSISVNKNRT LRFTAAGRDA RGRIWSADAS SEAVVVEAAL
NLKISVSPEA ARFGDKVRIS CTVENVGGVP LQEIFVTSRV IGPMGSIDFL APKQRKTVTM
TIPLTMDISD TVTAEGLTPS RGHVLDREPL SIRLLKGLEP RPSGKSQNSA PAASKAGVYV
NDSSVKEIGA NALNESESIA SLAKTVTSNT TSSFTDGAAG NSSSGNAVAS ENVSRAVKER
ASDVLSSDGH RESIPTSKDQ SALSILRDVI RYIQELITRK QIDVSRQNAT GRDVSSESGN
ISLEAYDPSL NTTLGIESVR DIRVPVRILD VTAIPPEPVA GAPVRVVAHI RGDSDVDTVV
LEYGVADQSI ARADMVNMKR TSQIKMLLES GSKSDGYWSC LIPGQSAGTI LGISVRAECG
QSSAEDGPYM LQWVSQAPSR RPSPLPETKT KGDGMLYIES TTVSGRGEVS IRDTFMENSM
SYEEDLKGSG SLDMESVRCM EKSNPTVNFT EIKDISFDGG VLKGFKRFDS PVFHGGMGAS
ITERFNMSQL DKSERGMIRS INYRNNTVAF STEQAFEGMW NTRTEYSKFS KKMKADQRLN
GSFQIQKNIK FQD