Gene Information |
Locus tag | Mthe_0401 |
Symbol | |
ID | 4462595 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | + |
Start bp | 407855 |
End bp | 411496 |
Gene Length | 3642 bp |
Protein Length | 1213 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 639699405 |
Product | hypothetical protein |
Protein accession | YP_842834 |
Protein GI | 116753716 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
|
|
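The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; neither convention is stated in the record itself.

```python
# Values copied from the Gene Information table above.
start_bp = 407855
end_bp = 411496
protein_length_aa = 1213

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print(gene_length_bp)       # 3642
print(gene_length_bp % 3)   # 0
print(expected_protein_aa)  # 1213
```

Under these assumptions the computed values match the listed gene length of 3642 bp and protein length of 1213 aa.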
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.051931 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGGAC TGGGCATTGT TTGTAGCCCG ACTACGCGAA GCTGTATCCG GTTTTCAGCT TGGCTTAGCC TCGGTCTCCT TTTCGTATCC CTCCTGCCAT GTACGATACC CTCAGCATGC GCCTGGCCCT TCTGGCTGCC CCTCCCTACC TCCTCCAGCC CAGAACTGGA GATGACAGTC AGAACCGATT TTACGGATGT CGCCGCCGGC GATGTACTGG ATTACACCTG CATCCTCAGA AACACCGGAG ATCGCACACT CTCAAATATA TCTGTTACAG ATGGCACCAG AGAGATAGGG TTTCGCGACA CCCTGCCACC AGGGGACGTC TTCAGCATAA GAGCGAAGAC GCCACCTATC TTATCACCAT CACGGTTCAA ACTCACAGCA AGCTCAGATG GGAGAGAGTT GGCATCCCAA GAGTTCAGCA TCGGGGTTGT GGAGAAGGAG GAGCGCAGTA CACCATACCC TGCTCTCAGG GCAACTGTTG AGAGCAGCAG GATCGGAGGA GGTTCTGAGC ACAGGATACG GATCGCGAAT ACTGGGAAGA CCGAGCTGAG ACGCGTTAAG ATAGTCGACC GCAGGAACAG CGTCCTCGGC ATAATACCTG CGCTTGCCCC AGGGGAGAGC CATACGCTTG TGATAGGCTC AGTGGATGTT ACAGGTCTTA GAGTCCTGGC TTTAAACGAG GCCGGAGAGC ATCTTGCCGG TGATGTCAGA TACACTGGGT CACGCCCGGG ATCCGCAATC GCTGCTGAAT CCTCAGAGAC CACCCTCCGG ACTATGGAGT ACAGGCCATC CAGCAGCCTG GATCTGATCC TAGAGACGAG CACTACCAGA GCACGAGCTG GGGAGAATAT CAGCTACCGA TGTACTGTCA TAAACTCTTG CAGCGACGTT ATCTACAACG TGGATCTGCT CTGCCATGAG AAGAGAATGA CAACCCAGTT CATCCTTCCC AAAAGATCTG TATGCATAGA GGGTAACTTC ACCATCAACT CCACGACAGA GATCAGAGCG AACGTATCCG GGAGCGACAG GAATGGTGCT CTAGTCACGA ATGAGACATC TGTGCTGATA TGGGTCGTCT CACCTGAGCT GAATATAAGT GCCAGCGCCG ACCCCCCAAA GATAATATCG GGAGGGGTGA CAAATATCAC CGTCAAACTG GAGAATTCCG GATCTGATTT GCTCAGGGAT ATCGTTGTGG AGGACAGCTT CGGCCATATA GGGGAGATAA AAGAGCTGAA GCCAGGTGAG GTGAGCCATG TAAGCAGGAA GCGACAGATC ACCTCATCGA TAGATGATAT TGTAATGGCG TCTGCCAGGG ACTCAACTGG CAAGGAGATT TATGCGTTCT CACAGATAGC TGTGCATGTC TACCGAGCCG GTATGAACCT CTCGCTGGAT CCATCAGAGC TTTTTCTCTA TCCGGGAGAG GAGACAGAGG TCCTGTGCAC AGTGGAGAAC ACCGGGGAGG TATCTCTCAG GAACGTGACG CTTGCTGGAA TCAGGAGCGC CAGGATCGGG GAGATCCCGC CAGGAACTGA GACGAGAATA GCTGCTTCCA TCTCATCCGA TATCTCAAAA GATGTAACGA TCACCGCAAC CGGATACGGG GATGGCGATC AGGTCGTCTC CGACACTGCG GTTCTATCGA TGAAGGTAAT CAGGCCGAAC ATAACGCTGT CTGTGATGCC GACTCGCGTT GTGACCCCTG CTGGAGATGA ATTCAACGTG AGCTGTCTCG TCTCCAACTC CGGCACGGAT CTCCTGCATG ATGTGCAGAT CTCGGAATCC 
GAGATCGGGA CTGTTGCATC CCTTGGAGAC CTGGCGCCTG GCGAGTTCAG AACAGTCACT CCTTCAATTT CCGTTAACAA GAACCGCACG CTGAGATTCA CGGCCGCCGG AAGAGACGCC CGCGGCAGGA TCTGGAGCGC AGATGCATCC TCTGAGGCGG TTGTGGTCGA GGCAGCTTTG AATCTCAAGA TCAGCGTCTC GCCAGAGGCG GCACGCTTTG GGGATAAGGT GAGGATCTCA TGCACCGTTG AGAACGTGGG CGGTGTGCCG CTGCAGGAGA TATTCGTAAC GAGCAGGGTC ATAGGGCCCA TGGGAAGCAT AGATTTCCTG GCCCCGAAGC AGAGGAAAAC TGTGACCATG ACGATACCGC TCACAATGGA CATCTCGGAT ACGGTTACAG CTGAGGGGCT GACCCCATCG AGAGGGCATG TTCTGGATAG GGAGCCCCTG AGCATAAGGC TTCTCAAGGG GCTCGAGCCG AGGCCATCGG GAAAATCGCA GAACAGCGCT CCGGCTGCAT CAAAAGCAGG CGTGTATGTT AACGATTCTT CAGTAAAAGA GATCGGCGCG AATGCATTAA ACGAGAGCGA ATCGATCGCA TCGCTCGCCA AAACAGTCAC ATCAAACACT ACCAGTTCCT TCACAGATGG TGCAGCTGGG AATTCATCCT CCGGAAACGC GGTCGCATCT GAGAACGTGA GCAGAGCTGT AAAGGAGAGG GCATCTGATG TGCTCAGCTC AGATGGACAC CGCGAAAGCA TTCCTACATC GAAAGATCAG TCTGCTCTCT CAATCCTCAG AGATGTGATC AGATACATCC AGGAGTTGAT AACGAGAAAG CAGATCGATG TCTCGAGACA GAACGCCACT GGCAGAGATG TGTCATCTGA ATCTGGCAAC ATTTCCCTGG AAGCATACGA TCCATCCCTG AACACAACAC TGGGGATAGA GAGCGTGAGG GATATTCGGG TTCCGGTCCG CATTCTGGAC GTCACTGCTA TACCGCCAGA ACCTGTTGCA GGTGCTCCTG TCAGGGTTGT CGCCCACATC AGAGGCGACT CAGATGTTGA TACTGTCGTC CTGGAGTACG GGGTGGCGGA CCAGAGCATA GCCAGAGCCG ATATGGTAAA CATGAAGAGG ACCTCCCAGA TCAAGATGCT CCTTGAAAGC GGTAGCAAGA GTGACGGCTA TTGGAGCTGC TTGATCCCCG GGCAGAGCGC GGGGACAATT CTGGGGATAT CGGTCAGGGC CGAATGCGGC CAGAGCAGCG CAGAGGATGG GCCGTACATG CTGCAGTGGG TCTCCCAGGC TCCATCAAGA AGACCGAGCC CTCTGCCTGA GACGAAAACA AAGGGAGACG GAATGCTCTA CATCGAGTCC ACGACTGTTA GCGGAAGAGG GGAGGTCTCG ATAAGAGACA CGTTCATGGA GAACTCTATG TCATATGAGG AAGACCTGAA GGGAAGCGGG AGCCTGGACA TGGAGTCTGT CAGGTGCATG GAGAAGTCGA ACCCCACGGT CAACTTCACC GAGATCAAGG ATATCTCCTT CGATGGCGGC GTTCTGAAGG GCTTTAAGCG GTTCGACTCG CCTGTCTTCC ACGGCGGGAT GGGCGCAAGC ATCACAGAGC GATTCAACAT GAGCCAGCTC GATAAGAGCG AGCGAGGGAT GATTCGATCT ATAAATTACA GAAACAACAC GGTGGCGTTC TCCACAGAGC AGGCGTTCGA GGGAATGTGG AATACCAGAA CCGAGTACTC CAAGTTCAGC AAGAAGATGA AGGCTGATCA GAGACTCAAT GGAAGCTTCC 
AGATACAGAA GAACATAAAG TTCCAGGACT AG
|
Protein sequence | MKGLGIVCSP TTRSCIRFSA WLSLGLLFVS LLPCTIPSAC AWPFWLPLPT SSSPELEMTV RTDFTDVAAG DVLDYTCILR NTGDRTLSNI SVTDGTREIG FRDTLPPGDV FSIRAKTPPI LSPSRFKLTA SSDGRELASQ EFSIGVVEKE ERSTPYPALR ATVESSRIGG GSEHRIRIAN TGKTELRRVK IVDRRNSVLG IIPALAPGES HTLVIGSVDV TGLRVLALNE AGEHLAGDVR YTGSRPGSAI AAESSETTLR TMEYRPSSSL DLILETSTTR ARAGENISYR CTVINSCSDV IYNVDLLCHE KRMTTQFILP KRSVCIEGNF TINSTTEIRA NVSGSDRNGA LVTNETSVLI WVVSPELNIS ASADPPKIIS GGVTNITVKL ENSGSDLLRD IVVEDSFGHI GEIKELKPGE VSHVSRKRQI TSSIDDIVMA SARDSTGKEI YAFSQIAVHV YRAGMNLSLD PSELFLYPGE ETEVLCTVEN TGEVSLRNVT LAGIRSARIG EIPPGTETRI AASISSDISK DVTITATGYG DGDQVVSDTA VLSMKVIRPN ITLSVMPTRV VTPAGDEFNV SCLVSNSGTD LLHDVQISES EIGTVASLGD LAPGEFRTVT PSISVNKNRT LRFTAAGRDA RGRIWSADAS SEAVVVEAAL NLKISVSPEA ARFGDKVRIS CTVENVGGVP LQEIFVTSRV IGPMGSIDFL APKQRKTVTM TIPLTMDISD TVTAEGLTPS RGHVLDREPL SIRLLKGLEP RPSGKSQNSA PAASKAGVYV NDSSVKEIGA NALNESESIA SLAKTVTSNT TSSFTDGAAG NSSSGNAVAS ENVSRAVKER ASDVLSSDGH RESIPTSKDQ SALSILRDVI RYIQELITRK QIDVSRQNAT GRDVSSESGN ISLEAYDPSL NTTLGIESVR DIRVPVRILD VTAIPPEPVA GAPVRVVAHI RGDSDVDTVV LEYGVADQSI ARADMVNMKR TSQIKMLLES GSKSDGYWSC LIPGQSAGTI LGISVRAECG QSSAEDGPYM LQWVSQAPSR RPSPLPETKT KGDGMLYIES TTVSGRGEVS IRDTFMENSM SYEEDLKGSG SLDMESVRCM EKSNPTVNFT EIKDISFDGG VLKGFKRFDS PVFHGGMGAS ITERFNMSQL DKSERGMIRS INYRNNTVAF STEQAFEGMW NTRTEYSKFS KKMKADQRLN GSFQIQKNIK FQD
|
| |
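The sequences above are displayed in space-separated blocks of ten characters, so the grouping has to be removed before any length or composition check. The following is a minimal sketch of that cleanup. The function names are hypothetical, and the stop codon set (TAA, TAG, TGA) is an assumption based on the standard stop codons used by translation table 11. The two sample strings are the first two blocks and the final blocks of the gene sequence shown above, not the full sequence.

```python
def normalize(seq: str) -> str:
    """Remove display spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    s = normalize(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def ends_like_complete_cds(seq: str, stops=("TAA", "TAG", "TGA")) -> bool:
    """Check that the length is a multiple of 3 and the last codon is a stop."""
    s = normalize(seq)
    return len(s) % 3 == 0 and s[-3:] in stops


# First two display blocks of the gene sequence.
head = "ATGAAGGGAC TGGGCATTGT"
# Last two display blocks of the gene sequence.
tail = "TTCCAGGACT AG"

print(len(normalize(head)))          # 20
print(gc_fraction(head))             # 0.5
print(ends_like_complete_cds(tail))  # True
```

Applying `gc_fraction` to the full gene sequence would give a figure to compare with the listed GC content of 54%; that result is not reproduced here.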