Gene Information |
Locus tag | Mpal_2732 |
Symbol | |
ID | 7270840 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | - |
Start bp | 2860286 |
End bp | 2863687 |
Gene Length | 3402 bp |
Protein Length | 1133 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643571323 |
Product | HEAT domain containing protein |
Protein accession | YP_002467718 |
Protein GI | 219853286 |
COG category | [C] Energy production and conversion |
COG ID | [COG1413] FOG: HEAT repeat |
TIGRFAM ID | |
|
|
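The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch in Python, not part of the record itself: the constant and function names are illustrative, and it assumes that Start bp and End bp are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information table above.
START_BP = 2860286
END_BP = 2863687
GENE_LENGTH_BP = 3402
PROTEIN_LENGTH_AA = 1133


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon (assumed to be present)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 3402
print(expected_protein_length(GENE_LENGTH_BP))  # 1133
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields.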
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGGCTGC ATGATCTCTT CAGGCCCAAC CTCGGCAAGC TGGCCGAACG ACAGGACGTC AGCGGGCTGA TTGCGGCCCT CTCTCATCGG GATCCAATAG TACAGCGGCA GGCCAGTACC ATTCTCCGTT CGATGGGAAG GATCAGCGTT CCACTGCTGC TCGATACTCT CCAGCATGGA AACGCTGAGC TTCAGCAGCG AGTGTCTGCG GTGCTCAAGG AGATCCCCAA CCTCCCGGTG GAGACCCTGA TCACTGCCCT CGGTGATGAC ACCGGCCATA CCTTTCCCGG CGTCGCGACG TTGATCGCAA CGATGGGCGA TCAGGTAGTG CTACCCCTGA TGGGAGCGCT GAACAGTGAG TATGAGGAGA TCCGGAAGGG GGCTATCGTT GCGCTCGGGA TGATGGGTGA ACCGGCCAGG CCGCAGTTGC TGGCTGCCCT GATGCACCCA TCGTACCGGA TCCGATACGG GGCGGCGCTG GCCCTCGATC GGACTGGGTG GGTGCCACAG GATGAGCGGG AGAAGTTTCG GTACCTGTTT GCGACTGGCC AGTGGTCGGA ACTGGTGAAG AAACGCAAGG CCGTGGTCCT CCCGCTGCTG GCCGCTCTGG AAGACAGTCA TTATGCGGTG AGACGGGATG TCGCTCTGGC CCTCGGTTCG ATTGGGGATC TTCGGGCTAT CGATCCCCTT CGACGGCTGC TGATCACTGA TCCCGAGGAG ACGGTCCGTG CAGGAGCTGC CGAGGCGCTC GGTCTCCTCG TCGACGACCA GGCGATCCCG GCGCTTCGGG CTGCGATCAA TGACCATGCG CACAGTGTCA GAATGGCGGC AGCTCAGGCC CTGGCTAGGA TCAACTGGGT TCCTGATGAC GAACAGGAGC AGTTGACCCT GTTGGTGGCT ACAGAGCAAT GGTCCCAGTT GATAAGGCAT GGGCCTGCTG CGATTCCGGT GTTGATCACG GCCCTCGGCG ATGAATATTA TGGGATCCGG ACCGGTAGCG CCGAGGCCCT CTGGAGAATC GGCGATCCTG CCGTGAAATC TCTGATCCGG GCCCTCCACG ACCAGAACCC AGCCATTCGG GCCGGATCTA CCGGTGTGCT GATGCGGCTC GGTGCCTTTC CCAGGGAACA GTCTGCACCG ACCTATATCG AACAGATCAG GCCCAACATC ACTCCCCCAC TGCATATCAG ACGGGAGGTC TATCCGGATA TCTCATGGAC CGAGGACATT TCAGCATCCG GGTTGACACC CACACCCTGT GAGCCCGATC CTGTCGAGGA CGATCTGGCC CTCCTCCTCT CCCCGACCGG GACGCAGTCG CCAGAGGAGA TGATGACGGC AGAGCCGGCG GTGGGAGGAT CCATTGAATT TACCCGGGAT ATTGAGGATG GACCATCCCT TCCCTTATCA GATGATCTGG AGGATGCGGA CCAGATCCTC GCTGGGCTGG ATGATTCTCT GCCCCCGATC GATCTCTCCT CTCTCCAATC GACGACGATG ACTATTGAGG CACCACCATT CTGGGATCTC CCAGTTGAGG ATGATCTGGC GACCCCAGAA CCTCTCTCTT CCTGGCTGGA TTCAGATGAC CTGATCATCC CACAGATGGA TGACGGGACA CCTTCCATCA CCCCCTCTTC GGTCCTTCCA GTTGTGGGTG GTCCAGGATT GGAGACTGCC GGCTCCATGG ATAATGAAGC GTCCGGCTGG GCTCCGGTCC AGGAGTCGGC CTTCCTCTCC CTGATGGAGA CACTCCAGCG GAATTCTCCT GATGATGAAG AACGATCCCA GCTGATCTCC 
TCGTACCTTG CCGATCAACA TGGAGTTCTC TCCAGTGACC AGCCCTCGGT GATTGTCAGC GCCGTCGGTC CAGGCCTCTC CGCAGAGGAC GATGCGGTCA GGATGGTGGC TGCTGAGGTG CTCGGACAGA TCGGGATCGA ATCGGTGCCT CTGCTGCTCA AGGCGCTTCA GGATCCATAC TACCAGGTCA GGGTCACAGC TGCCGACGCT CTCGGTCAGA TCCGGGATCA GCGAGCTCTC TCTCCGCTGG TCAACCTGCT GATCGAGGAC GAGGATGAGG AGGTCAGGAG TCAGGCGGCT CATGCCCTTG GAGAGCTCCG GAACCCTGCT ACAACCGAGG TTCTGGTCCG GGCATTGCAT GATCAGTACC CGGTGGTCAG AGGGGCCGCG GCCAGGGCCT TGGGCATGAT TGGAAATCGG CAGGGGATCG GCCCCCTCGT GGCCCTCTTC GAGTCCGGTG ATCGTGCTAC GGCAGAGGAT GTGATCTGGG CCCTGAGGAC GCTCGGGGCT GATCAGGATC TTGCCATGCA GGCCTCAGAC ACTGCAGATG AGAGACGACA AAAGGATGCT AAGATGTTGC TGGCCCGCCT TCATGAAGAA GAAACAACCC TCTCCCCTCT CCCTCCCAGG CAGTTTCATG CCGTGCTGTC AGGGGAGGTC TCTCCTATTC ATGAGATCAC GATTCCTCAG GAAGGCCCTC CCCCGATTGA GGTCACCCAA TTGAACCCAA CCCCTCGGTC AGTTCCACCG CTCCCTGAAC CTATCGACGA GTCTGGAGGG CTGGAATCCG GTACAGTTCC TTCCGAGGAT CCTGATTTTG TTCTACTGAT CGAGGCCTCT GTGCATGAAG AGGCCCGGGT TCGGAAGAAG GTCGCTCATG CCCTTGCCAA GAGCCATGAC CCTCGGGCTG GGGACCTGCT CAGAACGCTG CTGACTGATG AGGATGAGGA CGTTCGGGCA TCTGCATCCG CGTCACTCGG GTTGCTCGGA GACCAGGCTG CGGTGCCGGA CCTGATCACT GCACTCGAGG ATCAGAGCGA CGAGGTCGTC ATGCGGGCAG CCCGGTCGCT CGGTGAAATT CAGGACCCTG CTGCTGCTGC ACCATTGATC CAGTTGCTGG ATGCCGACGA CTATGGGGTC AGACAGGTCG CTGGGGAGGC TCTGACAGCA CTCGGTTCAG GGGCCACCGA GGCGCTGGTG GAAGCCCTCA ACGATCCGGA GAAGGAGATC CGCGCCGGCT CTGCAGAGAG CCTCGCGGCT GCCGGTTGGA CACCTACAGA TACCGTCCAG GAGGTAGGAT ACCTGATCGC AGAGGAGCGG TGGTCTGAAA TCGGCCGTTT TGGTGAGGAT GCCCTCCCTC CTCTCGCCCA GTTTATCAAC GATCCAGATC CTGAGATCCG GCTCGGGGTG GTCAGTGCTC TTGCCAAGAT CGGTGGGCCT TCTGCTGCGG TTCTCCTCGA ACATGCCGCT GCCGACTCAT CGTACCTGGT TCGAAAGCGG GCAGGGCTGC TCCTTCGGGA GGAGAGCGCG ACCGAACAGT CTGAGCAACC AGAGCAACTT GAACTTGAGG AACCGGTTCC GGAGATCCAG GAAGAGGAGT GA
|
Protein sequence | MWLHDLFRPN LGKLAERQDV SGLIAALSHR DPIVQRQAST ILRSMGRISV PLLLDTLQHG NAELQQRVSA VLKEIPNLPV ETLITALGDD TGHTFPGVAT LIATMGDQVV LPLMGALNSE YEEIRKGAIV ALGMMGEPAR PQLLAALMHP SYRIRYGAAL ALDRTGWVPQ DEREKFRYLF ATGQWSELVK KRKAVVLPLL AALEDSHYAV RRDVALALGS IGDLRAIDPL RRLLITDPEE TVRAGAAEAL GLLVDDQAIP ALRAAINDHA HSVRMAAAQA LARINWVPDD EQEQLTLLVA TEQWSQLIRH GPAAIPVLIT ALGDEYYGIR TGSAEALWRI GDPAVKSLIR ALHDQNPAIR AGSTGVLMRL GAFPREQSAP TYIEQIRPNI TPPLHIRREV YPDISWTEDI SASGLTPTPC EPDPVEDDLA LLLSPTGTQS PEEMMTAEPA VGGSIEFTRD IEDGPSLPLS DDLEDADQIL AGLDDSLPPI DLSSLQSTTM TIEAPPFWDL PVEDDLATPE PLSSWLDSDD LIIPQMDDGT PSITPSSVLP VVGGPGLETA GSMDNEASGW APVQESAFLS LMETLQRNSP DDEERSQLIS SYLADQHGVL SSDQPSVIVS AVGPGLSAED DAVRMVAAEV LGQIGIESVP LLLKALQDPY YQVRVTAADA LGQIRDQRAL SPLVNLLIED EDEEVRSQAA HALGELRNPA TTEVLVRALH DQYPVVRGAA ARALGMIGNR QGIGPLVALF ESGDRATAED VIWALRTLGA DQDLAMQASD TADERRQKDA KMLLARLHEE ETTLSPLPPR QFHAVLSGEV SPIHEITIPQ EGPPPIEVTQ LNPTPRSVPP LPEPIDESGG LESGTVPSED PDFVLLIEAS VHEEARVRKK VAHALAKSHD PRAGDLLRTL LTDEDEDVRA SASASLGLLG DQAAVPDLIT ALEDQSDEVV MRAARSLGEI QDPAAAAPLI QLLDADDYGV RQVAGEALTA LGSGATEALV EALNDPEKEI RAGSAESLAA AGWTPTDTVQ EVGYLIAEER WSEIGRFGED ALPPLAQFIN DPDPEIRLGV VSALAKIGGP SAAVLLEHAA ADSSYLVRKR AGLLLREESA TEQSEQPEQL ELEEPVPEIQ EEE
|
| |
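The sequences above are printed in space-separated blocks of 10 characters, so they need to be cleaned before any analysis. The sketch below (Python, with hypothetical helper names) strips the block spacing, computes GC content, and checks that a sequence looks like a complete CDS. The set of start codons is an assumption covering only the common starts of translation table 11, and the toy input is a short made-up string built from the first block of the gene sequence, not the full record.

```python
# Assumption: common start codons under translation table 11 (not exhaustive).
START_CODONS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if length is a multiple of 3 and start/stop codons are present."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# Toy input for illustration only (18 bases: ATG TGG CTG CAG GAG TGA).
toy = clean_sequence("ATGTGGCTGC AGGAGTGA")
print(len(toy), round(gc_percent(toy), 1), looks_like_complete_cds(toy))
# 18 55.6 True
```

Passing the full Gene sequence field through `clean_sequence` should, if the record is internally consistent, give a length of 3402 and a GC percentage close to the reported 60%.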