Gene Information |
Locus tag | Mpal_0049 |
Symbol | |
ID | 7272218 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | + |
Start bp | 49008 |
End bp | 51887 |
Gene Length | 2880 bp |
Protein Length | 959 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643568707 |
Product | HEAT domain containing protein |
Protein accession | YP_002465167 |
Protein GI | 219850735 |
COG category | [C] Energy production and conversion |
COG ID | [COG1413] FOG: HEAT repeat |
TIGRFAM ID | |
|
|
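The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Residue count implied by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode a residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length(49008, 51887)
print(length)                           # 2880
print(expected_protein_length(length))  # 959
```

Both values agree with the Gene Length (2880 bp) and Protein Length (959 aa) fields of this record.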
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGATTT TTGATATATT TACCCCGGAT ATCCAGAAGA TGCGCGATGC AGGTGATGTC CGGGGGCTGG CGAAGGCGCT TGGGGATTCG AATCTGGAGA TCGCCCGGAA GGCGGCAGCA GCGCTGGTGG GTCTTCCCTC GTCCAGTGCC GTGATTCCGT TGATCCGGTC ACTTTCATCC CCTGACAAGG ATATCAGGCG TCTGAGCACG GCTGCACTTG GTGCGACCGG GGACCCACAT GCGCTCCCTG CCGTTCTGGA AGCAACGGAG GATGGAGATC TCGGCGTAAG GCTGGAGGCA GTAAAAGCAC TTGCAAAATT CCACGATCCT GAAGCAAATT TGCTTATTAC CCGGTTTACC GCTGACAGCA ATCTGGACAT CCGGATGGCG GCAGTCTCTG CACTGGGGCA GACCGGGGAT CCCACATCGA TCGAACCTCT CCTCCATCTG CTCGTGGATC CCCATTACGG CATACGTGAG GTCTCTGCAT ATGCACTCGA CTCTCTCGGC TGGGTGCCGG CGAACGATCG GGATAAGGCT TTCTATTTTA TCGCAAAACG GGAATGGAGA GGCCTGTTCA ACCTCCAGTC GGTGGCAGTA AAAGTTCTTG TCTGGGCATT GAAAGATGAA TATTATGCTG TCAGGCAGGG GGCGGCCTCG ACCCTGGGAA AACTCAAGGA TCTCCGGGCT GTCAGGGCAC TGGTCTCTGC TCTTTCCGAT GAAGAGAGCA GTGTTCGGAT GGAAGTGGTC TCCGCTCTGG GTGAGATAGG CGACCTTCGG ACTGTCCCTA TCCTGGTACG GGTACTTGAC GATGATTATA TCGGTGTGCG CATGACTGCC GCCTCTGTTC TCGATGGCAT GGGATGGAAA CCGTCAACTG AGAACGACCT GATCCTCTAT CTCCTCGCCA AGGAACGATG GATGGATATT GCAGTTATCG GGAAACGATC GACGCAGGTT CTGGCAAAAC GACTCAATGA TCCTAACTAT AGCACCCGTG ATGAAGTCGG AAAGATTCTG CAGAGACTGG GTGAGCATGC ACAAGAACCG ATGCTTTCTG CATTGAAGAA TCCTGACCCG GATGTCCGGT CAAGGGCTGT GTGGATCCTC GGGAATATCA GGACCAGGCA GGCTGTTGGC CCGATCATTC GAATCCTCAG CGATGACAAT CCAGCATGTA GAGAGGAAGC TGTCAGAGCG CTGGGAAAGA TCGGGGACCC CCGTGCCATC CCGTTCCTCA ACCGTGTCCT CGGGCGTGAA ATTCTTCTCG CCCCGGTGGC GATCCGGGCG CTGGGGCAGA TCGCTCATCC CGCTTCTGTC AAGGCATTGA TGCCGTATCT TGGGAGTGCA GACCGGGAGA TCCGGCTGCA TACGATCCTG GCACTGGGAG AGAACGGTGA TAAAAAGATA TCAGGTGCCA TCGCTCATGC GGTGAAGGAT TCCGATCCTG AGGTACGGGT TGCTGCTATT ACCGTTATCA GCAAGTTCCC TTCGAACGAG GTGTACGCCC TGATCCGGGC TGCACTCGAA GATGTTCATC CCGATGTCCG GTATGCAGCC CTCCTCGCCA TCTCCACGTG GCAGGCCGAC GACACGATCC CACTGATGGT CAGGCGACTG GAAGATGAAG ATCAGAAGAT CTGCAGGATT GCTGCACAGG CGCTGAATCG ACGGCGCTGG CAGCCGGCAT CCCTGCCGGA GCGAAGAAAT TACCTGATTG CTATGGGTCA GTGGCGGGAT CTGGAACGTC TGGACTATGT CAATAAAGAG CTGCAGAAGA AGGAATCGGA TCTCGCATGC 
CAGATCGTCC CCCGGGGAAG CATCATGGAT CCTGTACCTC CGCTGTCGGG ATCTCCAGGA GAGCGGGTTG ATGGAGATCC AACGAGACCG GAGGGTGCAG GAATAACCGG TGAACCCGAT AGCCTACTGG CACTGATCCG GACGCTGGGG GACCAGAAGG AGATGCAGGC ACTCCGGTGG AGATCTGCTG AAGGGCTCGG GGGACTGGGC GATAAGCGGG GGGTTGAGCC ACTCATTGTC GCGTTATTTG ACCCGGATTC TGAGTTACGT TGGCGTGCTG CGTTTTCTCT AGGCCTGCTG CATGATGAGC GTGCCATCGA ACCACTTGCA ACCGCCCTCC GCAGTGATGA TCAGCTGACT GTCAGGGTAC GGGTTGCAGA AGCGCTCGGA CAATTTAAAA AACCGGTCGT CATCAGGCCG CTCATTCACT CACTTGGTGA CGTTCATCCG GATGTCAGGG ATATGGCAAT CCGTTCGCTT GGAGAGATCG GAACTGAGAG TGCGATTAAC GCAATCCTGA CCGGGCTCCT CGATGCGGAT GAGACTGTCC GCGAGAGCGT CATCGATACC CTCTTAAAAC TCGGTGCCAT GGCAGGCAGG TCGTTGGTGA AGAATCTCAA AAATAGAAAC CCAGAGGTCA AAAAAGGAGT TTTGACCGTT TTTATGCGGA TGAAACCAGC GATCAGTTTC CGCATCCTGG TCTCTGAACT GGAGAATGGT GACTGGGAAG TCCGCCAGAT GGTTGCCGCT GCACTGGATT CTCTTGACTG GCAACCAGGG GATCCCTTTC AGAGGGCGAT ATATCTGTTT GCACAGCGCG ACTGGAGAGC CCTTGAAGCG CAGGGGAAGA CCGCTGAGGG AATTCTGATC CGGGGGACTG CCGACAGCGA TCCCGCGATT CGGAGGGCTT CAGTGGAACT TCTGGGTTTG ATTGGGGACC GTCGTACTAT TCCTTCTCTG ACCGAGGTCA TGTATGACGA GAACCGGGAA GTCCGTCTCT CCTCGATAAA AACTCTGCTG AAGAGGCAGG GCGGGGAGTC CTCCAGGCTT ATCTCCACTC TGAAACGAAA CGTGAAATAA
|
Protein sequence | MGIFDIFTPD IQKMRDAGDV RGLAKALGDS NLEIARKAAA ALVGLPSSSA VIPLIRSLSS PDKDIRRLST AALGATGDPH ALPAVLEATE DGDLGVRLEA VKALAKFHDP EANLLITRFT ADSNLDIRMA AVSALGQTGD PTSIEPLLHL LVDPHYGIRE VSAYALDSLG WVPANDRDKA FYFIAKREWR GLFNLQSVAV KVLVWALKDE YYAVRQGAAS TLGKLKDLRA VRALVSALSD EESSVRMEVV SALGEIGDLR TVPILVRVLD DDYIGVRMTA ASVLDGMGWK PSTENDLILY LLAKERWMDI AVIGKRSTQV LAKRLNDPNY STRDEVGKIL QRLGEHAQEP MLSALKNPDP DVRSRAVWIL GNIRTRQAVG PIIRILSDDN PACREEAVRA LGKIGDPRAI PFLNRVLGRE ILLAPVAIRA LGQIAHPASV KALMPYLGSA DREIRLHTIL ALGENGDKKI SGAIAHAVKD SDPEVRVAAI TVISKFPSNE VYALIRAALE DVHPDVRYAA LLAISTWQAD DTIPLMVRRL EDEDQKICRI AAQALNRRRW QPASLPERRN YLIAMGQWRD LERLDYVNKE LQKKESDLAC QIVPRGSIMD PVPPLSGSPG ERVDGDPTRP EGAGITGEPD SLLALIRTLG DQKEMQALRW RSAEGLGGLG DKRGVEPLIV ALFDPDSELR WRAAFSLGLL HDERAIEPLA TALRSDDQLT VRVRVAEALG QFKKPVVIRP LIHSLGDVHP DVRDMAIRSL GEIGTESAIN AILTGLLDAD ETVRESVIDT LLKLGAMAGR SLVKNLKNRN PEVKKGVLTV FMRMKPAISF RILVSELENG DWEVRQMVAA ALDSLDWQPG DPFQRAIYLF AQRDWRALEA QGKTAEGILI RGTADSDPAI RRASVELLGL IGDRRTIPSL TEVMYDENRE VRLSSIKTLL KRQGGESSRL ISTLKRNVK
|
| |
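The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how one might strip that grouping and confirm that the nucleotide sequence translates to the listed protein is shown below. It uses the standard codon assignments, which translation table 11 shares with table 1 (the two differ only in permitted start codons). The helper names are hypothetical, and the example inputs are short excerpts copied from this record rather than the full sequence.

```python
from itertools import product

# Standard amino acid assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from this record.
print(translate("ATGGGGATTT TTGATATATT TACCCCGGAT"))  # MGIFDIFTPD
```

The output matches the first block of the protein sequence. Applying `translate` and `gc_fraction` to the full gene sequence would allow the Protein sequence and GC content fields to be checked in the same way.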