Gene Information |
Locus tag | Mlab_0965 |
Symbol | |
ID | 4794877 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanocorpusculum labreanum Z |
Kingdom | Archaea |
Replicon accession | NC_008942 |
Strand | + |
Start bp | 959089 |
End bp | 960237 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640099629 |
Product | hypothetical protein |
Protein accession | YP_001030402 |
Protein GI | 124485786 |
COG category | [C] Energy production and conversion; [S] Function unknown |
COG ID | [COG2006] Uncharacterized conserved protein; [COG4231] Indolepyruvate ferredoxin oxidoreductase, alpha and beta subunits |
TIGRFAM ID | |
| |
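The Start bp, End bp, Gene Length and Protein Length fields above are tied together by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the CDS ends in a stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not contribute a residue.
    return codons - 1 if includes_stop else codons


length = gene_length_bp(959089, 960237)
print(length, protein_length_aa(length))  # prints: 1149 382
```

Under these assumptions the computed values agree with the listed Gene Length (1149 bp) and Protein Length (382 aa).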
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAGTA TAACCGGCTC GGCACGATGC AGCACGTATG AAAGAGAACA GGTACGTGAA GCCGTGGAAA AAGCGATAAA AGCTTCCGGC GGCCTGCCGC GAGTCGACGG CAGAAAAGTT CTGCTCAAGC CAAATCTCCT CTCTGACGCC GGAATCGACC GTGCGATCAC CACGCATCCG GAAATCGTCT ACGCGGTCGG GAAAATGATC ATCGAAGCCG GCGGGATCTT AACGATCGCC GACAGTCCGG GGGCCGGGAT CATCTACTCA CCGAGGGTTC TCAAGAGAGT CTACCACAAA TGCGGGATCG AGAAGGTCGC CGACGAACTC GGCATCAAAC TCTCCTACGA CACCGGATAT CAGGAACGTT CCTGTCCTGC CGGAAACGTC ATGAAACGAT TCACGGTCAT CAATCAGGCA TGCGAGGCGG ACGTCATCAT CTCGGTCTGC AAACTCAAAA CCCACATGTT CACGCATTTT TCCGGCGCCG TCAAAAACAC CTTCGGCGTC GTACCCGGGC TCGACAAACC CGTTTTTCAC TCCAGATTCC CTGACGCGAT CGATTTTTCC GAGATGCTTG TTGACTTGAA CGAGCTCATC GAGCCGGACT TCGTCGTCAT GGATGCGGTC GTCGGTATGG AAGGAAACGG CCCGATGGGC GGCCAGCCCC GTGAAGTCGG CTACATCCTT GCTTCCCACT CGGTCTACGG GCTGGACATC TCCGCACAGA CACTCGTCAA CATGCGGCCG GAATGCATCG GAACGACGAT GGCCGCCCTT CGCCGCGGAC TCGTCGACGA GGTCGAGGTC GAAGGAGACG AGATCATACC GATCAAGGAC TACGTCCACC CGTCCACCTA CAGCGGCCGC CAGAAAGAGT CCTGGCACAA AAAAAATATC TACCGAAAAC TTCAAAGGGT CGGCAAAAGA TACGAACCCT CTCCCGTCAT CAACAACAAA AAATGCATCG GCTGCGGGCA GTGTGCACGG ATTTGCCCGG TGAAAGCCGT TGAAATCATA AACGACAAGG CGCTCTTCGA TCTGACCAAC TGTATTCGTT GTTACTGTTG TCATGAGATG TGCCAGTATC ACGCGATCGA AATGAAGCGC TCGATGAGCG GAAAAGTCAT CCACAAAGTC ATCCACTGA
|
Protein sequence | MKSITGSARC STYEREQVRE AVEKAIKASG GLPRVDGRKV LLKPNLLSDA GIDRAITTHP EIVYAVGKMI IEAGGILTIA DSPGAGIIYS PRVLKRVYHK CGIEKVADEL GIKLSYDTGY QERSCPAGNV MKRFTVINQA CEADVIISVC KLKTHMFTHF SGAVKNTFGV VPGLDKPVFH SRFPDAIDFS EMLVDLNELI EPDFVVMDAV VGMEGNGPMG GQPREVGYIL ASHSVYGLDI SAQTLVNMRP ECIGTTMAAL RRGLVDEVEV EGDEIIPIKD YVHPSTYSGR QKESWHKKNI YRKLQRVGKR YEPSPVINNK KCIGCGQCAR ICPVKAVEII NDKALFDLTN CIRCYCCHEM CQYHAIEMKR SMSGKVIHKV IH
|
| |
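The sequences above are printed in space-separated blocks of 10 characters. The sketch below shows one way to remove the grouping, compute a GC fraction, and translate the gene sequence so the result can be compared with the listed protein sequence. It is an illustration under stated assumptions, not the database's own method: the helper names are hypothetical, the codon assignments are those of the standard code (which translation table 11 shares for internal codons), and alternative start codons are not handled because this gene begins with ATG.

```python
# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def ungroup(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    dna = ungroup(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = ungroup(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
print(translate("ATGAAAAGTA TAACCGGCTC GGCACGATGC"))  # prints: MKSITGSARC
```

The translated prefix matches the first block of the listed protein sequence. Passing the full Gene sequence field to `translate` and `gc_fraction` would allow the same comparison against the complete Protein sequence and the reported GC content.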