Gene Mlab_0965 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlab_0965 
Symbol 
ID4794877 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanocorpusculum labreanum Z 
KingdomArchaea 
Replicon accessionNC_008942 
Strand
Start bp959089 
End bp960237 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content55% 
IMG OID640099629 
Producthypothetical protein 
Protein accessionYP_001030402 
Protein GI124485786 
COG category[C] Energy production and conversion
[S] Function unknown 
COG ID[COG2006] Uncharacterized conserved protein
[COG4231] Indolepyruvate ferredoxin oxidoreductase, alpha and beta subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAGTA TAACCGGCTC GGCACGATGC AGCACGTATG AAAGAGAACA GGTACGTGAA 
GCCGTGGAAA AAGCGATAAA AGCTTCCGGC GGCCTGCCGC GAGTCGACGG CAGAAAAGTT
CTGCTCAAGC CAAATCTCCT CTCTGACGCC GGAATCGACC GTGCGATCAC CACGCATCCG
GAAATCGTCT ACGCGGTCGG GAAAATGATC ATCGAAGCCG GCGGGATCTT AACGATCGCC
GACAGTCCGG GGGCCGGGAT CATCTACTCA CCGAGGGTTC TCAAGAGAGT CTACCACAAA
TGCGGGATCG AGAAGGTCGC CGACGAACTC GGCATCAAAC TCTCCTACGA CACCGGATAT
CAGGAACGTT CCTGTCCTGC CGGAAACGTC ATGAAACGAT TCACGGTCAT CAATCAGGCA
TGCGAGGCGG ACGTCATCAT CTCGGTCTGC AAACTCAAAA CCCACATGTT CACGCATTTT
TCCGGCGCCG TCAAAAACAC CTTCGGCGTC GTACCCGGGC TCGACAAACC CGTTTTTCAC
TCCAGATTCC CTGACGCGAT CGATTTTTCC GAGATGCTTG TTGACTTGAA CGAGCTCATC
GAGCCGGACT TCGTCGTCAT GGATGCGGTC GTCGGTATGG AAGGAAACGG CCCGATGGGC
GGCCAGCCCC GTGAAGTCGG CTACATCCTT GCTTCCCACT CGGTCTACGG GCTGGACATC
TCCGCACAGA CACTCGTCAA CATGCGGCCG GAATGCATCG GAACGACGAT GGCCGCCCTT
CGCCGCGGAC TCGTCGACGA GGTCGAGGTC GAAGGAGACG AGATCATACC GATCAAGGAC
TACGTCCACC CGTCCACCTA CAGCGGCCGC CAGAAAGAGT CCTGGCACAA AAAAAATATC
TACCGAAAAC TTCAAAGGGT CGGCAAAAGA TACGAACCCT CTCCCGTCAT CAACAACAAA
AAATGCATCG GCTGCGGGCA GTGTGCACGG ATTTGCCCGG TGAAAGCCGT TGAAATCATA
AACGACAAGG CGCTCTTCGA TCTGACCAAC TGTATTCGTT GTTACTGTTG TCATGAGATG
TGCCAGTATC ACGCGATCGA AATGAAGCGC TCGATGAGCG GAAAAGTCAT CCACAAAGTC
ATCCACTGA
 
Protein sequence
MKSITGSARC STYEREQVRE AVEKAIKASG GLPRVDGRKV LLKPNLLSDA GIDRAITTHP 
EIVYAVGKMI IEAGGILTIA DSPGAGIIYS PRVLKRVYHK CGIEKVADEL GIKLSYDTGY
QERSCPAGNV MKRFTVINQA CEADVIISVC KLKTHMFTHF SGAVKNTFGV VPGLDKPVFH
SRFPDAIDFS EMLVDLNELI EPDFVVMDAV VGMEGNGPMG GQPREVGYIL ASHSVYGLDI
SAQTLVNMRP ECIGTTMAAL RRGLVDEVEV EGDEIIPIKD YVHPSTYSGR QKESWHKKNI
YRKLQRVGKR YEPSPVINNK KCIGCGQCAR ICPVKAVEII NDKALFDLTN CIRCYCCHEM
CQYHAIEMKR SMSGKVIHKV IH