Gene Mlab_0544 details

Gene Information

Locus tag: Mlab_0544
Symbol:
ID: 4794809
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanocorpusculum labreanum Z
Kingdom: Archaea
Replicon accession: NC_008942
Strand:
Start bp: 518911
End bp: 520185
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 54%
IMG OID: 640099202
Product: hypothetical protein
Protein accession: YP_001029985
Protein GI: 124485369
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2873] O-acetylhomoserine sulfhydrylase
TIGRFAM ID: [TIGR01326] OAH/OAS sulfhydrylase
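
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the last codon of the gene is a stop codon that is not counted in the protein length. The variable and function names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 518911
end_bp = 520185
gene_length_bp = 1275
protein_length_aa = 424

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = span_bp // 3
expected_protein_aa = codons - 1


def record_is_consistent() -> bool:
    """Return True if the coordinates, gene length and protein length agree."""
    return (
        span_bp == gene_length_bp
        and span_bp % 3 == 0
        and expected_protein_aa == protein_length_aa
    )
```

Under these assumptions the span is 1275 bp, which is 425 codons, and dropping the stop codon gives the reported 424 aa.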


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.128255
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.0869849
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCTCTG AATATCAAAA GGAAACACTC AGCATTCATG CCGGACAAAA ACCGGATGAA 
GCGACAGGGG CACGAACCGA ACCGATCTAC ATGACCACGG CATATGTCTT CAAAGACGCA
AAGGAAGCGG CTGCACGATT CGATCTCTCG CTGGACGGAA ACATCTACAC CAGGCTTACG
AATCCAAACA ACACCTCGTT TGAAAAACGG ATCTCCGCGA TCGAGGGAGG GACAGCCGCA
ATCAGTACTG CCTCGGGGAT GGCGGCAATA AGCACCCTTG TCCTTGCCCT TACCAACCCG
GGTGACGAGA TTGTTTCGGC CGATAATCTG TACGGAGGGA CATTCGAGCT TTTCAGCCTG
ACCCTCCCGA ACTTCGGACG GACGGTCCGG TTTGTTCCCT CGAACGATCT CGAAGCCTTA
AAGGCCGCCA TCAATGAAAA GACACGGGCC GTCTACTTTG AATCGCTCGG CAACCCCAAA
CTTGACATCC CGGATTTCGA GGAGATCGGA AAAATAGCTC ACGAAGCCGG AGTTCCTTTT
ATTGTGGACA ACACGGTAGG GATCGGAACG GTCCGTCCTC TCGAGCATGG AGCGGATCTT
GTTGTTATGT CGGCAACAAA ATATGCCAAC GGACATGGGA ATTCCCTTGC AGGCGTGATC
GTCGAAAACG GCAGATTCCC CTGGGACAAC GGCAAATTCC CCAAGTTCAC CGAACCTGAT
CCGGCATACA AAGGTCTCGT GCACTACAAA GCATTCGGTC CGGCAACCGT ATCGGCCAGT
ATTCGAATCT CCCTGATGCG GGATCTTGGG GCGACCCTTT CACCGTTCAA CGCCTGGCTC
ACTTCGATCG GTCTTGAAAC GCTCTACCTC CGTGTCGCCC GCCATGCGGA GAATGCCCTT
ATTGTTGCGA AGCATCTCGC ATCCCACGAA AAGGTCGCAT GGGTCAACTA TCCGGGTCTT
CCAGGGCATC CCTCGGAAAA GAACCGGGAA AAATACTTCG GCGGATCCGG CGGTCCCCTT
CTCACCTTCG GCGTCAAAGG AGGATATGAG GCGGCCGTCA CCGTACAGAA TAATGTCCAG
CTCATCTCGC TTCTGGCAAA CATCGGCGAT GCAAAAACCC TCATCATCCA TCCAGCCTCG
ACGACCCATC AGCAGCTTAC CGAAGAAGAA CAGATTTCCA CAGGGGTCAG ACCCGATACG
ATCCGCCTCT CGGTCGGTCT TGAAAATCCG ATCGACATCA TCGCCGATCT GGACCATGCC
CTCTCATACA TCTAG
 
Protein sequence
MVSEYQKETL SIHAGQKPDE ATGARTEPIY MTTAYVFKDA KEAAARFDLS LDGNIYTRLT 
NPNNTSFEKR ISAIEGGTAA ISTASGMAAI STLVLALTNP GDEIVSADNL YGGTFELFSL
TLPNFGRTVR FVPSNDLEAL KAAINEKTRA VYFESLGNPK LDIPDFEEIG KIAHEAGVPF
IVDNTVGIGT VRPLEHGADL VVMSATKYAN GHGNSLAGVI VENGRFPWDN GKFPKFTEPD
PAYKGLVHYK AFGPATVSAS IRISLMRDLG ATLSPFNAWL TSIGLETLYL RVARHAENAL
IVAKHLASHE KVAWVNYPGL PGHPSEKNRE KYFGGSGGPL LTFGVKGGYE AAVTVQNNVQ
LISLLANIGD AKTLIIHPAS TTHQQLTEEE QISTGVRPDT IRLSVGLENP IDIIADLDHA
LSYI
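
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The sketch below shows one way to do this and to compute a GC fraction comparable to the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first and last lines of the gene sequence are included here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First and last lines of the gene sequence, copied from the record above.
first_line = clean_sequence(
    "ATGGTCTCTG AATATCAAAA GGAAACACTC AGCATTCATG CCGGACAAAA ACCGGATGAA"
)
last_line = clean_sequence("CTCTCATACA TCTAG")

# The gene starts with an ATG start codon and ends with a TAG stop codon.
starts_with_atg = first_line.startswith("ATG")
ends_with_tag = last_line.endswith("TAG")
```

To check the reported GC content, paste the full gene sequence into `clean_sequence` and pass the result to `gc_fraction`.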