Gene Mboo_2044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMboo_2044 
Symbol 
ID5411171 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Methanoregula boonei 6A8 
KingdomArchaea 
Replicon accessionNC_009712 
Strand
Start bp2120616 
End bp2121911 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content58% 
IMG OID640869286 
ProductO-acetylhomoserine/O-acetylserine sulfhydrylase 
Protein accessionYP_001405201 
Protein GI154151583 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2873] O-acetylhomoserine sulfhydrylase 
TIGRFAM ID[TIGR01326] OAH/OAS sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.974269 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGAAA AGAAACTCAA CCTTGGAACG CTTGCCCTGC ACGCAGGGCA GGTTCCGGAC 
CCGGCCACCG GGTCACGGAC AGTACCGATC TACCAGACCT CCTCGTATGT GTTCAAGAGC
ACGGAACACG CTGCCAACCT GTTTGGTCTG CGGGAACTGG GGAACATCTA CACCCGGCTC
ATGAACCCGA CCACCGATGT GTTCGAGAAG CGCATTGCCG CCATCGAGGG AGGAACCGGG
GCGCTTGCCA CGGCATCAGG CCAGGCAGCA ATCACCTACG CGCTCCTCAA CATCACCCGG
CCCGGGGACG AGATCGTCTC TGCCGATAAC CTGTACGGCG GTACCTATGA ACTGTTCCAC
TACACGCTCC CGAAGCTCGG GAGGACGGTA GTCTTTGTTG ACTCCACCAA GCCCGAGGCG
TTCAGGAATG CAATTACTCC CAAGACCCGT GCCATCTATG CCGAGACCGT GGGTAATCCG
AAACTCGATA CCCCTGACTT TGAAGCGATT GCAAAGATCG CCCACGACAA TGGCATCCCG
GTGGTTGTGG ACAACACCAC CGGTGTCGGC CTTGTCCGCC CGATTGACCA TGGCGTAGAC
ATTGTCGTTC ATTCGGCCAC GAAGTACATC GGCGGCCACG GCAACTCCAT CGGCGGCGTG
ATCGTTGATT CGGGCAAGTT CGCCTGGAAC AACGGCAAGT TCCCCGAGTT CACCGAACCG
GACCCGGGCT ACCACGGCCT CAAATACTGG GATGCGTTCG GGAACTTCCC CGGCCTCGGA
AACGTTGCCT TCATCTTCAA GATCCGGGTT TCACTGCTCC GGGATACGGG AGCAGTCTTA
AGCCCGTTTA ACGCCTGGCT CTTCCTTATC GGCCTTGAGA CCCTCCACCT GCGTGTGCCA
CGCCACTCCG AGAATGCCTT TGCCGTTGCA AAGTTCCTCA AAGGTCATCC CAAGGTCGCA
TGGGTCAACT ACCCCGGGCT CCCGGAGCAC CCCAGCCACA CCTTAACCAA GAAATACCTC
CACGGCGGTT TCGGCCCCCT CGTCGGTGTC GGGATCAAGG GTGGGGAGAC CGCAAGCAGG
AAGTTCATCG ATTCCCTCAA GCTCTTCAGT AACCTCGCTA ATATCGGCGA TTCAAAGAGC
CTTGTGATCC ACCCGGCAAC CACCACCCAC CAGCAGCTTA CCGCTGAGGA ACAGGCCAAG
ACCGGCGTTA CTCCGGATGC CGTCCGCCTT TCCGTCGGTA CTGAGGATAT CGAGGATATC
ATCGAAGACC TTAATCAGGC CCTTGCCCAA GTCTAG
 
Protein sequence
MSEKKLNLGT LALHAGQVPD PATGSRTVPI YQTSSYVFKS TEHAANLFGL RELGNIYTRL 
MNPTTDVFEK RIAAIEGGTG ALATASGQAA ITYALLNITR PGDEIVSADN LYGGTYELFH
YTLPKLGRTV VFVDSTKPEA FRNAITPKTR AIYAETVGNP KLDTPDFEAI AKIAHDNGIP
VVVDNTTGVG LVRPIDHGVD IVVHSATKYI GGHGNSIGGV IVDSGKFAWN NGKFPEFTEP
DPGYHGLKYW DAFGNFPGLG NVAFIFKIRV SLLRDTGAVL SPFNAWLFLI GLETLHLRVP
RHSENAFAVA KFLKGHPKVA WVNYPGLPEH PSHTLTKKYL HGGFGPLVGV GIKGGETASR
KFIDSLKLFS NLANIGDSKS LVIHPATTTH QQLTAEEQAK TGVTPDAVRL SVGTEDIEDI
IEDLNQALAQ V