Gene Mboo_1994 details

Gene Information

Locus tag: Mboo_1994
Symbol:
ID: 5410418
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Methanoregula boonei 6A8
Kingdom: Archaea
Replicon accession: NC_009712
Strand:
Start bp: 2058685
End bp: 2060088
Gene Length: 1404 bp
Protein Length: 467 aa
Translation table: 11
GC content: 57%
IMG OID: 640869236
Product: O-acetylhomoserine/O-acetylserine sulfhydrylase
Protein accession: YP_001405151
Protein GI: 154151533
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2873] O-acetylhomoserine sulfhydrylase
TIGRFAM ID: [TIGR01326] OAH/OAS sulfhydrylase
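
The length fields in this record follow from the coordinates by simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (the listed values are consistent with that), and the function names are hypothetical, not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp):
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(2058685, 2060088)
print(length_bp, protein_length(length_bp))
```

With the coordinates listed above, this gives 1404 bp and 467 aa, matching the Gene Length and Protein Length fields.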


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGGTG ATATCTTCGG ATTCCTTTCG AGGGCAAATC CGTTCTCAAA GAAAGATGCA 
GACAAGAAAG AACAGAGTAA GGAACAAAAC AAAACCGAGG CGATAGCCAT TACCGAAAAA
AAACTCAACC TTGGAACGCT TGCCCTGCAC GCGGGGCAGG TTCCGGACCC GGCCACCGGG
TCACGGACAG TACCGATCTA CCAGACCTCC TCGTATGTGT TCAAGAGCAC GGAACACGCT
GCCAACCTGT TTGGTCTGCG GGAACTGGGG AATATCTACA CCCGGCTCAT GAACCCGACC
ACCGATGTGT TCGAGAAGCG CATTGCCGCC ATCGAGGGAG GAACCGGGGC GCTTGCCACG
GCATCAGGCC AGGCAGCAAT CACCTACGCG CTCCTCAACA TCACCCGGCC CGGGGACGAG
ATCGTCTCTG CCGATAACCT GTACGGCGGT ACCTATGAAC TGTTCCACTA CACGCTCCCG
AAGCTCGGGA GGACGGTAGT CTTTGTTGAC TCCACCAAGC CCGAGGCGTT CAGGAATGCA
ATTACTCCCA AGACCCGTGC CATCTATGCC GAGACCGTGG GTAATCCGAA ACTCGATACC
CCTGACTTTG AAGCGATTGC AAAGATCGCC CACGACAATG GCATCCCGGT GGTTGTGGAC
AACACCACCG GTGTCGGCCT TGTCCGCCCG ATTGACCATG GCGTAGACAT TGTCGTTCAT
TCGGCCACGA AGTACATCGG CGGCCACGGC AACTCCATCG GCGGCGTGAT CGTTGATTCG
GGCAAGTTCG CCTGGAACAA CGGCAAGTTC CCCGAGTTCA CCGAACCGGA CCCGGGCTAC
CACGGCCTCA AATACTGGGA TGCGTTCGGG AACTTCCCCG GCCTCGGAAA CGTTGCCTTC
ATCTTCAAGA TCCGGGTTTC ACTGCTCCGG GATACGGGAG CAGTCTTAAG CCCGTTTAAC
GCCTGGCTCT TCCTTATCGG CCTTGAGACC CTCCACCTGC GTGTGCCACG CCACTCCGAG
AATGCCTTTG CCGTTGCAAA GTTCCTCAAA GGTCATCCCA AGGTCGCATG GGTCAACTAC
CCCGGGCTCC CGGAGCACCC CAGCCACACC TTAACCAAGA AATACCTCCA CGGCGGTTTC
GGCCCCCTCG TCGGTGTCGG GATCAAGGGT GGGGAGACCG CAAGCAGGAA GTTCATCGAT
TCCCTCAAGC TCTTCAGTAA CCTCGCTAAT ATCGGCGATT CAAAGAGCCT TGTGATCCAC
CCGGCAACCA CCACCCACCA GCAGCTTACC GCTGAGGAAC AGGCCAAGAC CGGCGTTACT
CCGGATGCCG TCCGCCTTTC CGTCGGTACT GAGGATATCG AGGATATCAT CGCTGATCTC
AGGCAGGCAC TGGACAAGGT ATAA
 
Protein sequence
MTGDIFGFLS RANPFSKKDA DKKEQSKEQN KTEAIAITEK KLNLGTLALH AGQVPDPATG 
SRTVPIYQTS SYVFKSTEHA ANLFGLRELG NIYTRLMNPT TDVFEKRIAA IEGGTGALAT
ASGQAAITYA LLNITRPGDE IVSADNLYGG TYELFHYTLP KLGRTVVFVD STKPEAFRNA
ITPKTRAIYA ETVGNPKLDT PDFEAIAKIA HDNGIPVVVD NTTGVGLVRP IDHGVDIVVH
SATKYIGGHG NSIGGVIVDS GKFAWNNGKF PEFTEPDPGY HGLKYWDAFG NFPGLGNVAF
IFKIRVSLLR DTGAVLSPFN AWLFLIGLET LHLRVPRHSE NAFAVAKFLK GHPKVAWVNY
PGLPEHPSHT LTKKYLHGGF GPLVGVGIKG GETASRKFID SLKLFSNLAN IGDSKSLVIH
PATTTHQQLT AEEQAKTGVT PDAVRLSVGT EDIEDIIADL RQALDKV
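
The gene and protein sequences above are printed in space-separated blocks of ten. The following is a minimal sketch of how such a block-formatted gene sequence might be cleaned, checked for GC content, and translated. It uses only the standard library; the helper names are hypothetical, and it assumes an unambiguous A/C/G/T sequence that starts at the first codon position. Translation table 11 assigns the same amino acids to codons as the standard code and differs in the permitted start codons, so the standard codon assignments are used here.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon.

    Alternative start codons (e.g. GTG, TTG) are not converted to M here;
    this gene starts with ATG, so that case does not arise.
    """
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = clean_sequence("ATGACAGGTG ATATCTTCGG ATTCCTTTCG")
print(translate(first_blocks))  # MTGDIFGFLS
```

Passing the full gene sequence through `clean_sequence` and then `gc_percent` or `translate` allows the GC content and protein sequence fields of the record to be checked against the nucleotide data.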