Gene Mext_4604 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_4604 
Symbol 
ID5832252 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp5144326 
End bp5145618 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content69% 
IMG OID641370398 
ProductO-acetylhomoserine/O-acetylserine sulfhydrylase 
Protein accessionYP_001642043 
Protein GI163854000 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2873] O-acetylhomoserine sulfhydrylase 
TIGRFAM ID[TIGR01326] OAH/OAS sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.105232 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAGCG AAACGATCGC CCTTCATGCC GGCTTCGACC ACGATCCGGC CACGCACGCG 
GTCGCTGTGC CGATCTATCA GAGCGTCGCC TACGCCTTCG ACAGCGCCGA CCACGGCGCC
GCCCTTTTCA ACCTGGAGGA AGAGGGTTTT CGCTACAGCC GGATCGCTAA CCCGACGGTC
GCCGTGCTGG AGCGGCGCGT GGCGGAACTG GAGGGCGGCC ATTCGGCGCT CGCCGTCGCG
TCGGGGCAGG CCGCACTCCA CTACGCCATC GCCACCCTGG CGGATCATGG CGGCAACATC
GTCGCGGTGC CCCAGCTCTA CGGCACGACG CATACGCTGC TGGCCCACGT CCTGCCGCGC
CAGGGCATCA CCTGCCGCTT CGCCGCGAGC GATCGGGACG CGGATATCGC GGCGCTGATC
GATGGCGACA CCCGCGCCGT CTACTGCGAA TCGATCGGCA ATCCGGCCGG CAACATCTGC
GATATCGAGG CCCTGGCGGC CGTGGCCCAC GCCCACGGCG TACCGCTCGT GGTCGACAAC
ACCGTGCCGA CCCCGATTCT GATGCGGCCG ATCGATTACG GGGCCGACAT CGTCATCGCC
TCGCTCACCA AGTTCATGGG CGGCCACGGC ACCACGCTCG GCGGCATCAT CGTCGATTCC
GGCCGCTTCG ACTGGACGGC GCAGGCTGAG CGCTTCCCGA TGTTCACGCG GCCGGACGTC
TCCTATCACG GCCTCGTCTA CGCCGACCAT TTCGGCCGCG GTGCCTTCGC CGCGCGGGCG
CGCAGCGTCT ACCAGCGCAC CACCGGCGCC GTGCTGCCGG CGATGTCGGC CTTCCTGCTG
CTGCAAGGCA TCGAGACGGT GGCGCTGCGG GTCGAGCGCC ATGTCGCGAA CGCGCGCAAG
GTCGCCGAGC ACCTGCGGGC GCATCCGCAG ATCGCCTGGG TGAACTATGC CGGGTTCGCC
GACAGCCCGA ACCACCCGAT GGCGCGCAAG TACCTGAAGG GCGAAGGCTC TTCGCTCCTG
ACCTTCGGCG TTGTGGGCGG GTTCGCGGGC GGCAAGACGT TCTACGACGC GCTGAAGCTG
GTGAAGCGCC TCGTCAACAT CGGCGATGCC AAGTCGCTCG CCTGCCATCC GGCCTCGACG
ACGCACCGGC AGATGACCCC CGACGAGCAG CGGGTCGCGG GCGTGCTGCC GGAGACGATC
CGGCTCAGCG TCGGCATCGA GCATATCGAC GACATCCTCG AAGACCTCGA CCAGGCGCTC
GCCGCCGTGG CCCCCGCCGC ACTCGCGGCC TGA
 
Protein sequence
MRSETIALHA GFDHDPATHA VAVPIYQSVA YAFDSADHGA ALFNLEEEGF RYSRIANPTV 
AVLERRVAEL EGGHSALAVA SGQAALHYAI ATLADHGGNI VAVPQLYGTT HTLLAHVLPR
QGITCRFAAS DRDADIAALI DGDTRAVYCE SIGNPAGNIC DIEALAAVAH AHGVPLVVDN
TVPTPILMRP IDYGADIVIA SLTKFMGGHG TTLGGIIVDS GRFDWTAQAE RFPMFTRPDV
SYHGLVYADH FGRGAFAARA RSVYQRTTGA VLPAMSAFLL LQGIETVALR VERHVANARK
VAEHLRAHPQ IAWVNYAGFA DSPNHPMARK YLKGEGSSLL TFGVVGGFAG GKTFYDALKL
VKRLVNIGDA KSLACHPAST THRQMTPDEQ RVAGVLPETI RLSVGIEHID DILEDLDQAL
AAVAPAALAA