Gene Amuc_2013 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_2013 
Symbol 
ID6275754 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2445725 
End bp2447011 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content59% 
IMG OID642614072 
ProductO-acetylhomoserine/O-acetylserine sulfhydrylase 
Protein accessionYP_001878604 
Protein GI187736492 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2873] O-acetylhomoserine sulfhydrylase 
TIGRFAM ID[TIGR01326] OAH/OAS sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.024983 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value0.048822 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAAGA ACCATCGATT TGAAACACGC CAGATCCATG TGGGTCAGGA AAGTCCGGAT 
CCGGCCACCG ATGCGCGCGC CGTGCCTATT TACGCCACCA CTTCGTATGT CTTCAAAGAC
TCGGAACAGG CGGCAGGCCG TTTTGCCCTG GCGGAGCCGG GCAATATTTA TAACCGCCTG
ATGAACCCTA CCGCAGATGT TTTTGAAAAA CGCATCGCCT CCCTGGAAGG AGGGACGGCC
GCGCTGGCCG TCTCCACCGG GGCTGCCGCT GTCACGTATG CCATCCAGAA CATCGCCCGG
GCCGGGGACC ACATCGTTTC TTCTTCCACC GTATATGGCG GGACGTATAA TCTCTTTGCC
AATACGCTGG CGGACGCCGG CATAGAAACC ACTTTCGTGG ATGCAAGGGA CGTTCAGAAT
TTTTCCAGGG CCATCCGGAA CAATACCAAG GCCCTGTACG TGGAAAGCCT GGGCAACCCG
AACTGCGACA TCGTGGATAT GGAAGCGCTG GCGGAAGTGG CGCACGCCCA CGGCATCCCG
CTCATTGTGG ACAGCACGTT CGCCACGCCC TTCCTGTTCC GCCCCCTGGA ACACGGAGCG
GACATCGTGG TGCATTCCGC TACCAAATTC ATCGGCGGCC ACGGCACGGT GATGGGCGGC
GTGATTGTGG ACGGCGGTAA ATTCGACTGG ACGCAGAACG ACAAGTTCCC CGGCATCAGC
AAGCCCAACC CCAATTACCA CGGAGCCGTG TTCGCTGAGG TATGCGGCAA TCTGGCCTAT
ATCGTCAAAA TCCGGGCCAC CCTGCTGCGG GATACGGGAG CCACCATCAG CCCGTTCAAC
TCCTTCCTGC TGCTCCAGGG GCTGGAAACA CTCTCCCTGC GGGTGGAACG CCATGTGCAG
AACGCCCTGC GCGTAGCGGA CTATCTGGCC TCCCATCCCC AGGTGGAGAG GGTGAACCAT
CCCTCCCTGC CGGACCATCC GGACCACGAC CTTTACAAGA GATACTACCC GAACGGGGGC
GGCTCCATCT TCACCTTTGA AATCAAGGGT GGCGCGGAAA AAGCTCGCAA ATTCTGCGAA
AGCCTGGAAC TATTCTCCCT GCTCGCGAAC GTGGCGGACG TCAAGTCCCT GGTGATTCAT
CCGGCCTCCA CCACCCATTC CCAGATGACG GAGGAGGAAC TGAAGGCGGG AGGCATTACG
CCATCCACCG TGCGGCTTTC CATCGGGACG GAACATATCG ACGATATTCT GGAAGATCTG
GAACAAGGCT TCCGCGCCAT TCTCTAA
 
Protein sequence
MSKNHRFETR QIHVGQESPD PATDARAVPI YATTSYVFKD SEQAAGRFAL AEPGNIYNRL 
MNPTADVFEK RIASLEGGTA ALAVSTGAAA VTYAIQNIAR AGDHIVSSST VYGGTYNLFA
NTLADAGIET TFVDARDVQN FSRAIRNNTK ALYVESLGNP NCDIVDMEAL AEVAHAHGIP
LIVDSTFATP FLFRPLEHGA DIVVHSATKF IGGHGTVMGG VIVDGGKFDW TQNDKFPGIS
KPNPNYHGAV FAEVCGNLAY IVKIRATLLR DTGATISPFN SFLLLQGLET LSLRVERHVQ
NALRVADYLA SHPQVERVNH PSLPDHPDHD LYKRYYPNGG GSIFTFEIKG GAEKARKFCE
SLELFSLLAN VADVKSLVIH PASTTHSQMT EEELKAGGIT PSTVRLSIGT EHIDDILEDL
EQGFRAIL