Gene Amuc_1539 details


Gene Information

Locus tag: Amuc_1539
Symbol:
ID: 6273671
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 1849845
End bp: 1851008
Gene Length: 1164 bp
Protein Length: 387 aa
Translation table: 11
GC content: 60%
IMG OID: 642613598
Product: homoserine O-acetyltransferase
Protein accession: YP_001878141
Protein GI: 187736029
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2021] Homoserine acetyltransferase
TIGRFAM ID: [TIGR01392] homoserine O-acetyltransferase
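
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based inclusive coordinates and that the unspliced CDS includes its stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(1849845, 1851008)
print(length)                     # 1164
print(protein_length_aa(length))  # 387
```

Under these assumptions the computed values agree with the Gene Length (1164 bp) and Protein Length (387 aa) fields of the record.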


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000120571
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 67
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGCTTC CTGCCGTCCA AACCAAATTC CTGGATCTTC CCAACTCCTT CTCCCTGAGA 
AATGGAGCTA CGCTGGACAA GGTTCGCGTC GCCTACGAAC AATACGGAAC CCTGACGCCG
AATAAAGACA ACGCCATCCT GCTGTTCCAT GCTCTTTCCG GCAGCCAGCA CGCCTACGGT
TACAATCCGG AAGTCCCGGG CATCGATTCC CTCTGGAAAC CGGAAAACCA CGAGGGCTGG
TGGAACAGCA TCATCGGCCC CGGCAAGCCG CTGGACACAA ACTGCTTCTG CATCATCTGC
GCCAATTACC TGGGAGGCTG CTACGGCACC ACAGGCCCCG CTACCCCCTG CCCTGCGGAC
GGCCAGCCCT ACGGCTCCCG TTTCCCGCAT GTGGAGGCCG CAGACCAGGC ACGTCTCCAG
GCCCTTCTGC TGGACAGCCT GGGCATAGAA CGCGTTCATC TTATGGGCCC CTCCGTGGGC
GGACTGATCG CCCTCAGCTT CGCGTGCCAG TTCCCGGAAC GGGTCCGGAG CTTCATCTCC
ATCGGCTCCG GTTACCGGGC TTCCATTGAA CACCGCCTGT CCCTGTTTGA ACAAATCCTG
GCCATTGAGC TTGACCCGGA TTTCCAAGGC GGGGATTACT ACCGGGGACC GGCGCCAAAA
AAAGGGCTGG CGTTCGCCCG TATCATCGGC CACAAATCAT TTGTTTACCA GGAGGGGCTG
GAACAGCGCG CCAGAAAAGA GGTGGGAGGC AACTACGGCC TGCTCACGTG GATGACCCCC
ACCCGCAGCA CGCAAAGCTA CATGCTTCAC CAGGGAACCA AGTTCGCGGA GCGCTTTGAC
GCCAACGCCT ATATCCGTAT TGCGGATATG TGGGCGGAGT TCGACATCCG CGACCACACC
CCGGACGGAA CATTTCAAAC CGCCCTGGAA GGCTTCCGCC GCGCAGGGAT TCCCGCGCTT
ATCTTTTCCA TTGATACAGA CTGCTGTTTC CGCCCGGCGG AACAGCAGGA TTTCGCGGCG
CAGCTTGAAG CCGCCCATAT TCCCACGGAG TTCCATACCA TCGCTTCCAC CAAGGGACAC
GATTCCTTCC TGCTGGAGCC GGAGCTTTAT GCGGAACCCA TCCGGCGCAT TCTGGCGGCA
AGGAAGCCGA AGGGGACGGC GTAA
 
Protein sequence
MQLPAVQTKF LDLPNSFSLR NGATLDKVRV AYEQYGTLTP NKDNAILLFH ALSGSQHAYG 
YNPEVPGIDS LWKPENHEGW WNSIIGPGKP LDTNCFCIIC ANYLGGCYGT TGPATPCPAD
GQPYGSRFPH VEAADQARLQ ALLLDSLGIE RVHLMGPSVG GLIALSFACQ FPERVRSFIS
IGSGYRASIE HRLSLFEQIL AIELDPDFQG GDYYRGPAPK KGLAFARIIG HKSFVYQEGL
EQRARKEVGG NYGLLTWMTP TRSTQSYMLH QGTKFAERFD ANAYIRIADM WAEFDIRDHT
PDGTFQTALE GFRRAGIPAL IFSIDTDCCF RPAEQQDFAA QLEAAHIPTE FHTIASTKGH
DSFLLEPELY AEPIRRILAA RKPKGTA
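
Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch using only the Python standard library. The helper names are hypothetical, and the GC calculation is a plain (G + C) / length fraction, which may differ from how the database rounds its reported GC content.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by removing all whitespace."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGCAGCTTC CTGCCGTCCA AACCAAATTC CTGGATCTTC CCAACTCCTT CTCCCTGAGA"
seq = clean_sequence(first_line)
print(len(seq))          # 60
print(gc_fraction(seq))  # GC fraction of this 60-base fragment only
```

To check the whole record, pass the full gene sequence text to `clean_sequence` and compare `len(...)` with the Gene Length field.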