Gene Amuc_1918 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1918 
Symbol 
ID6275362 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2328690 
End bp2330006 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content58% 
IMG OID642613978 
Productbifunctional UDP-3-O-[3-hydroxymyristoyl] N-acetylglucosamine deacetylase/(3R)-hydroxymyristoyl-[acyl-carrier-protein] dehydratase 
Protein accessionYP_001878512 
Protein GI187736400 
COG category[I] Lipid transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0764] 3-hydroxymyristoyl/3-hydroxydecanoyl-(acyl carrier protein) dehydratases
[COG0774] UDP-3-O-acyl-N-acetylglucosamine deacetylase 
TIGRFAM ID[TIGR00325] UDP-3-0-acyl N-acetylglucosamine deacetylase
[TIGR01750] beta-hydroxyacyl-[acyl carrier protein] dehydratase FabZ 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.000565023 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGCATGTG GAAACCAAAG AACTGTCGGC TCTCCGGCCT CCCTGGCCGG TACCTCTTTG 
CATACGGGAC AGCCCGTTAC CCTGACCCTG AAGCCGGCTC CTGCCGACTT CGGCATTAAA
TTCCGCCGGG TGGACATTCC GGACCAGCCT TTCATCAGCG CGGACGTGGA AAAGGTGCAG
ACTGTGGAAC GCGCCACCAG CCTGGCTGAA GGTTCCGTCA AAGTGCATAC CGTGGAACAT
ATTCTCTCCG CCCTCACGGG CATGGGCATT GACAACGCCG TCATTGAAAT GGACGCCAAT
GAACCGCCCA TCGGGGACGG GTCCTCAGCC CCCTACGTAG AGCTGATCAA GAGCGCCGGC
ATCGTGGAAC TGGATGTGCC GCGCCGCTAT CTGGAAGTGC GCGAGGCCGT TACCATTGAA
ACCAAGGGCG GTTCCATCCT GACCATTCTC CCCTCCAAGC AATTCCGCGT TTCCGTCACC
TGCGTGGGGC CGGAAAACCG CATTACCCAG TACTTTGACT CGGTTATCAC GCCGGAAACG
TATGAAAAGG AACTGGCCCC GGCTCGCACT TTTACGTTCT ACGAAGACAT CAAGCCCCTG
CTGGAGAAGG GCCTCATCAA GGGCGGCAGC CTGGAAAACG CCGTGGTCAT CCGCGGCGAG
GAGCTCATGA GCAAGGAGCC CATGCGTTTC ATCAATGAAT TTGCCCGCCA CAAGGCCATG
GACCTTATTG GAGACCTGAC CCTGTGCGGC AAACCCATCC TGGGCCACGT CATCGCCATC
AAGCCGGGCC ACGGCCCGAA CACCGAACTT ACGGCCAAGC TCAAAAAGGA ACACCACCGC
AACCAGCAGA TGGCCCCCAA TCCCGTCAAT GTTCCATACG GAGACGCCGT GCTGGACATC
AACGAAGTGA TGAGCCTTCT GCCGCACCGC TATCCCTTCC TGATGGTGGA CCGCATTATC
GGTTTTGAAG GAGAAACCAA GTGCCGGGGA TTGAAGAACC TGACCATGAA TGAACTCTTT
TTCCAGGGCC ACTTCCCGGG ACATCCGGTC ATGCCGGGCG TTCTCCAGGT GGAAGCCATG
GCCCAGGTAG CCTCCATCGT CATGCTGCGC CAGCCCGGCA ATGCCAGCAA GCTCGGCTAC
TTCATGAGTG CGGACAAGGT TAAATTCCGC CGCGTCGTGG TTCCCGGCGA TACGCTCATC
ATTGAAGCGG AACTGACCAA GATGCGCGGA AACATCGGCC AGGCGACGGC TCGTTGTCTG
GTGAACGGCC AGGTGGTTTC GGAAGCCGAG CTTAAATTCG GCCTTCAGGA TGCTTGA
 
Protein sequence
MACGNQRTVG SPASLAGTSL HTGQPVTLTL KPAPADFGIK FRRVDIPDQP FISADVEKVQ 
TVERATSLAE GSVKVHTVEH ILSALTGMGI DNAVIEMDAN EPPIGDGSSA PYVELIKSAG
IVELDVPRRY LEVREAVTIE TKGGSILTIL PSKQFRVSVT CVGPENRITQ YFDSVITPET
YEKELAPART FTFYEDIKPL LEKGLIKGGS LENAVVIRGE ELMSKEPMRF INEFARHKAM
DLIGDLTLCG KPILGHVIAI KPGHGPNTEL TAKLKKEHHR NQQMAPNPVN VPYGDAVLDI
NEVMSLLPHR YPFLMVDRII GFEGETKCRG LKNLTMNELF FQGHFPGHPV MPGVLQVEAM
AQVASIVMLR QPGNASKLGY FMSADKVKFR RVVVPGDTLI IEAELTKMRG NIGQATARCL
VNGQVVSEAE LKFGLQDA