Gene Amuc_1921 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1921 
Symbol 
ID6275327 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2331792 
End bp2333681 
Gene Length1890 bp 
Protein Length629 aa 
Translation table11 
GC content59% 
IMG OID642613981 
Productapolipoprotein N-acyltransferase 
Protein accessionYP_001878515 
Protein GI187736403 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0815] Apolipoprotein N-acyltransferase 
TIGRFAM ID[TIGR00546] apolipoprotein N-acyltransferase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.0008443 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAATATCC CACCTGCTTC CCTCCCTCGT CTCCTGCCCG TGTGGGCAGG GCTGCTGCTG 
GCCGCTTTTT CCGGAATTCT GACGGCCTGC GCCTTTATCC CGGTGGACTG GGGCGGATGC
GTATGGATAG GGTTCCTGCC TTTGCTGACA GCCCTGTGGT ACGGCCGCAG GCGGGAGGGG
AAAAGGGGAA TGCTGGCATA CGCCCTGTAT GGCTGGATGT TTGGGGTAGT ATTTTACGGC
ATTTCCTTCT GGTGGGTCAA TGAAGTCAGC ACGCTGGGCT ATATTCCCCT GATGATTTTT
TACGGAGGCC TGTTTCCCGG CCTCTGGGCT CTGGTCATGG GAGTGTTCTT CCGACCGGAC
GCCCGGCCGC TGCCCGATGC GAGGCTGACG GCAAAGGAAC GCCGGGCCGC CTGGAAAACC
TGGGCCCTGG GCGACATACT GCCCAGCGCC TCCGCCGCTC TGGCGGGCGC CGCTTTATGG
GTATGTCTGG AATGGGGGCG GGGCTGGCTG ATACCGGGCT TCGGCTGGAA CAATCTGGGC
GTGGCCCTTT ACGGCAATCC TCTGGCCCAG TGGGCGGAAT ACCTTGGAGT AACGGCGCTG
GCTTTTATTC CGGCCGTTTT TTCCGTCTGG CTCTGGCGCG TTTGCCGCCG GGCCGGAACC
ATGATTATCC ATGAAGGCAG GCGTACTGTT CCCTGGGATT TTTTCATACT GGTGGCGGTT
CTTTTAACCA TGTTTGTCAC TGGCGTGGTC TGGACGGCCC GTTACTCCGC CCAGTCTGCC
GAAGCCACGG GCGACGGCAG ATTCACCGTT CCCGTCATGG CCGTGCAGCT GAATCTGAGC
CAGAAGGAAA AATGGGACCC GGCCAACCGG AGGAACATTT ACCGGGCTTT GCTGGACATG
ACGGAACAGG GAATGCTGGA TCTTCAAAAC CGCGCTCTGG AACAGGCGGT CAAAGACGGC
GCGGAAGCCT CCCTGGATAT GCCGGCCTGG GTAATCTGGC CGGAAAGTTC CTTCCCCATT
TCCACGTTTT ACCGTGATGC CACGGGAGAA CGCTTCTCCA ACCAGGACAA TGTCAACTTT
CTGAGCGCGG AAGAAGATTA CGTCCAGGCT CTCCGGAACG GGATATGCAA TTTTATTCTG
CTGACGGGCA CGGATGATAT CTATCTGTCT GATGAAGGGC GTGTGGCGCG GGCTTACAAC
TGTCTTACCG TCTTTGAGGG GGATTATTCC ACAGCCCGGC TGCACGCCAA GGCCATGCTG
GTCCCCTTCG GCGAATACAT CCCTATGCGG AAAACATTCC CATTTCTGGA AAAAGCCTTC
GAGGCTTCCG CCGGAACCGC CATGGGACTG AATTATACGC CCGGGTGTTC CTCCAGCCCC
GTTCCCGTGC CCATCCGTCC CGGAAGCTCC GTTACGGTGG GGGTCATCCC TCTTGTCTGC
TTTGAAGATG TTGTGGGAAG CTGGGTGCGC CGCTTCATCC GGCAGGAGCC CCAGTTGATG
GTGAACGTGA CCAATGACGG GTGGTTCAAC CGTTCCTGCG CCAACGAACA GCATTGGCGC
AATGCGGCGT TCCGGTGTAT TGAGCTGCGC CGTGCCATGG TCCGTGCCGC CAATACCGGC
GTAAGCGTGG CTCTGGCCCC CAACGGCGCG GTTATTGCGG ATTTGCGGGA TTCCTCCGGT
TCACCGTTTA CCAGAGGGGT GATGGCTGCC ACATTGCCTG TGGGCTGTAC GAAAATAACC
CTGTACGCCA TGCTGGGCGA CTGGGCCGTC CTGGTTTGTT TCCTGGTGTT CGCGGCTCTG
TTGCTGCGCA GGATAGGCGC AGGGAAACGC GTCCATGGCC CCGTGGAGAA CGGAGTTTAC
CGGTGTTCCA CGGGGGCTAC GCCCAGATAA
 
Protein sequence
MNIPPASLPR LLPVWAGLLL AAFSGILTAC AFIPVDWGGC VWIGFLPLLT ALWYGRRREG 
KRGMLAYALY GWMFGVVFYG ISFWWVNEVS TLGYIPLMIF YGGLFPGLWA LVMGVFFRPD
ARPLPDARLT AKERRAAWKT WALGDILPSA SAALAGAALW VCLEWGRGWL IPGFGWNNLG
VALYGNPLAQ WAEYLGVTAL AFIPAVFSVW LWRVCRRAGT MIIHEGRRTV PWDFFILVAV
LLTMFVTGVV WTARYSAQSA EATGDGRFTV PVMAVQLNLS QKEKWDPANR RNIYRALLDM
TEQGMLDLQN RALEQAVKDG AEASLDMPAW VIWPESSFPI STFYRDATGE RFSNQDNVNF
LSAEEDYVQA LRNGICNFIL LTGTDDIYLS DEGRVARAYN CLTVFEGDYS TARLHAKAML
VPFGEYIPMR KTFPFLEKAF EASAGTAMGL NYTPGCSSSP VPVPIRPGSS VTVGVIPLVC
FEDVVGSWVR RFIRQEPQLM VNVTNDGWFN RSCANEQHWR NAAFRCIELR RAMVRAANTG
VSVALAPNGA VIADLRDSSG SPFTRGVMAA TLPVGCTKIT LYAMLGDWAV LVCFLVFAAL
LLRRIGAGKR VHGPVENGVY RCSTGATPR