Gene Amuc_0465 details

Gene Information

Locus tag: Amuc_0465
Symbol:
ID: 6274714
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 551989
End bp: 553032
Gene length: 1044 bp
Protein length: 347 aa
Translation table: 11
GC content: 58%
IMG OID: 642612515
Product: Peptidase M23
Protein accession: YP_001877084
Protein GI: 187734972
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0739] Membrane proteins related to metalloendopeptidases
TIGRFAM ID:
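
The gene and protein lengths in this record follow from the start and end coordinates. The sketch below shows that arithmetic. It assumes the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length_bp(551989, 553032)
    print(length, protein_length_aa(length))  # prints "1044 347"
```

Both results agree with the gene length and protein length fields listed in the record.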


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.817942
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 60
Fosmid unclonability p-value:
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACGGCA AATCCTTTCC TCTCCTGACC GGCATCCTCC TGTTTCTTTG GGGCATGCCA 
TTACTGATGG CGGATATTGT AGTGCGCTTC CCCACGGAAA ATACAGCCCT GCTGGACAAC
CGCCCGCAGG ACTTTTACAT GTATGTAGAC CGCAATTTTG AAGGGAAGAA ATCCCAGCCG
TGGGAAGCTG GAGCCTACGG CTTTACCCGG ACCCTCGTCA GAACCCAGGC AGGCCCCGTG
GCCGTCAAAT TTCATGAGGG TATCGACATT AAACCTCTCA GGAGGGATGC TTCCGGCACG
CCGCTGGACG ACGTGCACCC CGTAGCCGGA GGCACGGTAG TCCATGCCTC CGCCAACCCA
ACCCATAGCA ATTACGGCCG CTATGTGGTC ATTGAGCACC AGCTGAAGGA CGGCCCGCTT
TACAGCCTGT ATGCCCATCT GGCCTCCGTC TCCTGCAGGA AAGGCGACCG GGTAGGAACC
GGAAACGTTA TTGGAAAGCT GGGATACTCC GGGGTGGGTT TGAACAAAAC GCGTGCTCAT
GTGCATCTGG AACTCTGCCT CAAGCTACAA GATGACTTTG AAAACTGGTA TTCCAGTCTG
AAACTGGGCA CTCCCAACCG CCACGGTTCC TATAACGGAC TCAATTTGGC CGGCTTTGAC
CCGGCACCCG TCCTCCTGCA ATGCAAGGAC GGGGCGGAAT TTTCCCTCTC TCGCCATATC
TCCTCCCTGC CGGTCCAATA CGTCGTGCGG GCTCCCTCTT CCGGCGAACT GCCCAGCCTT
GTCAAACGCT ACCCCTTCCT CCTGAAGCCG GGGCCCTCCG ACCCCAAATC CTGGGAAATC
AGTTTCACGG GAGAAGGAGT TCCCGTTTCC GTGACTCCTT CCAGCCAACC GTGCACGGAA
CCCGTCGTCA TCCGGGCCGT TCCGCATCCT TTCTCCCAAC TGTACAGGAC CTGCAACCGC
GTTTCCGGCT CCAGCAAGGA CCCTAAGCTT ACCGCCGCCG GCAAACGCTA CATCCGGCTC
ATCTTCATGG GGCCTGAATC ATAA
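
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any calculation. The following is a minimal sketch of how the GC content field could be recomputed from such a listing. The helper names are hypothetical, and the demo uses only the first line of the sequence, so its output is not expected to match the 58% reported for the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Join a block-formatted sequence by removing all whitespace."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence only; paste the full listing to
    # compare against the GC content field of the record.
    first_line = (
        "ATGCACGGCA AATCCTTTCC TCTCCTGACC "
        "GGCATCCTCC TGTTTCTTTG GGGCATGCCA"
    )
    seq = clean_sequence(first_line)
    print(len(seq), round(gc_percent(seq), 1))
```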
 
Protein sequence
MHGKSFPLLT GILLFLWGMP LLMADIVVRF PTENTALLDN RPQDFYMYVD RNFEGKKSQP 
WEAGAYGFTR TLVRTQAGPV AVKFHEGIDI KPLRRDASGT PLDDVHPVAG GTVVHASANP
THSNYGRYVV IEHQLKDGPL YSLYAHLASV SCRKGDRVGT GNVIGKLGYS GVGLNKTRAH
VHLELCLKLQ DDFENWYSSL KLGTPNRHGS YNGLNLAGFD PAPVLLQCKD GAEFSLSRHI
SSLPVQYVVR APSSGELPSL VKRYPFLLKP GPSDPKSWEI SFTGEGVPVS VTPSSQPCTE
PVVIRAVPHP FSQLYRTCNR VSGSSKDPKL TAAGKRYIRL IFMGPES
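
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this with a hand-built codon table. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and it assumes the gene sequence is given in the coding orientation. The `translate` helper is illustrative, not the database's own tooling.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Table 11 shares these
# codon-to-amino-acid assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw_dna: str) -> str:
    """Translate a (possibly block-formatted) DNA string up to the first stop."""
    seq = "".join(raw_dna.split()).upper()
    protein = []
    # Step through complete codons; a trailing partial codon is ignored.
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    print(translate("ATGCACGGCA AATCCTTTCC T"))  # prints "MHGKSFP"
    print(translate("GGGCCTGAAT CATAA"))         # prints "GPES"
```

The two demo inputs are the first 21 and last 15 bases of the gene sequence; their translations match the first seven and last four residues of the listed protein.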