Gene Amuc_2167 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_2167 
Symbol 
ID6274766 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2642583 
End bp2644685 
Gene Length2103 bp 
Protein Length700 aa 
Translation table11 
GC content60% 
IMG OID642614227 
ProductOligopeptidase A 
Protein accessionYP_001878755 
Protein GI187736643 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0339] Zn-dependent oligopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.803751 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCATC CCTATCTGGA CCCCTCCTTT CTGGTTTCCT GGTCCCGGCT TACGCCGGAG 
GCCATCAGGC CGGACATCAC GGAAGCCATC TCCCGCGCCA AAGAGAATAT CCGGACCATT
TGCGACCAGC CGCTGGAGTC CCTGACTTAT GAAAGCACCT TCGGCGCTCT GGAAAAGGCC
TCCGAGGATC TGCACCTGGG CTGGGGCCGC ATCATGCACC TGGACTCCGT CAATGACGAA
CCCGCCCAGA GGGAGGCCAT CGGGGAAATG CTGCCGGAAG TGGTGGCTTT CTCCTCCTCC
GTGCCGTTGA ACCCGCGCCT GTGGACGGTG CTGAAAGCCG CGGCCTCCTG TGACTGGGTG
AAAAGCCTTT CCCCCGTCAG GCAGCGTTTC ATCCAGGAAA CGCTGGCGGA CTTCCGCGAG
AGCGGGGCGG ACCTGCCGGA CGACGTGAAG CCGGAATATG CGGAGATAGA AGCCCAGCTC
TCCCTGAAGA CCAAGAAATT CGCGGAAAAC GTGCTGGACT CCACCAACGC CTGGGAACTC
ATTGTGGAAG ATGAAGCGGA ACTTTCCGGG CTGCCGGATT CCGCGAAGGA AGCCGCCCGC
CTGGATGCCC TGGCCAACGG CCACGGCACG GAAGAAGCCC CCCGCTGGCG CTTCACCCAG
AAATTTACCT CCCTCCAGCC TGTCATGCAG TTTGCGGACT CGGACAGCCT GCGCCGCCGC
ATGTGGGAGG GCTCCTGTTC CATCGGGAAG GGCGGAGAAT ACGATAATGA AGCCCTTATC
GCTGAAATCC TGGAACTGAG GGACAGGAAA GCCCATTTGC TGGGGTACGG CTGCTTTGCG
GATTACGCCA CTTCCCGCCG CATGGCCGGG AGCGGAGCCA ACGCCCTGAA ATTCATCAAC
GACCTGCATG ACAGGGTGAA GCCCTCTTTC CTGAAGGACA TGGAAGCCGT TCGCAGGTAC
AAGGAGGAAA AAACAGGAAA ACCCGTGGAA AAGCTCTCCC CGTGGGAAAC CGGATACTGG
TCTGAAAAAC GCCGCCGCGA ATTGTACGCT TTTGACGAGG AAGACCTGCG CCCGTATTAC
TCCGTGGAAA AAGTCATGGA AGGGCTCTTT TCCATCTACT CCGGCCTGTA CGGCATCACG
GTCACGCCGC GTCCCACGGT GGCGTTCAAG CCCGGTGAAT CCGGGGAAGC GCCGGAAGGC
GCGGTGGAGG TGTGGCATCC GGACGTCCTG TTCTATGAAT TGCATGATGC GGAAAGCGGG
GAACACCTGG GTTCCTTTTA TGCGGACTGG CATCCGAGGG ACTCCAAGCG CGCCGGAGCG
TGGATGAACT ACCTGAGCGT AGGGGAACCT CCGCACGGCG GAAAACCCCG CGTTCCCCAT
CTGGGTCTCA TGGTCGGCAA CATGACCAAG CCCGTAGGGG ACAAGCCCGC GCTGCTGTCC
CACCGGGAGG TGGAAACCAT CTTCCATGAA TTCGGCCACC TGCTGCACCA GCTCCTTTCC
GATGTGGAAG TGAAGTCCCT GTCGGGCACC AACGTTGCCT GGGACTTTGT GGAACTGCCC
TCCCAGATTA ATGAAAACTG GTGTTGGGAG CGTGAATCCG TGGACCTCTT CGCCGCCCAC
TATGAAACGG GTGAAAAAAT ACCGGACGAA CTGTTCTCCA AAATGCGCGC CGCCCGCAAT
TATATGAGCG GCACGGATTT CATGCGCCAG CTCTGCTTTG GCAAGCTGGA TCTGGAGCTT
CACGTAAACT GGCCTCAGTA CAAGGGTGTT CCGCTGGAAG AAACGGATGA ACGCATTCTG
GCGGATTACC GGGTGCCGAT GACGGACCGC GGCCCTTCCG TGGCGCGCCG CCTGACCCAC
ATCTTCGCGG ATCCCACGGG TTATGCTTCC GGTTATTACT CCTACAAATG GGCGGAGGTG
CTGGAAGCGG ACGCTTTCAG CCGCTTCCTG AAAGAAGGAG TGCTGAATCC CCGAACCGGG
CGCGACTTCC GCCGCTGCAT CCTCAGCAAG GGCAACAGCA AGCCTGCCGC TGAACTCTAC
CGCGACTTCA TGGGCCGTGA TCCGGACGCG GAAGCGCTGC TTGTCAAATC CGGCGTTCTT
TAA
 
Protein sequence
MNHPYLDPSF LVSWSRLTPE AIRPDITEAI SRAKENIRTI CDQPLESLTY ESTFGALEKA 
SEDLHLGWGR IMHLDSVNDE PAQREAIGEM LPEVVAFSSS VPLNPRLWTV LKAAASCDWV
KSLSPVRQRF IQETLADFRE SGADLPDDVK PEYAEIEAQL SLKTKKFAEN VLDSTNAWEL
IVEDEAELSG LPDSAKEAAR LDALANGHGT EEAPRWRFTQ KFTSLQPVMQ FADSDSLRRR
MWEGSCSIGK GGEYDNEALI AEILELRDRK AHLLGYGCFA DYATSRRMAG SGANALKFIN
DLHDRVKPSF LKDMEAVRRY KEEKTGKPVE KLSPWETGYW SEKRRRELYA FDEEDLRPYY
SVEKVMEGLF SIYSGLYGIT VTPRPTVAFK PGESGEAPEG AVEVWHPDVL FYELHDAESG
EHLGSFYADW HPRDSKRAGA WMNYLSVGEP PHGGKPRVPH LGLMVGNMTK PVGDKPALLS
HREVETIFHE FGHLLHQLLS DVEVKSLSGT NVAWDFVELP SQINENWCWE RESVDLFAAH
YETGEKIPDE LFSKMRAARN YMSGTDFMRQ LCFGKLDLEL HVNWPQYKGV PLEETDERIL
ADYRVPMTDR GPSVARRLTH IFADPTGYAS GYYSYKWAEV LEADAFSRFL KEGVLNPRTG
RDFRRCILSK GNSKPAAELY RDFMGRDPDA EALLVKSGVL