Gene Amuc_0117 details


Gene Information

Locus tag: Amuc_0117
Symbol:
ID: 6274920
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 144848
End bp: 145903
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 60%
IMG OID: 642612162
Product: Agmatine deiminase
Protein accession: YP_001876743
Protein GI: 187734631
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2957] Peptidylarginine deiminase and related enzymes
TIGRFAM ID:
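
The coordinate and length fields in this record are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the Start bp and End bp values are 1-based inclusive coordinates and that the last codon of the CDS is a stop codon that is not counted in the protein length.

```python
# Values copied from the record above.
START_BP = 144848
END_BP = 145903


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(length_bp: int) -> int:
    """Residue count, assuming the final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1056 351
```

Under those assumptions the result matches the Gene Length (1056 bp) and Protein Length (351 aa) fields.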


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 59
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAAGG AACCCGATGT ACGCTGGCCC GCTGAATGGG AGCCTCAGGA TGCTGTCTGG 
CTGTCCTGGC CCCACCGCAG GGATTTATGG CAAGGGGGGC TGGACGAGTT GCAGCAGACT
TATGGGAGCG TGGCCGCTGC CATTGCTCCG CATGCCCTTG TATGCGTGAA TGCCGCAGCT
CCCCTTCATC CCGGCGTCAG GCAGGCGATG CTGGCCTCCG GAATGAGTGA GGAGCAATTC
CGCCTGTTCA ACCACCCGAC CAACGACGTA TGGTGCCGGG ACCACGGCCC CGTTTTCGTC
CAGGATGTGA AGGACGGTTC CCTGATGCTG GCGGATTGGC AATTTAATGC GTGGGGCGGC
AAATTTGCCC CGTGGGACCT GGACAACGGC GTTCCCGCCC TGATTGGGGC GGCGCTGGGG
CTTCCCGTGC GCAGTTCTTC CCTGATTCTG GAAGGGGGGG CGATTGAGGG CAATGGGGAC
GGCTTGCTGG TGACGACGGA GTCCGTGCTG CTGAATCCCA ACCGCAATCC GGATTGGAGC
CGGGCCATGA TTGAGGAGGA ATTGAAGCGC ATGCTGGGCG TCAGAGCCGT TTTCTGGCTC
GGTTCCGGCA TTGAAGGGGA TGATACGGAC GGCCATATTG ACGACATGGT GCGTTTTGTG
TGCCGGGATG CCGTAGTCTC CATCGTGGAA ACGGATTCTT CCTCTCCCCA TTACCGCGCT
CTGGCGGAGA ATAATGAACG CCTTCAGGAT TTGAGATGCG TGGACGGTTC CGGGGTGGAG
GTGATTCCCC TGCCGATGCC GGATCCCCTC CATGCGGAGG ACTGGCGCCT GGATCAGCTC
CCTGCCAGTT ACGCCAATTT CCTCATTGTT AATGAGGCCG TCATTGTTCC CGTATTCAAC
CAGCCCCGGA ATGACGATCG CGCCCTGGGC ATTTTGCGTG AATGTTTCAG CGGAAAACAG
GTAATAGGGG TGGATGCCCG CAAGCTGGTG CTGGAAGGCG GCGCCATCCA CTGCATCACC
CAGCAGCAGC CTCGGCCGGG GAAGGAGGGA CTGTGA
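
The gene sequence is printed in blocks of ten bases separated by spaces, so the whitespace has to be removed before any calculation. As a minimal sketch, the GC fraction reported in the Gene Information fields could be recomputed as below. The helper names `clean` and `gc_content` are hypothetical, and the sketch assumes GC content is simply the share of G and C bases over the full sequence length.

```python
def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases (assumed definition of GC content)."""
    bases = clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


# First line of the gene sequence above; paste the remaining lines
# to obtain the value for the whole gene.
first_line = (
    "ATGAACAAGG AACCCGATGT ACGCTGGCCC "
    "GCTGAATGGG AGCCTCAGGA TGCTGTCTGG"
)
print(f"{gc_content(first_line):.0%}")
```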
 
Protein sequence
MNKEPDVRWP AEWEPQDAVW LSWPHRRDLW QGGLDELQQT YGSVAAAIAP HALVCVNAAA 
PLHPGVRQAM LASGMSEEQF RLFNHPTNDV WCRDHGPVFV QDVKDGSLML ADWQFNAWGG
KFAPWDLDNG VPALIGAALG LPVRSSSLIL EGGAIEGNGD GLLVTTESVL LNPNRNPDWS
RAMIEEELKR MLGVRAVFWL GSGIEGDDTD GHIDDMVRFV CRDAVVSIVE TDSSSPHYRA
LAENNERLQD LRCVDGSGVE VIPLPMPDPL HAEDWRLDQL PASYANFLIV NEAVIVPVFN
QPRNDDRALG ILRECFSGKQ VIGVDARKLV LEGGAIHCIT QQQPRPGKEG L