Gene Amuc_0396 details

Gene Information

Locus tag: Amuc_0396
Symbol:
ID: 6274807
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 477908
End bp: 479323
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 59%
IMG OID: 642612447
Product: type I phosphodiesterase/nucleotide pyrophosphatase
Protein accession: YP_001877016
Protein GI: 187734904
COG category: [R] General function prediction only
COG ID: [COG1524] Uncharacterized proteins of the AP superfamily
TIGRFAM ID:
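
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state explicitly: that Start bp and End bp are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the gene record above.
START_BP = 477908
END_BP = 479323

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length_bp = END_BP - START_BP + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)  # prints: 1416 0 471
```

Under those assumptions the result agrees with the listed Gene Length (1416 bp) and Protein Length (471 aa).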


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 73
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAATTCC CGTTCCAAAC CGCTAATATC CCTCCCATGA AACACCCTCG CACCCGTGTG 
GCCGTCATTG ACGTGGTGGC CCTTTCCCGC CAGATGATGG AACACATGCC GCGGCTCTCC
GCCTGGGCGG AGGGGCGGAG CGTTTCCTCC TTCCCCCCGG CCTTTCCGGC CCTCACCTGC
TCTGCCCAGA GCACCTACGT GACAGGGCTT TCCCCGCGGG AGCACGCCAT TCCCGGCAAC
GGATGGTACA ACCGGAATAT GTGTGAAATC CAATTCTGGA AGCAGTCCAA CAAGCTGGTG
CAGGGCCCGC GCCTCTGGGA GAAACTGAGG GAACGGTACG GTTCCGGCTT CACCTGCGCC
AAACTTTTCT GGTGGTACAA CATGTATTCC ACGGCGGACT GGACCATCAC GCCGCGCCCC
ATGTACCCGG CAGACGGCCG CAAGATCTTC GACATTTACA CCCAACCCAT GGAACTCCGG
GAAACCATTA AAAAGGATCT GGGAGAATTC CCCTTCCCCA CCTTCTGGGG CCCCATGGCA
GGGATTCAGT CCTCCCAATG GATAGCAGAC TCCGCCCGGT GGGTGGAACG GAAACATCGC
CCTGACCTCA GCCTCATCTA TCTGCCCTAT CTGGACTATG ACCTTCAGAA ATTCGGACCG
TCCTCGACCC AGGCTGCCCA CGCGGCAGAG GCTATGGACG GTCTTCTCTG CGACTTGATC
GACTTTCTGG AACGGGAAGG CGTCACCCCC GTCGTCCTCA GTGAATACGG TATTTCCGAC
GTATCCCGCA GCATTGCCCT CAACCGCCTC TTCCGGGAAC GGGGCTGGAT TACCGTCAAA
CCGGAAATGG GTACGGAAAT GCTGGACTGC GGCGCCTCCC GCGCCTTTGC CGTGGCGGAT
CACCAGACTG CCCATATCTA CATCAATGAT CCTTCCGTAA AAGAAGAAGT GAAAGCACTG
CTCTCCGCCA CACCCGGAGT GGAAGAAATC AGGGAAACGG ACTTCTCCGG CCTTTCTTCC
GCGGCTCTGG AACGCCTGCC GGAATTCACC GCCGTCGCAG CCCCGGATGC ATGGTTCACC
TACTATTACT GGCTGGATGA CACCAAGGCG CCGGACTTCG CCCGCTGCGT GGACATCCAC
CGCAAACCCG GCTATGACCC CGCGGAAATG TTCTTTGATC CGGGCCTTAC CCTCCCCATG
TTCCATGCCG CCGCCTTTCT GCTGAAAAAA AAGCTGGGGT TCCGCGCCCT GATGAAAGTT
ATCCCCCTCA ATGGCGACCA GGTGAAAGGC TCCCATGGCA GAGACCGGGT GCCTGCAAAC
CAGCAGCCCG TATTCATCGG CCCGGCCTTC CTGCCGGAAA TCCATGCTGC TGAGGATGTC
CATCAAGCCA TCCTCTCCGT CTTTGAAAAA GAATAA
 
Protein sequence
MEFPFQTANI PPMKHPRTRV AVIDVVALSR QMMEHMPRLS AWAEGRSVSS FPPAFPALTC 
SAQSTYVTGL SPREHAIPGN GWYNRNMCEI QFWKQSNKLV QGPRLWEKLR ERYGSGFTCA
KLFWWYNMYS TADWTITPRP MYPADGRKIF DIYTQPMELR ETIKKDLGEF PFPTFWGPMA
GIQSSQWIAD SARWVERKHR PDLSLIYLPY LDYDLQKFGP SSTQAAHAAE AMDGLLCDLI
DFLEREGVTP VVLSEYGISD VSRSIALNRL FRERGWITVK PEMGTEMLDC GASRAFAVAD
HQTAHIYIND PSVKEEVKAL LSATPGVEEI RETDFSGLSS AALERLPEFT AVAAPDAWFT
YYYWLDDTKA PDFARCVDIH RKPGYDPAEM FFDPGLTLPM FHAAAFLLKK KLGFRALMKV
IPLNGDQVKG SHGRDRVPAN QQPVFIGPAF LPEIHAAEDV HQAILSVFEK E
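
The sequences above are printed in blocks of ten characters, sixty per line. A minimal sketch of how such a block could be joined into a single string, split into codons, and summarized by GC fraction is shown below. The helper names (clean_sequence, gc_fraction, codons) are hypothetical and not part of any database tool; only the first and last lines of the gene sequence are used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def codons(dna: str) -> list[str]:
    """Split a cleaned DNA string into complete, in-frame triplets."""
    usable = len(dna) - len(dna) % 3
    return [dna[i:i + 3] for i in range(0, usable, 3)]


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGGAATTCC CGTTCCAAAC CGCTAATATC CCTCCCATGA AACACCCTCG CACCCGTGTG"
last_line = "CATCAAGCCA TCCTCTCCGT CTTTGAAAAA GAATAA"

first = clean_sequence(first_line)
last = clean_sequence(last_line)

print(len(first))         # number of bases on a full line
print(codons(first)[:3])  # leading codons, starting with ATG
print(codons(last)[-1])   # final codon of the gene
print(gc_fraction(first)) # GC fraction of the first line only
```

Passing the full gene sequence through clean_sequence and gc_fraction gives a value that can be compared with the GC content field of the record; the same clean_sequence helper also works for the protein block.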