Gene Amuc_0573 details


Gene Information

Locus tag: Amuc_0573
Symbol:
ID: 6274572
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 670610
End bp: 671668
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 58%
IMG OID: 642612623
Product: 3'-5' exonuclease
Protein accession: YP_001877191
Protein GI: 187735079
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0349] Ribonuclease D
TIGRFAM ID: [TIGR01388] ribonuclease D
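
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 670610
END_BP = 671668
GENE_LENGTH_BP = 1059
PROTEIN_LENGTH_AA = 352


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid count, assuming the CDS ends in exactly one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 671668 - 670610 + 1 gives 1059 bp, and 1059 / 3 gives 353 codons, one of which is the stop codon, leaving 352 amino acids.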


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 63
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTAGCG AGAAAGAGGA ATTACTGGAA TGGCGCAAAC GCGCCGCCGC ACAGCCCGCG 
GGACGGGTAG TCCTGGATCT GGAGGCAGAC AGCCTGCACC GCTATCAGGA AAAAATCTGC
CTGATCCAAT ATGCGGACGA AACGGGTTCC TGCCTGATTG ACCCTCTCTC TATCGAAGAT
ATGGGGCCTT TCTACAACTG GCTGAAAGAA ACGGAAGTCT GGATGCACGG AGCGGACTAC
GATATGAGCC TCTTTCAAAA CGCCTGGGAA ACGCTGCCCG CCATGATCTG GGATACGCAG
ACGGCGGCGC GCCTGCTGGG CTTCCGCCAG TTCGGGCTGG CAGCCCTGGT GGAACACTTC
CACGGCATCA CCCTGAGCAA ATCTTCCCAA AAGGCGGACT GGGCGCGGCG CCCCCTTTCC
CCAACCATGG TCACTTACGC CCTGAACGAC GTAAATTACA TGCTGGACAT GGCGGACAAA
CTGACGGCCG CCCTCCGGAA AAAAGGACGC ATGGGCTGGT TTGAAGAAAT TTGCAGACAT
TCCATGGAAC GCGCCCGGGA ACGCCATCTG GCAGGCCATC AGGACCCCTG GCGCATCCAG
GGCTGCGGCA AATTGAACAG GAAGGGGCTG GCCGCCCTCC GGGAAATGTG GACCTGGCGT
GATGCGGAAG CCAAAACGTG GGACAAACCC GCGTTCATGG TTTGCTCCAA TGCTGACCTC
ATCCAGTGGA GCGTGGCTCT CCAGGAACAG CGCACCGTGG CGCCCCCGCC CCGTTTTCAT
GCCCACAGGC GCAGCCGGTT CATGAATGCG CTCCAGAAAT TCTACCTGCT GGATGAAGAA
GACTACCCAT GCCGGCCCCG CATTCAGCGC CGGCAACATT CCGACCAATT TGAGGACAAT
CTGGCCCGCC TGTGCAAACT CAGGGATGAA AAAGCTGAAG AACTGGGCAT GGAAGGCTCC
TTCCTGATTA CCCGGGCCTC TCTGGAAGCT ATTGCGGAAG ACAGGGAAAA AGGCGTTTCC
ACCCTGTTGA ACTGGCAGAA GGAAGCCCTG GGTTTTTAA
 
Protein sequence
MISEKEELLE WRKRAAAQPA GRVVLDLEAD SLHRYQEKIC LIQYADETGS CLIDPLSIED 
MGPFYNWLKE TEVWMHGADY DMSLFQNAWE TLPAMIWDTQ TAARLLGFRQ FGLAALVEHF
HGITLSKSSQ KADWARRPLS PTMVTYALND VNYMLDMADK LTAALRKKGR MGWFEEICRH
SMERARERHL AGHQDPWRIQ GCGKLNRKGL AALREMWTWR DAEAKTWDKP AFMVCSNADL
IQWSVALQEQ RTVAPPPRFH AHRRSRFMNA LQKFYLLDEE DYPCRPRIQR RQHSDQFEDN
LARLCKLRDE KAEELGMEGS FLITRASLEA IAEDREKGVS TLLNWQKEAL GF
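
The protein sequence corresponds to the gene sequence read codon by codon. The following is a minimal sketch of that translation, assuming the standard codon-to-amino-acid assignments (which translation table 11 shares for internal codons) and stripping the 10-base block spacing used in the listing. The helper names `clean` and `translate` are hypothetical, and special handling of alternative start codons is not implemented; this gene starts with ATG, so it is not needed here.

```python
# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and the final line of the gene sequence listed above.
FIRST_BLOCKS = "ATGATTAGCG AGAAAGAGGA ATTACTGGAA"
LAST_LINE = "ACCCTGTTGA ACTGGCAGAA GGAAGCCCTG GGTTTTTAA"

print(translate(FIRST_BLOCKS))  # MISEKEELLE
print(translate(LAST_LINE))     # TLLNWQKEALGF
```

The final line of the gene sequence begins at base 1021, which is in frame, so it can be translated on its own; its output matches the end of the protein sequence, with the terminal TAA read as the stop codon.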