Gene Amuc_0017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0017 
Symbol 
ID6275223 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp21561 
End bp23006 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content59% 
IMG OID642612057 
Productoxidoreductase domain protein 
Protein accessionYP_001876645 
Protein GI187734533 
COG category[R] General function prediction only 
COG ID[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAATT CATCATCACG CCGTCGTTTC CTTCAGACCC TGGGCCTGGC CACTGGCGCC 
CTGGCTGCCG GTTCCTTTGC CAACGCCCAG GAAGTAGCCC CCCTGGCTCC CAAGAAAATC
ACCATTCCGG ACCCGAATAA CATCGGCCCC ATGACCACGT GGCCCCCGCG CAAGCCCGGC
GCCATCTACA TGGGCGGCTT CAGGGCTCCC AAGCTGGACA AGGTGCGTGT GGCCTTTGTC
GGCGTGGGTG AACGCGGTTC CATGCACGTG GGCCAGATGG CCGTTATAGA AGGTGCGGAA
ATTGTCGGCA TTTGCGACCT GTATGAAGAC TGGGCCAAGC GCAGTGCGGA CGTCGTGGAA
AAGAAGACGG GCAAGCGCCC CCCCATTTTC ACGAAAGGAC CGGAAGACTA CAAGCGCATG
ATGAAGGAAG TCAAGCCGGA CGCCGTCATC GTCTGCCCCA GCTGGGAATG GCACTGCCGT
GTTACCTGCG ACGTGATGAA GATGGGCGCC CACGCCTTTG TGGAAGTGCC TATGGCCGTC
TCCATCAAGG AACTCTGGGA AATCGTGGAT ACCTCCGAAG AAACCAGGAA GCACTGCATG
ATGATGGAAA ACGTCAACTA CGGACGTGAG GAACTCATGT ACCTGAACAT GGTGCGCCAG
GGCGTCATTG GCGACCTGCT GTACGGAGAA GCCGCCTACA TCCATGAACT GCGCGGACAG
ATGAAGCAGG TGGAACGCGG CACCGGTTCC TGGAGAACCT ATCACTACGC CAAGCGCAAC
GGCAACGTGT ATCCCACGCA CGGCCTCGGC CCCATTGCCC AGTACATGAA TCTGGCCCGC
AAGGACGACT GCTTCGGCAG GCTCGTCTCC TTCTCCAGCC CGGCCCTGGG CCGCGCCGCG
TATGCCAAGA AAAATTTCCC GGCGGACCAC AAGTGGAACA AGCTGGACTT TGCCTGCGGC
GATATGAATA CCTCCATCAT CAAGACCACC ATGGGCCGCA CCGTCCTGGT GGAATGGGAT
GAAACCAGTC CGCGCCCCTA CTCCCGCCTG AATCTCATCC AGGGCACCCT GGGCACCTTG
GCCGGCTTCC CGACCCGCGT AGCCGGGGAA AAGCTGGGCA ACGGAAATTA TCATGAATGG
ATTGAAGGCA AAGAAAAACT GGCCCCTATT TTTGAAAAGT ACGATCACCC GCTCTGGAAG
AGAATCGGGC CGCTGGCCCT GAAGATGGGC GGTCACGGCG GCATGGACTT CGTGATGCTC
TTCCGCATCA TCGAATGCCT CCGCAATGGC GAACCGATGG ACCAGAACGT TTATGAAGGA
GCTTTCTGGT CCTCCGTCTC CGAGCTTTCC GAATACTCCG TGGCCCAGGG CGGCATGCCC
CAGGTATTCC CGGACTTCAC CCGCGGAGAC TGGAAAACGA CTGCTCCGCT GGGCATCGTC
CAGTAA
 
Protein sequence
MDNSSSRRRF LQTLGLATGA LAAGSFANAQ EVAPLAPKKI TIPDPNNIGP MTTWPPRKPG 
AIYMGGFRAP KLDKVRVAFV GVGERGSMHV GQMAVIEGAE IVGICDLYED WAKRSADVVE
KKTGKRPPIF TKGPEDYKRM MKEVKPDAVI VCPSWEWHCR VTCDVMKMGA HAFVEVPMAV
SIKELWEIVD TSEETRKHCM MMENVNYGRE ELMYLNMVRQ GVIGDLLYGE AAYIHELRGQ
MKQVERGTGS WRTYHYAKRN GNVYPTHGLG PIAQYMNLAR KDDCFGRLVS FSSPALGRAA
YAKKNFPADH KWNKLDFACG DMNTSIIKTT MGRTVLVEWD ETSPRPYSRL NLIQGTLGTL
AGFPTRVAGE KLGNGNYHEW IEGKEKLAPI FEKYDHPLWK RIGPLALKMG GHGGMDFVML
FRIIECLRNG EPMDQNVYEG AFWSSVSELS EYSVAQGGMP QVFPDFTRGD WKTTAPLGIV
Q