Gene Amuc_2045 details

Gene Information

Locus tag: Amuc_2045
Symbol:
ID: 6274743
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 2487489
End bp: 2490545
Gene Length: 3057 bp
Protein Length: 1018 aa
Translation table: 11
GC content: 58%
IMG OID: 642614106
Product: hypothetical protein
Protein accession: YP_001878636
Protein GI: 187736524
COG category:
COG ID:
TIGRFAM ID: [TIGR02595] PEP-CTERM putative exosortase interaction domain
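
The coordinate and length fields above are consistent with one another, which can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the final stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2487489, 2490545)
print(length, protein_length_aa(length))  # 3057 1018
```

Both values match the Gene Length and Protein Length fields of this record.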


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.585641
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value: 0.431873
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGTTT TCTTAACTCT TTTTCAAATA TCCTTTTCAT TTTTCCTTGA CGCTCTGATG 
AAAATCCGGC TCCCTCTCCC CCTCCTGGCT GCCCTGGCGG CGGCATTTCC CGCAGCAACC
GCCGCCCCGG CGGGCACGGA ACTGGGAAAC GTCATGTACG TAGGGGACTC CATCACCCAC
GGGGTCAACA GCGCCTCCTA CCGCTGGGCC CTCCATAAAA TATTTACAGA CAACGGTATT
TCCTACCAGG CGGAAGGAGT CAAAACCGGC AATTATTCCG GGGGCGTTAC TGCGGGGACC
TCCTACGGGG GCCAGATATT CAACAATGAA CATTCCTCCC AGGCAAGCGC CAGGGCCTGG
GAAATATCCG GCAGGAACAG CGGAAGCAGG TTTGACGGCT CCAACATTTA CAACTGGCTG
GGGCTTTCCG GCACCAAGGC AAACGGAAAC GCCTATTCAG GCCAGACATT CACCGGGGAC
AATACGCCGG ATACCTTTTT CATGATGATC GGCACCAACG ATCTCCTTTC CGACGGAAAC
AACGCCACGC TGGCGGACCG CCTTGACTCC GTTTCCCAAA ACCTCCTGAA CGATATGGAC
ACCATCGTGG AGGCCATGTT CCAGGCCAAT GAACATGCAA ATGTCATCAT CCTTACCATT
CCCTGCTGGA CCCGGCATTC CAACGGCAAT TCGGACGCCA CCCACCAGGC CGTGGCTGAT
TACAATGATC GTCTGAAAAC ATGGGGGAAT GGCAGGCAGA ACGTCACGGT CATTGACATC
AATGCCGGCC TCATTGACGT AACATCCTCC ACGCCGTTCT ACGGCGTGAG CTCCATGTTC
AACAACCCCG GCAGCGACGG CCTCCATCCC AATGCCCAGG GAGATCTGCT TATGGCCGGC
AATATCGCCA GGGCCATGGG CTATGCGGGG CGCACCGCCG GACAGGAAAG AAAAGACGCT
GGCGGCCTGG CCATCAATTT TCACCAGGGC GGGCAATCCC CCGCATGGAG CTCGAACAGC
CAGTTGGAAA ATAAAGGGTT TTCCCTCGCC AACGTTACTG TGGACGAGAA CGGCATCAGC
CTGGGGCAAT CCGGGGAAAG CTCCATTTCC TATTCCTGGG CGGAAGGGAC GGAGCTGCAA
AACGGCTTCA CCTTTGATTG CAACCTGGTT CTGGGAAACG GCGCAGCAGA CGGATGGGAT
ACTGCCTCCG ATTTCAGCAT CAGCCTGGGG AACAGCTCCT TTTACGGCAC GCTCAATATT
AATGAGGCCT ACATCAAATG GGGCGATGCC ATTCTGTATT CCCTGGACAT GTCCGCCAAT
GCGGAAAACC TGCGTATGGC ATATATCTGC GGCAATGATA TGGAAGGACT GAAAGGCGGT
TATTACGTCT GGCTGGGAGA CATGCTGATT GGGGAAGCGC TTTCCGTTAC CTCCGGCTCC
GGCTGCAACG GCCTCACCAT TCAATACAAC GGCAGCGGAA CCGCTGTTCT GCACGACCTG
GCCCTGGACG GCACGGGTTC CTATGCCCCT GCCACTTCCG GAATGCTTAA CGCGGAAAAG
TCTTTTATTT CTTCCGGCTC CTTCACCCAG GGTGGAACAC CGGAAGGCAA TATTGAGTGG
AAAACGGAGG GATTCACCAA AACCGCTGAT AATCTGGCCA GCTCCGGTAC CTTTAATGCC
CGCTCCCAGG CGGATTCCTC CGCAGGGGGA ACGGGGAATT CCGTAAACGT CTCCATTACC
TCCGGAAACG CCACTCATAT TTATGCGAAC AGCGGCAACT ACACGGGAGA CGTCTGGCTG
ACCATCTCCA AGGAGGGACA GGCCTCCGCC TGGTACGGAG CTCACGGAGC TTCCGGCACG
CTCAACGGGA ACGCCTTCCT CCGTTTCACG GATGCCGCCA CAGGCGGAAG CACGGTCTTC
GGAGCCGTGA ACGCCGCCGG CGTCACCGGA AACGTTTATC TGGAATTCTC TGCGGAAAAC
GCCTCCTTCG GCACCTTTAC CAGCAGTAAT TCCTCCTCTG TGGTCGGCTC CTATGCCACG
GATATCCAGG GGAATGTGGA CATTGTGGTC AACTCCGGCA CCTTCAACCA TCAAATCATG
GGCGGCATCT TCGCAAATGC CAGGACCGGA ACCACCACCA TTGGAGGAAA CACGCATGTT
TACATCAACG GCGGCTCCGT CACCGGAAAT GTTATGGGCG GCGGCCTGAC AGGCTCCATT
TCCGGCGGCG CCAACGTCAC CGTCACAGGC GGCGTTATCA GCGGTTCCGT TTACGGCGCG
GGCCAGGGCG GCTCCATCCT GGCAGGAAGC TCGGTTTGCT TGACAGGAGG CCTGGTTAAA
GGGGACGTCT ACGCCGGAGG CAAGGCGGGT TCCATTCAGG GGGACACGTC CGTCACCATC
ACCGGAAACA CGGCCACGCT TTATAACGGG AGCTCCTGGG GCAGCATCTC CGGCGGCGGT
TCCGGCGGAA CGGTGGAGGG AAACTCCACC GTACGCATCC AGAATCTTTC CTCAGGAACG
ACGGCCTACG GCTTTGACAA ATACGCCGGA AACATCAGCG GAGGAACCAA CGTAAGCGGA
GACAGGAGCC TGGTGCTGGA CCATGTAACC GTGGACAGCC TCCTGGCTTC CCTGAGTGAC
TTCACCCATA TCTCCGCCGT CAACCAGACC CGCACCTCGC TGGATTCCCT GGGCGAGGCG
CTCACCGTAA CCATTGAGGC GGGAAGCAGC CTTATCCTGA ACGGAACCTC TGACCTGACC
ACGCTTATTC TGGGGGAACA CGCCTCCCTG ACGCTTCAGG GGCTCACCGC AGATGCGGTC
ATCGTGGACA TTACGGGAAC AACCAATTAC ACACTCTCCC TGACAGAGAT ACCTGCCAGC
CTGGACAATA TCAAATTCCT GAACGACGGC GTTCTGTACG ACGCGGCCAT GTCCATGGAT
CTTCAGGCCA ATTCCGCCAT GCTTTTCGCC CAGGTGCCGG AACCCGGAAG CGCCGCCCTG
GCCCTGGCAG GACTGGCCCC CCTCCTGTGG AGGAGGCGCA GAAAAATGTC CCATTGA
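
A GC content figure like the one in the Gene Information table can be recomputed from a sequence laid out in 10-base groups as above. The sketch below is one plausible way to do it: the record does not say how the database derives its value, so the counting rule (G plus C over all A, C, G and T bases) and the function name are assumptions.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C among the A/C/G/T bases of a sequence.

    Spaces and line breaks used for layout are ignored; any other
    characters (for example N) are left out of the denominator.
    """
    seq = "".join(dna.split()).upper()
    counted = [base for base in seq if base in "ACGT"]
    if not counted:
        raise ValueError("sequence contains no A, C, G or T bases")
    gc = sum(base in "GC" for base in counted)
    return 100.0 * gc / len(counted)


# First 20 bases of the gene sequence above, in the same grouped layout.
print(round(gc_percent("ATGGACGTTT TCTTAACTCT")))  # 35
```

Passing the full gene sequence instead of the 20-base prefix gives the value for the whole gene.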
 
Protein sequence
MDVFLTLFQI SFSFFLDALM KIRLPLPLLA ALAAAFPAAT AAPAGTELGN VMYVGDSITH 
GVNSASYRWA LHKIFTDNGI SYQAEGVKTG NYSGGVTAGT SYGGQIFNNE HSSQASARAW
EISGRNSGSR FDGSNIYNWL GLSGTKANGN AYSGQTFTGD NTPDTFFMMI GTNDLLSDGN
NATLADRLDS VSQNLLNDMD TIVEAMFQAN EHANVIILTI PCWTRHSNGN SDATHQAVAD
YNDRLKTWGN GRQNVTVIDI NAGLIDVTSS TPFYGVSSMF NNPGSDGLHP NAQGDLLMAG
NIARAMGYAG RTAGQERKDA GGLAINFHQG GQSPAWSSNS QLENKGFSLA NVTVDENGIS
LGQSGESSIS YSWAEGTELQ NGFTFDCNLV LGNGAADGWD TASDFSISLG NSSFYGTLNI
NEAYIKWGDA ILYSLDMSAN AENLRMAYIC GNDMEGLKGG YYVWLGDMLI GEALSVTSGS
GCNGLTIQYN GSGTAVLHDL ALDGTGSYAP ATSGMLNAEK SFISSGSFTQ GGTPEGNIEW
KTEGFTKTAD NLASSGTFNA RSQADSSAGG TGNSVNVSIT SGNATHIYAN SGNYTGDVWL
TISKEGQASA WYGAHGASGT LNGNAFLRFT DAATGGSTVF GAVNAAGVTG NVYLEFSAEN
ASFGTFTSSN SSSVVGSYAT DIQGNVDIVV NSGTFNHQIM GGIFANARTG TTTIGGNTHV
YINGGSVTGN VMGGGLTGSI SGGANVTVTG GVISGSVYGA GQGGSILAGS SVCLTGGLVK
GDVYAGGKAG SIQGDTSVTI TGNTATLYNG SSWGSISGGG SGGTVEGNST VRIQNLSSGT
TAYGFDKYAG NISGGTNVSG DRSLVLDHVT VDSLLASLSD FTHISAVNQT RTSLDSLGEA
LTVTIEAGSS LILNGTSDLT TLILGEHASL TLQGLTADAV IVDITGTTNY TLSLTEIPAS
LDNIKFLNDG VLYDAAMSMD LQANSAMLFA QVPEPGSAAL ALAGLAPLLW RRRRKMSH
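
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs in the start codons it permits; this sketch does not model alternative start codons, which is harmless here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence up to, but not including, the first stop."""
    seq = "".join(dna.split()).upper()  # drop layout spaces and newlines
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed in this record.
print(translate("ATGGACGTTT TCTTAACTCT TTTTCAAATA"))  # MDVFLTLFQI
```

The output matches the first ten residues of the protein sequence above.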