Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_0855 |
Symbol | |
ID | 6274314 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 1018288 |
End bp | 1020438 |
Gene Length | 2151 bp |
Protein Length | 716 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642612910 |
Product | glycoside hydrolase clan GH-D |
Protein accession | YP_001877469 |
Protein GI | 187735357 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3345] Alpha-galactosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.705806 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 73 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTCCGG ATTCCGTTCA TTTGCTGACC CGTTCCTCTC AGATGACCCT GTACAAAGAT GAACAGGGCC GTGTGCGCCG CGCCTATTAC GGACCGAAGC TGCGTGAACC GGAGGATGCC CTGGAAAACG CCGCGGATGC TCCTCTCCTG TATTCCTCCC TGGCGGATTC CGTAACAGGG CTTCCCAATG CGTCCGGCGA GTACTGCATT TGCGTAACCC AGGCGGACGG GGCCCTTTCC CTGAGCCTGG AATGCCTGGA TTGGGAAGTG CTGGAACTGG ATGACGACCG GGAAGAAGCC GTCTTCTATG GGAAGGACCC TTCCTACCCT GTGCGGGTGG AAATCCACGT GCGCTCCCAC AGGGAATCGG ACACTTTTCT GCAATGGGCG GTGATTCGGA ACGAAGGGAA AGAAGGCATC CGCCTGCACC GCGCGGCATC CGCACAGCTG GGTTTGCGCG CAGAACGGTA TTTCGTCACC AGCTTCCGCG GCACCTGGGG CGGCGAGTCC CTGATGAGTG AGGAAGAAGT GGCCCGGGGG CATGAACTTG CCCTGGTATC CGGAACCGGC ACCCGCACCG CCCAGGAGGG AAGCCCCGGA TTCATCATCT CTCTGGACGG TCCGGCGCGG GAGGATTCCG GGGAGGTGGT TCTGGGGGCG CTGGCATGGT CCGGCAATTA CCGGATATGG TTCCGGCACA GCCCGTATCA TTACCTTTTT GCCGGAGCGG GGCTGGATAT GGCTCCCGCG CCCTATCTGC TGGACGGCGG CGGTGTGTTC AAGACGCCGC CCCTCATTCT GGCGCACAGT AAAAACGGCA AGGGAGAGGC CTCCCGGCGC ATCCACCGCT GGGCGCGCCG TTACGGCCTG CGCGGTGGAG AATCAGAGAG GCCGACCTTG CTGAATTCCT GGGAGGGGGT GTATTTTACC TTTACGGAAA AAGTGCTGCA CGGCATGATG AAACGTGCGG CAGACTTGGG GATAGAGCTT TTCGTGCTGG ATGACGGCTG GTTTGGCGGC CGGTTCCCCC GCAATGACGC CCGTGCCGGA TTGGGGGACT GGCAGGTTAA CCGCGCTAAA TTGCCTCATG GACTGGAAGA CCTCATTCGC CAGGCGGAAA AACTGGGAAT ACGCTTCGGC ATCTGGGTGG AGCCGGAAAT GGTGAACCCT CATTCGGAGC TTTATCAGAA CCATCCGGAG TGGGCCATCG GTCTCCCGCA TCGGGAAAAC AGGCTGGAAC GCAGCCAGTA TTTGCTGGAC CTGAGCAATC CTCATGTATG CAGGTACATT CTTGATGCCA TGCGGAAGCT GCTTGGGGAG CATCCGGGTA TTTCCTACGT CAAATGGGAT TGCAACCGCA AAATTTCCGA CCCCGGCTCT CCCTGGCTGG ATGCCTGCCG CCAGGGGAAT CTGCCCATTG ACTATGTGCG GGGCTATGAA CGTATTCTTG AGACGCTGGC GGCGGAATTC CCGGATGTGA TATTCCAGGC CTGTTCCTCC GGAGGAGGCC GCGCCGACTA TGGAACCATG CGGAAGCATC ATGAATTCTG GACGTCCGAC AACACGGACG CTTATGAACG CGTGTTCATG CAGTGGGGCA TAGGCCACCT GTTCCCGGCT ATTTCCATGG CCGCCCATGT GACCGCCTCC CCCAACCACC AGACGGGCCG CTCTGCTCCG CTCAAATTCC GGTTTGACGT GGCCATGTCC GGCCGGCTTG GTTTTGAACT CCAGCCATGT GACATGACGG AGGAGGACAT GGTTTTTTCC AAACGGGCTC TGGCGGAATA CAAGCGCATC CGCCCCGTAG TCCAGTTCGG TGACTTGTAC CGGCTTTCCT CCCCGTATGA AAGCCGGGTG GCCTCCCTGA TGTATGTATG CGGAAAACGG GCTGTCGTTT TCGCCTGGCT GATGGATAAA TGGCTGGCGG ATTCCCCTCC GCCCCTGCGC CTGAAAGGCC TGAATTTTTC CGCTAGGTAC TTGGTGCGGG AAATCAACAT GAATGAAGAC GGCTCTCTCA CGGATGTTCA CGAACGAAGG CTGGGAGGCG ACTTTCTGAT GGACGCCGGC ATCCGCATCC GCTGGAAAAA GGCCTTCCAG TCCGTCTGCC TGGAGGTGGT GGAAACAAAC CCTGATGCTC CGGCTTCATG A
|
Protein sequence | MIPDSVHLLT RSSQMTLYKD EQGRVRRAYY GPKLREPEDA LENAADAPLL YSSLADSVTG LPNASGEYCI CVTQADGALS LSLECLDWEV LELDDDREEA VFYGKDPSYP VRVEIHVRSH RESDTFLQWA VIRNEGKEGI RLHRAASAQL GLRAERYFVT SFRGTWGGES LMSEEEVARG HELALVSGTG TRTAQEGSPG FIISLDGPAR EDSGEVVLGA LAWSGNYRIW FRHSPYHYLF AGAGLDMAPA PYLLDGGGVF KTPPLILAHS KNGKGEASRR IHRWARRYGL RGGESERPTL LNSWEGVYFT FTEKVLHGMM KRAADLGIEL FVLDDGWFGG RFPRNDARAG LGDWQVNRAK LPHGLEDLIR QAEKLGIRFG IWVEPEMVNP HSELYQNHPE WAIGLPHREN RLERSQYLLD LSNPHVCRYI LDAMRKLLGE HPGISYVKWD CNRKISDPGS PWLDACRQGN LPIDYVRGYE RILETLAAEF PDVIFQACSS GGGRADYGTM RKHHEFWTSD NTDAYERVFM QWGIGHLFPA ISMAAHVTAS PNHQTGRSAP LKFRFDVAMS GRLGFELQPC DMTEEDMVFS KRALAEYKRI RPVVQFGDLY RLSSPYESRV ASLMYVCGKR AVVFAWLMDK WLADSPPPLR LKGLNFSARY LVREINMNED GSLTDVHERR LGGDFLMDAG IRIRWKKAFQ SVCLEVVETN PDAPAS
|
| |