Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_0539 |
Symbol | |
ID | 6275094 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 634325 |
End bp | 637066 |
Gene Length | 2742 bp |
Protein Length | 913 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642612589 |
Product | glycoside hydrolase family 2 sugar binding |
Protein accession | YP_001877158 |
Protein GI | 187735046 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.160142 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAAAG TCACTACCAT TCTCATAACG GCGGCATGTC TGGCCGGCTC TTTTTCCCTT TATGCCGATG CGGTTTCCGA TCAGGAGGAG GCCGCTTCAT CTACGGGAGC CTATTCCGTG GCAGGGGCTC TTCGCCTGCC GGAAAACGTA ACGAAAAGGG ACGTTTACGG CATGAATGTG GGCTGGAAAC TGTTTAAGGG AGAGAAGGAG CCGGAGGAAG CGGCCAGCCC GGATGTTGAC GATTCCTCCT GGGAGTCCGT CAACCTCCCC AACGGCATTG AGCTTCTTCC GGAAGAGGCC AGCGGCTGTT CCAACTACCA GGGGCCGGTA TGGTACCGGA AAACATTTAT TCCTCCCTCC CGGCTGGAGG GGAGGAGAAA CACCCTGTAT TTTGAGGGCA TCATGGGAAA GAGCGAGGTC TGGGTGAATG GGGAGAAGGC CGCGGAGCAT TTCGGCGGAT ATCTTCCCGT GATTGTTAAT CTGGACAAGT GGCTGAAACC GGGACAAAAG AATGTCATCG CCGTGAAGGC GGACAATTCC AACGACGCTT CCTATCCCCC CGGCAAACCC CAGGAGGGCC TGGATTTTTC CTATTTTGGC GGTATTTACC GGGATGTGTA CCTCATTTCC ACCGGACCCG TGTATATCAC GGACCCGAAC GAGGCCGGAA CTGTGGCCGG GGGGGGAGTT TTTTTCCGGA CGGAATTTCT TGATCCCCGT ACCCGCAAGG GCAAGGTGGG GGTGAAGGTT CAGGTGGCCA ACCAGACAGA CAAGGAACGG AAGGTGCGTG TGCAGGCGGT GATGACGGAC CCCAAGGGGG TGGATCCTGC CGGGGAGACG GTTCCGCTGA CTATTCCTCC ACATTCCACG GGTGAAGTGG ATATTCCTCT TGTGCTTTCC AACGTGAAAC CGTGGTCTCC GGACAATCCC GATTTGTACA CGTTGAGCGT GGAAGTATGG GACGCTGCTG GAGGGGACCA TTCCAAAAAT CCGGCCCATT TGCTGGACAA CCGTTCCATA AGGGTGGGTG TCCGGACGGT GGAAATTACG GAAAAGGGGC TGGTGCTCAA TGGATCCCTG TTTCCTGAAA AGCTGATTGG CGGCAACAGG CACCAGGATT TTGCCCGGCT TGGGAATGCA GTGCCCAACA ACCTGCAATG GCAGGATGCC GTGAAGTTGA GGAAGGCAGG CATGCGCGTG ATCCGGAGCG CCCATTATCC GCAGGATCCG GCTTTCATGG ACGCCTGCGA CCGTCTGGGC CTTTTCGTCA TCGTCGCCAC GCCGGGCTGG CAGTTCTGGG GCAGCGGCCC TTTTGCGGAT CGGGTGTATG ACGATATCCG CCAGATGGTG CGCCGGGACA GGAACCATCC CTCCGTGATG ATGTGGGAAC CTATTTTGAA TGAAACTCAT TACCCGGCGG ATTTCGCCAA AAAGGCCCGC GACCTGGTGC ATGAGGAATA TCCGTACAAG GGATGCTACA CCGCTTGCGA CGCGGTGGCT CAGGGCAGCC AGCATTATGA AGTTCTGTAT GCCCACCCGG TTACGGGGGA CAAACACTGG TCCATCAAGG AACGGAAGAA CAGCAAACCG TATTTTACCC GTGAATTCGG TGATAACGTG GATACCTGGT CCGCCCACAA CTCCACCTCC CGTGTAGCCC GTCACTGGGG TGAAGCTCCC ATGATGGTGC AGGCCCTTCA CTACCTCAAG ACCTCGTTCC CATATACGAC GTACGATACG CTGAACGCCG CTCCGGCCTA TCATTTCGGC GGCTGCCTGT GGCATCCTTT CGACCATCAG CGCGGTTACC ATCCGGATCC CTTCTACGGC GGTATTCTGG ATGCTTTCAG GCAGCCCAAA ACCTCCTATT ACGCATTTAT GTCCCAGCGT CCCCAGAAGA CGCGCAACGG GTTGGGTTCC GGCCCTGTGG TGCACATCGC CAATGAATGC ACTCCCTTTT CTCCGGAAGA CGTGACGGTG TTTTCCAATT GTGATTCCGT TCGCCTGAGT GTCAACGGAG GGCCCCCGGT GGAAAAGAGG GTGGCGTCCT GTCCCGGAGG CCTCAAGCGC GTGCCCGTCG TGTTCCCTAA AGCCTGGGAT TTCATGGAAA ACAAGAAGCT TGCCCGAGCA GGCAAGGAGG GAGCGGTGAA ACTGGTGGCG GAAGGCCTGA TAGGCGGCAA GGTGGTGACC AGGCATGAAG TCCGCCCGGC CCGGAGGGCG GAGAAAATCC GGTTGACTCT GGACCGGGAA GATGGAGTGG ATTTGTACGC CAACGGTTCC GATGTGTTTG CCGTGGTGGC GGAGGTTACG GATGGCCGCG GAACAGTGAA GCGCCTGAAT GATGAGGAGA TTGTCTTTTC CGTGGAAGGC CCCGCCGAGC TGCTGACGGA TTCCCCGGAC GGCACGCTGA CCCAGTCCGT TAAATGGGGT TCCGCTCCGG CCCTGGTGCG CCTGGGGGCA ACACCCGGCA CGGTGACGGT GAGAGCCTCC GTGAAGCATC CCGGTTCTCA AAAGCCCGTT TCCGGCGTCA TAAGGTTTGA CACGAAGGCT CCCGGGTTGA AAATGCTCTT TACGGAAGAC GCCGCAGGCA GCGCGAAGAA GGTTGCCGGT ACGCCGTCTT CCTCCGAGGC TCCTGCCTCC GAACGGGAGA AATCCCTTCA GATGGAGCTG GAAAAAGTGC GCCATGAACT GAACAGGTTG CGCAATGAAA AAGTGAGCGC CCAGCAAACC CACTTCGAAT AA
|
Protein sequence | MMKVTTILIT AACLAGSFSL YADAVSDQEE AASSTGAYSV AGALRLPENV TKRDVYGMNV GWKLFKGEKE PEEAASPDVD DSSWESVNLP NGIELLPEEA SGCSNYQGPV WYRKTFIPPS RLEGRRNTLY FEGIMGKSEV WVNGEKAAEH FGGYLPVIVN LDKWLKPGQK NVIAVKADNS NDASYPPGKP QEGLDFSYFG GIYRDVYLIS TGPVYITDPN EAGTVAGGGV FFRTEFLDPR TRKGKVGVKV QVANQTDKER KVRVQAVMTD PKGVDPAGET VPLTIPPHST GEVDIPLVLS NVKPWSPDNP DLYTLSVEVW DAAGGDHSKN PAHLLDNRSI RVGVRTVEIT EKGLVLNGSL FPEKLIGGNR HQDFARLGNA VPNNLQWQDA VKLRKAGMRV IRSAHYPQDP AFMDACDRLG LFVIVATPGW QFWGSGPFAD RVYDDIRQMV RRDRNHPSVM MWEPILNETH YPADFAKKAR DLVHEEYPYK GCYTACDAVA QGSQHYEVLY AHPVTGDKHW SIKERKNSKP YFTREFGDNV DTWSAHNSTS RVARHWGEAP MMVQALHYLK TSFPYTTYDT LNAAPAYHFG GCLWHPFDHQ RGYHPDPFYG GILDAFRQPK TSYYAFMSQR PQKTRNGLGS GPVVHIANEC TPFSPEDVTV FSNCDSVRLS VNGGPPVEKR VASCPGGLKR VPVVFPKAWD FMENKKLARA GKEGAVKLVA EGLIGGKVVT RHEVRPARRA EKIRLTLDRE DGVDLYANGS DVFAVVAEVT DGRGTVKRLN DEEIVFSVEG PAELLTDSPD GTLTQSVKWG SAPALVRLGA TPGTVTVRAS VKHPGSQKPV SGVIRFDTKA PGLKMLFTED AAGSAKKVAG TPSSSEAPAS EREKSLQMEL EKVRHELNRL RNEKVSAQQT HFE
|
| |