Gene Information |
Locus tag | Amuc_0757 |
Symbol | |
ID | 6275321 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 890740 |
End bp | 891894 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642612808 |
Product | glycosyl transferase family 2 |
Protein accession | YP_001877374 |
Protein GI | 187735262 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
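The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state this, but it is the convention that reproduces the listed Gene Length) and that the protein length excludes the terminal stop codon. The function names are hypothetical and exist only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Amuc_0757.
length = gene_length_bp(890740, 891894)
print(length)                     # 1155
print(protein_length_aa(length))  # 384
```

Both results agree with the Gene Length (1155 bp) and Protein Length (384 aa) fields listed in the record.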
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.122576 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 60 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCCCG ATGTTTCCAT CATTGTTCCC TGCTACAATG TAGCCGCTTA CGTGGACAGC TGCCTGGAGA GCCTGGTACG CCAGACTCTC CGGAACATAG AAATTATCTG CATCAATGAC GGCTCCACGG ATGAGACCTG GACACATCTG CTGCGCTGGA AAGAAAAGGA CAGCAGAATC ATCCTTCTGA ACCAGCGGAA CGCGGGCGTC TCCGCAGCAA GGAATGCAGG GCTGGATGCC GCCCGCGGCC TTTATGTCGG TTTTGCGGAT CCGGACGACT ACATGGATCC GGAAATGTAT TCCCGCCTTT TTTCGGCGGC TCTGGAATAT GACGCAGACA TCGTGGAATG CGGCAACCAT GTTTTTGAAG ACTCGTCAGA CCGGATTATC GAAGCCAAAA GAAGATCACC CTCCCGGCAT TTTGAAGAGA ACGCCTCTCC GGCCAGCTTC TTCCGGGATT CCATCTGGGG GAAAATGGAT ATCTGCGTGT GGAGCAAACT GTTCCGGAAA AGCATGCTGG ACGCCCACCG CCTCCGCTTC AACGTACATC TGAAATCCGG CGCGGAGGAT GAAACCTTCC GGCTGATGGC CGTTCCCCAT GCCTCCCGTC TCCTGTTCAT TCCCGACTGC CTGTATTACT ACCGCCTTAT GCGCAACGGC TCCCTCTCCC GCCGCTGCAA CGTTCCCACC TACTCCAAAT GCGTGCAGGA ATTCCAGCGG CTGCTGTACA TTGTGGACTA CTGGCAGAAA CAAGGATGGC AGAATGAAGG CCTGTTCGCT TACGGCGTCC GGAAAATCAG GCCTTTTTTT GTTTCCAAGC ATCCCCTTTT CCATCAGATG ACCGCTGTCC AGCAACGTTC CGCGCTGGAC TGGTGGAGCC TGTTCTATCA GAAGGCGGAA GGAAAACGTT TCCTCTCCGC TCTATCGGGA CGGGACAGGC AACTGGCGGA CCTGTTGAAC TCGGCGGAAC CGGTCCCCAA CGGCTGGGGG CGCATTCTGC TGGCAGCCTG CTCCCTGCTT CCCGGACAAA AAGGACGTTA TTACTCCTGC AAAAAAATGC TTGCGGAACA TTTTTCCCAA ATCTGCCCCA ATGAGTTTCA AAAAGAAAAC GCTTCTCTGG AAGAGCCGTT TGACACATCG CCGCCTTCCC TGTAA
|
Protein sequence | MIPDVSIIVP CYNVAAYVDS CLESLVRQTL RNIEIICIND GSTDETWTHL LRWKEKDSRI ILLNQRNAGV SAARNAGLDA ARGLYVGFAD PDDYMDPEMY SRLFSAALEY DADIVECGNH VFEDSSDRII EAKRRSPSRH FEENASPASF FRDSIWGKMD ICVWSKLFRK SMLDAHRLRF NVHLKSGAED ETFRLMAVPH ASRLLFIPDC LYYYRLMRNG SLSRRCNVPT YSKCVQEFQR LLYIVDYWQK QGWQNEGLFA YGVRKIRPFF VSKHPLFHQM TAVQQRSALD WWSLFYQKAE GKRFLSALSG RDRQLADLLN SAEPVPNGWG RILLAACSLL PGQKGRYYSC KKMLAEHFSQ ICPNEFQKEN ASLEEPFDTS PPSL
|
| |
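The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any calculation. The sketch below shows one possible way to do that and to recompute a GC percentage or check for a terminal stop codon. The helper names are hypothetical, and the stop codon set is the one used by translation table 11 (TAA, TAG, TGA). The fragment used here is only the first two blocks of the gene sequence, so its GC percentage is not expected to match the 55% reported for the full gene.

```python
# Stop codons in NCBI translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return seq[-3:] in STOP_CODONS


# First two blocks of the gene sequence from the record.
fragment = clean_sequence("ATGATCCCCG ATGTTTCCAT")
print(len(fragment))         # 20
print(gc_percent(fragment))  # 45.0

# Last two blocks of the gene sequence from the record.
print(ends_with_stop(clean_sequence("CCGCCTTCCC TGTAA")))  # True
```

Passing the complete Gene sequence field through `clean_sequence` and `gc_percent` would give a value to compare against the GC content field, which the record reports rounded to a whole percent.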