Gene Information |
Locus tag | Amuc_2090 |
Symbol | |
ID | 6275638 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 2541168 |
End bp | 2542343 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642614152 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001878680 |
Protein GI | 187736568 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
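The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, not counting the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
start_bp, end_bp = 2541168, 2542343
length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1176 391
```

Both results agree with the Gene Length and Protein Length fields listed in the record.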
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACATCC TCTTTTTGCT CGGAAAATTC CCTTCCATAG GCGGCGTGGA AACCGTCACC GCCATTTTGG CGAATGAATT CTCCGCCCGG GGCCATGCGG TTCATGTGGT TTCCTTTGAA CAGGTAACGG AAAAGCCCTC CCCAGCGCTG GACGAACGGG TCACGCTGCA CCGGCTGAGC TACCCCGTTT CCAGCCGGTC CAACCGGAAT GCCCTGCGGG ACATCCTGGC CACCTGCCGC ATTGACGTCA TCATCAACCA GTGGTGCCTG CCCTTCCACG TCACCAGGCT GTGCCGGAAA GCCATGAGGG GCCTGCCCTG CCGCCTGCTG GCCGTCCACC ATAACGCCCC GGACTGCAAC GCACGGCTGG AAGGCCTCAG GATGCGCATG GCCCGGACGG GAAACCCGGT GAACAGGGCG TCCCTGCGCC TTCTGCTGAA AGGCTGCGCC ATGGCTACCG GGGCCAGCCT GCGTTACGTT TACGCCCACA GCGACCGTTA CATCCTCCTT TCAGACAGTT TCCACCAGGC CTTCCGGAAC ATCACCGGAC TGAAAGACAC CGGAAAACTT CTGACGATTC CCAACCCCAT TACCGTGGAA AACCCGGAAT TCCGCTATGA ACCGGGCCTC AAGAAAAAGG AGGTTCTTTT TGTCGGACGG CTGGAACCCA ACCAGAAACG GGTCTCCCGC GTGCTGGAAA CGTGGGCGCT GCTGGAACCC TGCTTCCCGG ACTGGACTCT CCGCCTGGTG GGTGACGGGC CGGAAAAACG CTCCCTTCAG GAATTCTGCG AGGAACACCG CCTGAAGCAC GTCTCCTTTG AAGGCTTCCA AAATCCTGCC CCGTATTACG AACAGGCCTC CCTGCTCTTT TTAACCTCGG AATATGAAGG ACTTCCTCTC GCTATGGTGG AAGCTATGTC CTTCGGCGTC TCCCCCATTG TTTACGGAAG CTTCTCCGCC GCCTATGACC TGGTGGACCA CGGAAAAGAC GGCTGCATCC TGCCCGCGGC CGGCGGTTTC CAGGCGCATC GGATGGCGGA AATGGCCGCA GGGCTGATGC GGGAACCGGC CGCCCTGCGC GCCATGGCGA GGAACGCCAT AGCCAAAAGC CGGAAATTCA CGCGGGAACA TATCATTCCC CAGTGGGAAA AAGCTTTCCT CCCAGACGCC TCCTGA
|
Protein sequence | MNILFLLGKF PSIGGVETVT AILANEFSAR GHAVHVVSFE QVTEKPSPAL DERVTLHRLS YPVSSRSNRN ALRDILATCR IDVIINQWCL PFHVTRLCRK AMRGLPCRLL AVHHNAPDCN ARLEGLRMRM ARTGNPVNRA SLRLLLKGCA MATGASLRYV YAHSDRYILL SDSFHQAFRN ITGLKDTGKL LTIPNPITVE NPEFRYEPGL KKKEVLFVGR LEPNQKRVSR VLETWALLEP CFPDWTLRLV GDGPEKRSLQ EFCEEHRLKH VSFEGFQNPA PYYEQASLLF LTSEYEGLPL AMVEAMSFGV SPIVYGSFSA AYDLVDHGKD GCILPAAGGF QAHRMAEMAA GLMREPAALR AMARNAIAKS RKFTREHIIP QWEKAFLPDA S
|
| |
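The sequences above are printed in space-separated blocks of ten characters. The following is a rough sketch of how the gene sequence could be cleaned, checked for GC content, and translated. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons), which is sufficient here because the gene begins with ATG. All function and variable names are hypothetical, and only the first 30 bases of the gene are used for brevity.

```python
from itertools import product

# Codon-to-residue assignments in TCAG order; assumed shared by the
# standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
prefix = "ATGAACATCC TCTTTTTGCT CGGAAAATTC"
print(translate(prefix))  # MNILFLLGKF
```

Passing the full gene sequence to `gc_fraction` and `translate` in the same way allows a comparison against the GC content and protein sequence fields of the record.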