Gene Information |
Locus tag | Amuc_1644 |
Symbol | |
ID | 6274431 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 1986772 |
End bp | 1987911 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642613704 |
Product | transglutaminase domain protein |
Protein accession | YP_001878245 |
Protein GI | 187736133 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1305] Transglutaminase-like enzymes, putative cysteine proteases |
TIGRFAM ID | |
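The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that Gene Length includes the stop codon while Protein Length does not; the function names are illustrative and are not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(1986772, 1987911)
print(length)                     # 1140
print(protein_length_aa(length))  # 379
```

Under these assumptions the computed values agree with the Gene Length (1140 bp) and Protein Length (379 aa) fields.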
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 0.258709 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCTGC TGCCTTTCGC CCTTCTCCCG GGCCTGACCG CCGCTATTTG CGCCGCCCAT GACGCCATTC AGGACGGGGT GGAGTTTCTG CGTGCCTACA TGCCCGCGCA GGACAGGGGA ACGGTTACGC AAGAAAGGCT GGTCCGGGAA GTCCGCCTCG CCCTGGCTGC ACGGAGCCAA TTCCCCTGGG CTGCCCAGGT TCCGTGGGAA CTTTATGAAA ATAACGTTCT CCCTTATGCC GTGGTCAACG AACCGCGCGA CGAATGGAGG GAGCAGTTCC ATCACCTCTT CGCACCGCTT GTTTCCCCAT GCAAGACGGG GCGGGAAGCC GCCCTCGCCA TCGCCTCCCG CATCCAGAAA ACCCTGAATG TACGCTATTC CACGGAGAGA AGAGTTCCCC ACCAGGGAGT CAAGGAATCC CTGCAATCCG GCAAGGTCTC CTGTACGGGC CAAAGCATCC TGCTAATCTG CGCTCTTCGC TCCGTGGGCA TTCCGGCCCG CATGGCGGGC GTTCTTACCT GGAACCACGT GCGCGGCAAC CACAACTGGG TGGAAGCCTG GTGTGACGGA GAATGGAAAA TGCTGGAATA CAATGAAAAG GACTTCAACA CCCCGTGGGT GATGTCCGCC ATCAGCATGC TGGATCCCCG CAAACCGGAG AACGCCATTT ATGCCACCTC CTGGAAAAAA GAACCTTCAG GAGCCTTTTT CCCTATGATA TGGGAAGCCC GCTACGACGA CAAACGGCAC GCGCTGGCTT TCCCTCCGGA AAGCCGTACC GTCCCCGCCG TCAATATCAC GGACCGCTAC ATGAAACTGG CGAATGAATG GGTGGCGGCC CAACCGGAAT ATGTGCCCGG CAGCCGGCTG ATGCTGGACA TCAGGGAAGA GAGAAAAAAC GGTGCCAGAA GGCTTCCCTT GCACGTCGTC CTCAAATCGG AAGAAGGGAA AGTTCTGGCG GAAGGCATTA CACCCGGACC GTCCGACGAC ATGAGGAAGT TTCTTGAGGT ACTCCTGCCG GACAATATTT CCCGCGGCAT GCTGGAGTTC AAGCTGCCTG ACGGAACCGT GCGCCATGAA CCTGTGGCAC ACACGGAGGC CCCGGTTCAG ATTTTGAACT TCTTCGTGTC CGCTCCATGA
|
Protein sequence | MNLLPFALLP GLTAAICAAH DAIQDGVEFL RAYMPAQDRG TVTQERLVRE VRLALAARSQ FPWAAQVPWE LYENNVLPYA VVNEPRDEWR EQFHHLFAPL VSPCKTGREA ALAIASRIQK TLNVRYSTER RVPHQGVKES LQSGKVSCTG QSILLICALR SVGIPARMAG VLTWNHVRGN HNWVEAWCDG EWKMLEYNEK DFNTPWVMSA ISMLDPRKPE NAIYATSWKK EPSGAFFPMI WEARYDDKRH ALAFPPESRT VPAVNITDRY MKLANEWVAA QPEYVPGSRL MLDIREERKN GARRLPLHVV LKSEEGKVLA EGITPGPSDD MRKFLEVLLP DNISRGMLEF KLPDGTVRHE PVAHTEAPVQ ILNFFVSAP
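The sequences above are printed in space-separated blocks of 10 characters. As a minimal sketch, the code below strips that spacing, computes a GC fraction, and translates a nucleotide string codon by codon. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TGA) and uses the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons). The helper names are illustrative and are not part of the record.

```python
from itertools import product

# Codon table in the conventional TCAG ordering used for genetic code listings.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalize to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate("ATGAATCTGC TGCCTTTCGC CCTTCTCCCG"))  # MNLLPFALLP
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.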