Gene Information |
Locus tag | Amuc_1103 |
Symbol | |
ID | 6273998 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 1318190 |
End bp | 1319269 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 642613154 |
Product | Glutamyl aminopeptidase |
Protein accession | YP_001877710 |
Protein GI | 187735598 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
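
The start, end, gene length, and protein length fields above are mutually consistent, which a short sketch can confirm. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Amuc_1103.
length = gene_length_bp(1318190, 1319269)
print(length)                     # 1080
print(protein_length_aa(length))  # 359
```

Both results match the Gene Length (1080 bp) and Protein Length (359 aa) rows.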
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 0.24305 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGATTG ATTCTGATCT GCTGAAGAAG TTTTCAGAGG CCCACGGTAT TTCGGGGCAT GAGGATGAAG TGAGAAGCCT GGTGGCTCAG GAACTTGAAG GATATGGGGA ATTTTCTTCC GATGGTTCCG GCAGCCTGTT CTGCACCGGC GGCGAGGCCG GCCCCCGCGT GATGCTGGCC GCCCACATGG ATGAAATAGG CTTTCTGGTG CAGAATATAG CGTCGAACGG TTTCCTTCAG TTGGTGGGGA TAGGCGGCTG GTGGCCGCAT ACCCTGTTAA GCCAGCGTGT CTTGGTAAAA ACCCGCTCGG GCCGCGGCAT CCGGGGGGTG ATTGGTTCCA AGCCTCCCCA TTTTCTGCCG GAAAGCCAGC GCAACAGTGT CATGAGCATG GAAGCCCTGT TTGTGGACGT GGGCGCTGAA AGCGCGGAAC AGGTGAAGAA TGAATTCGGT ATTCACCTGG GGGATCCCGT GGTGCCGGAC GTGAGATTCT CCCCGTTGGA AAATCCGTTC CGGGTCATGG GGAAAGCCTT TGACAACCGC GCCGGTCTTT CTGTGATGAT AGAGGCATTC AAGAAATTAT GCCGGGAAGG GCATCCCAAC ACCCTGATTG CCGCCGCGAC GGTTCAGGAG GAGGTGGGCA CGCGGGGCGC CAGGACGGCC GGCGTCGCCA TGCAACCGGA TTGCGTGATT GTTCTGGAAG GGCCGCCGGC AGACGACACC CCCGGGTTTG CCGTGACGGA TTCCCAAGGG GCCTTGGGCG GAGGCGTGCA GATCAGGCTG TTTGACCCCA CGGCCATCAC CAATCCCCGG CTGGCCGCCC TGGCCGAAAA AACGGCTTTG GATGCGGGCA TCCCATTCCA GTTGACGGTG CGCCGTTCCG GCGGAACGGA TGCGGCGGCC CTGCATCTTT CCGGAAAGGG TGTTCCCACT ATTGTGCTGG GAATTCCCAC CCGGTACATT CATGCCCATA ACGGCGTGCT GGATCTGCGG GACTACCGCG CCGCGGTGGA ATTGACGGTG GCTCTGGCCC GTTCCCTGGA CCAGGAGGCC GTGGAAGCCC TTACCCACTA CCTGCCCTGA
|
Protein sequence | MAIDSDLLKK FSEAHGISGH EDEVRSLVAQ ELEGYGEFSS DGSGSLFCTG GEAGPRVMLA AHMDEIGFLV QNIASNGFLQ LVGIGGWWPH TLLSQRVLVK TRSGRGIRGV IGSKPPHFLP ESQRNSVMSM EALFVDVGAE SAEQVKNEFG IHLGDPVVPD VRFSPLENPF RVMGKAFDNR AGLSVMIEAF KKLCREGHPN TLIAAATVQE EVGTRGARTA GVAMQPDCVI VLEGPPADDT PGFAVTDSQG ALGGGVQIRL FDPTAITNPR LAALAEKTAL DAGIPFQLTV RRSGGTDAAA LHLSGKGVPT IVLGIPTRYI HAHNGVLDLR DYRAAVELTV ALARSLDQEA VEALTHYLP
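
The sequences are printed in space-separated blocks of 10 characters. As a minimal sketch of how the gene sequence relates to the protein sequence, the snippet below strips the spacing from the first three blocks and translates them codon by codon. The codon table is built by hand, using the standard amino acid assignments, which translation table 11 shares. The helper names `clean` and `translate` are hypothetical.

```python
# Standard codon-to-amino-acid assignments, with bases ordered T, C, A, G.
# Translation table 11 uses the same assignments and differs only in the
# set of permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(blocked: str) -> str:
    """Remove the spacing between 10-character blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence from the record (30 bases).
prefix = clean("ATGGCGATTG ATTCTGATCT GCTGAAGAAG")
print(translate(prefix))  # MAIDSDLLKK
```

The output matches the first block of the protein sequence, MAIDSDLLKK.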