Gene Information |
Locus tag | Amuc_1288 |
Symbol | |
ID | 6273845 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 1560701 |
End bp | 1561972 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 642613345 |
Product | peptidase U32 |
Protein accession | YP_001877894 |
Protein GI | 187735782 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
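The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the Gene Length includes the terminal stop codon. The function names are hypothetical, not part of any database API.

```python
# Values copied from the Gene Information table for Amuc_1288.
start_bp = 1560701
end_bp = 1561972
gene_length_bp = 1272
protein_length_aa = 423


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in the record.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions, 1561972 - 1560701 + 1 gives 1272 bp, and 1272 / 3 - 1 gives 423 aa, which matches the listed values.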
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 77 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTTTTC AAAACCGGAT AGAAATCATG GCCCCGGCGG GGTCGTTTGA ATCTCTGGCG GCCGCCTTGC AGGGCGGAGC GGACTCCGTG TACTTCGGAG TAGGAAAGCT GAACATGCGT TCCCGCGCCA CGGTCAATTT TTCGGAAGAA GATTTGCCGG AAATCGTCGA GCGTTGCCAT GAGGCGGGCG CCAAAGCTTA TCTGACACTG AATATCATTG TGTACGATGA AGAGCTGGAG GCGGTGCATG CCCTGTGCGA CGCCGCCCGG AAAGCCGGTG TGGATGCCGT CATCGCTTCC GACCTGGCGG TGATTTCTTA TGCGCGTTCC ATTGGGCTGG AAGTCCATAT GTCCGTGCAG GCCAACGTCT GCAACATGGC CTCCGTCAAA TTTTACGCGC AGTACGCGGA TGTGGTGGTG CTGGCCCGGG AACTCACCCT GGCGCAAATC AGGCATATCA TTGAATCCAT CCGGAAGGAG GGCGTGAAAG GCCCCTCCGG GGAGCTGCTG CGGGTGGAAA TTTTCGCCCA TGGGGCGTTA TGTGTGGCCG TGTCCGGCAA ATGCCACATG AGCCTGGCGG CCTACAACTC CTCCGCCAAC CGGGGGGCCT GCTTCCAGAA CTGCCGCCGC GCCTACCGCG TGACGGATGA GGAAACGGGC AATGAACTGG TGATAGACAA CAAATACGTG ATGTCTCCCA AGGACCTGTG TACCATTCCG GTGCTGGACC AGCTTCTGGA CGCGGGCGTT TCCGTGCTGA AGCTGGAAGG GCGCGGCCGT TCCTCGGATT ACGTCAGGAC GGTTACCTCC GTGTACCGGG AAGCCGCGCG GGCATGCCAG GACGGAACCT TTTCCGCGGA CAGGGCGGAA GCGTGGATGA AACGGCTGGA ATCCGTTTTC AACCGGGGAT TCTGGCAAGG CGGCTATTAC CTGGGCGTGA AGTGGGGGGA ATGGAGCGGT TCCGCCAACA GCCGCGCTGC CCTGTTGAAG ATCCACATTG CCAGGGTAGA GAACTTTTAT AAGAAGAACG GGGTGGCGGC CCTGTTCCTG GAAGCCGGCG GCCTGTCCGC GGGGCAGACC ATCCTCATAA CAGGCCCCAC TACGGGAGCC GTCCGCATGG AAGTGGCAGC CATGCGGAGG GAGACGGCAG AGGGCATGGA GCCCGTAGAA GCCGCTCAAA AGGGAGAAAC CGTCTATCTG GCGGTTCCCG AACAGGTGCG CCGCCGGGAC AAGGTGTACC TGCTGCGCCC CCGGATGCTG GAGGATGCCT GA
|
Protein sequence | MPFQNRIEIM APAGSFESLA AALQGGADSV YFGVGKLNMR SRATVNFSEE DLPEIVERCH EAGAKAYLTL NIIVYDEELE AVHALCDAAR KAGVDAVIAS DLAVISYARS IGLEVHMSVQ ANVCNMASVK FYAQYADVVV LARELTLAQI RHIIESIRKE GVKGPSGELL RVEIFAHGAL CVAVSGKCHM SLAAYNSSAN RGACFQNCRR AYRVTDEETG NELVIDNKYV MSPKDLCTIP VLDQLLDAGV SVLKLEGRGR SSDYVRTVTS VYREAARACQ DGTFSADRAE AWMKRLESVF NRGFWQGGYY LGVKWGEWSG SANSRAALLK IHIARVENFY KKNGVAALFL EAGGLSAGQT ILITGPTTGA VRMEVAAMRR ETAEGMEPVE AAQKGETVYL AVPEQVRRRD KVYLLRPRML EDA
|
| |
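The sequence fields above are printed in space-separated blocks of ten characters, so they need cleaning before any calculation. The following is a minimal sketch, using only the Python standard library, of how the Gene sequence field could be normalized and its GC content computed for comparison with the GC content field. The helper names are hypothetical, and the short `fragment` is just the first three blocks of the gene sequence, used for illustration.

```python
# Stop codons under translation table 11 (the table listed in the record).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks and uppercase the sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Rough sanity check: length is a multiple of 3 and ends in a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First three blocks of the gene sequence, as printed in the record.
fragment = "ATGCCTTTTC AAAACCGGAT AGAAATCATG"
print(round(gc_percent(clean_sequence(fragment)), 1))
```

Pasting the full Gene sequence field into `clean_sequence` and then calling `gc_percent` and `looks_like_cds` gives values to compare against the GC content, Gene Length, and Type fields of the record.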