Gene Information |
Locus tag | Amuc_2040 |
Symbol | |
ID | 6273697 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 2477612 |
End bp | 2479792 |
Gene Length | 2181 bp |
Protein Length | 726 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642614101 |
Product | Oligopeptidase A |
Protein accession | YP_001878631 |
Protein GI | 187736519 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0339] Zn-dependent oligopeptidases |
TIGRFAM ID | |
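The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions are consistent with the listed values but are not stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2477612
end_bp = 2479792


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a CDS, assuming it ends in one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the stop codon


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 2181
print(protein_length(length_bp))  # 726
```

Both results agree with the Gene Length (2181 bp) and Protein Length (726 aa) fields.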
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0328478 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.00152686 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCCGCCA TGAGTTTACT CAGGCAGTCT CTTCATCTCT TTTTCCTCTT CCTCTCCGTG GCCCTTCCCT CCCCGGCGGC CACTTCCGCC CACCCTTTCC TGGACAGGGA GCGTCCCATC CGCTGGAGCC GGCTGACCCA GGACAAGCTG GAACCGGATA TTCAGGAAGC CATGCGCCTC ACCCGGACAG CCATAGAGGA AATCAGCCGC CTCCGTCCGG AAGAAATGAC GTATGAAAAT ACGTTCGGCG CGCTGGAGAA AAGCAATGAC CTCCTTACGG AAGGCATGTG CAAGGCTTAT GTCCTGAAAA GCCTGTGCGA CAGCGGGGAA CTTCGCAAGG CCATGGATTC GGTCGCTCCC CGCGTGTCCG CCTTCCTTTC CTCCGTCACG AAAGACCAGG CCCTGTGGAA AGTCCTGAAA ACCGCGGAGG AACGCCTGCG GCAAACTCAC CTGAACCCCG AACAGGAACG GTACATGGAG CTAAGCATGC AGAGCTTCCG GGATAACGGC GCTGATCTGC CGCCGGACAA GCGCGCACGG CTTGAGTCCA TTGACAGGGA ACTGACGCTC GCCTCCCAGC GTTTCAACAA TTTGTATATG GATGCCAGGA AATCCTGGAC CTGGACGGTG CGGAACGCCG CCCTTCTGGA AGGGATAGAC GGAAGCGCCC TGCAGCAGGC GCGTGAAGAA TTTCTGAAGC GCCAGCCCGG CCAGTCCGGT CCGGGCTGGA CGTTCACGCT TGATTCCGCG GCTTCCGCCC GGGTCATGGA AAAAGCCCGG AGGGAAGAAT GCCGGAAAGA TTTATGGGAA CACCGCCAGT CTCTGGCTAC GGGAACATGC GACACGGAAC CCGTCATCAG GGAAATCCTC TCCCTGCGCC GTGAAAAGGC GCATTTGTGC GGATATAAGG AATACCCGGA TTACGCCCTG CGCGAGAGCA TGGCGGAAAA CGGGGAAAAC GCCATAAAGT TCGTCAATGA GCTGCTGGAC AAGATCAAGG CCCCCTTTTT CCGTGAAATG GAAACGCTCC GCAGCCTGAA GGCCCGCCTT ACGGGGCAGG AAAACGCCCG CCTGAATCCC TGGGACGTGG CATATTACGC CAATCTCCGG GCAGAAGAAC ATTTCCGGCT GGACCAGGAG GAGCTGCGCC GCCACTTTCC CCTTCCCCGC GTGCTGGACG GCCTGTTTTC CCTGGCGGAG CGGCTGTACG GCATCCGCGT GAAGGAAGTT CCGGCGCGGC AGTCCCTCTC CGGCATCCCG GCAGGCGAAT CCGCCGGAAC CGTGGAAGTA TGGCATCCGG ACGTACGCTT TTTCACGATT GACGACGGCA ACGGCAACCA ACTGGGTTCC TTTTACCTGG ATCTTTTCTC CCGTAGCAAT AAACGGGCCG GAGCGTGGAT GAACACCCTG GATACGGGCA GCCCGTCCAC GCCGGAGACC CCGGGCAAGC CGCGCCTGGG CATGGTCTGC CTCAATATCC ATCCCCCCGC GGCAGGAGAC ACGGTGATAC TGTCCCACCG GGAAGTCAGG ACTTTATTCC ATGAGTTCGG CCACTTGCTG CACCTGATGT TTACCAGGGT TTCCATTCCT TCCCTGGCGG GGACCAGCGT GCCGCGGGAT TTTGTGGAAG TCCCCTCCCA ATTCATGGAA AACTGGTGCT GGCGGCCGGA CGTGCTGAAA AGCTTTGCGC GCCATGAACG AACGGGACTC CCCATTCCGG AGGAAATGCT GAACTCTCTG GACGCCTCCC GCGGCAATAC GCCTGCCCTC GCGCTGGCCG GGCAGCTCCT GTACGCAAAG 
ATGGACCTGG CCGTGCATTC GGAACCGGAA CGCTTCTCCG CCGGCTCTCT TGATGATGTG GATTCCGCCG TAGCGGGAGA TATGGATTAT TTCAAAGATT TCAAAAGAGC CGGCAAGCTG CGCACGGCGC GTCACTTGTT TTCCTCTCCT GCGGGCTATG CCTCCTTTTA TTTCTCCTAC CAATGGGCGG AAGTTTTGGA CAAAGATATT TTTGAAGCCT TTGAACGGGC CGGAGGCCAG GACAGGGAAA CGGCAGGAAA ATTCCGGAAA ACCATTCTGG AAAAAGGCTA TGCTGTCCCG CCCATGCGGC AGTTCATGGA TTTCATGGGA AGAAAGCCGC GCATGGACGC CATGCTCCGC AAGAGGCGGC TGGCATCCTG A
|
Protein sequence | MAAMSLLRQS LHLFFLFLSV ALPSPAATSA HPFLDRERPI RWSRLTQDKL EPDIQEAMRL TRTAIEEISR LRPEEMTYEN TFGALEKSND LLTEGMCKAY VLKSLCDSGE LRKAMDSVAP RVSAFLSSVT KDQALWKVLK TAEERLRQTH LNPEQERYME LSMQSFRDNG ADLPPDKRAR LESIDRELTL ASQRFNNLYM DARKSWTWTV RNAALLEGID GSALQQAREE FLKRQPGQSG PGWTFTLDSA ASARVMEKAR REECRKDLWE HRQSLATGTC DTEPVIREIL SLRREKAHLC GYKEYPDYAL RESMAENGEN AIKFVNELLD KIKAPFFREM ETLRSLKARL TGQENARLNP WDVAYYANLR AEEHFRLDQE ELRRHFPLPR VLDGLFSLAE RLYGIRVKEV PARQSLSGIP AGESAGTVEV WHPDVRFFTI DDGNGNQLGS FYLDLFSRSN KRAGAWMNTL DTGSPSTPET PGKPRLGMVC LNIHPPAAGD TVILSHREVR TLFHEFGHLL HLMFTRVSIP SLAGTSVPRD FVEVPSQFME NWCWRPDVLK SFARHERTGL PIPEEMLNSL DASRGNTPAL ALAGQLLYAK MDLAVHSEPE RFSAGSLDDV DSAVAGDMDY FKDFKRAGKL RTARHLFSSP AGYASFYFSY QWAEVLDKDI FEAFERAGGQ DRETAGKFRK TILEKGYAVP PMRQFMDFMG RKPRMDAMLR KRRLAS
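As a spot check that the protein sequence corresponds to the gene sequence, the first ten and last seven codons can be translated by hand. This is a minimal sketch, not the pipeline that produced the record. It assumes the standard amino acid assignments, which translation table 11 shares with the standard code; table 11 differs in its permitted start codons, which this sketch does not model. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons in TCAG order, paired with their standard amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 and last 21 bases of the gene sequence, copied from the record.
first_30 = "ATGGCCGCCA TGAGTTTACT CAGGCAGTCT"
last_21 = "AAGAGGCGGC TGGCATCCTG A"

print(translate(first_30))  # MAAMSLLRQS
print(translate(last_21))   # KRRLAS*
```

The output matches the start (MAAMSLLRQS) and end (KRRLAS) of the listed protein sequence, with `*` marking the TGA stop codon.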