Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1897 |
Symbol | |
ID | 6273687 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 2303821 |
End bp | 2304906 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 642613958 |
Product | chorismate synthase |
Protein accession | YP_001878492 |
Protein GI | 187736380 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 62 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAGCA GTTTTGGTCA GGTGTTCAGA ATTTCTACCT GGGGTGAATC CCATGGGACT GGGGTAGGCG TGGTGATTGA TGGTTGCCCG TCCCTCGTCC CGGTGACGGA AGAAGACATT CAGCGGGAGC TGGACCGGCG CAGGCCGGGG CAGAGCGACA TCGTAACCCC CCGCAGGGAG GAAGACCGCG CGGAAATCCT TTCCGGAGTG CTGGACGGCA AAACCCTGGG AACGCCTATC GCCATCAGTG TCCGGAACAA GGACCACCGC TCTTCCGCCT ATGACGAGAT GGCCAGAACG TACCGGCCCT CCCACGCGGA CTATACATAC GACGCTAAAT ACGGCATTCG CGCCTGGGCG GGCGGGGGCC GGGCCTCCGC ACGGGAAACC ATCGGCCGCG TCGCAGCCGG AGCGGTGGCC AGGGCCGTGC TGAAGCAGGC TTTCCCCGAT ATGGAGGTCG TGGCCTGGGT GGATCAGGTT CACCATGTGA AAGCTTCCGT GGACTGGGGA GCCGTGACGG CCTCTGCCAT TGAGAGCAAC ATCGTCCGTA CGGCGGACCC CTCCGCTGCG GAAGCCATGA TCGCTGCCAT CAAGGAAGCT CGTGACTCCG GAAACTCCTT GGGCGGCGTG GTCAAATGCG TGGTGCGCGG CTGCCCTCCC GGACTGGGTG ATCCGGTTTT TGACAAGCTG GACGCTACGC TTGCCCACGC CATGATGAGC ATTCCCGCCA CCAAGGCTTT CGCCGTGGGT TCCGGTTTTG AAGCGGCGGA CATGACCGGC TTGGAACATA ATGACCCTTT TTACATGCAG GGCTGCCGGG TGCGTACTAC CACCAACCAC TCCGGCGGTA TTCAGGGCGG CATCTCCAAC GGAGAGGACA TTCTGATGCG CATCGGCTTC AAGCCTACGG CCACCTTGAT GATTGACCAG CAGACGGTCA ACAGGGACGG GGAGGATGCC CGGCTCAAGG GCAGGGGACG GCATGATGCC TGCGTACTGC CGCGCGCCGT GCCCATTGTG GAGGCCATGG CCTGGCTCTG CCTGTGCGAC CACTACCTGC GCCAACGCTG CCAGAGGGCT CTGTAA
|
Protein sequence | MSSSFGQVFR ISTWGESHGT GVGVVIDGCP SLVPVTEEDI QRELDRRRPG QSDIVTPRRE EDRAEILSGV LDGKTLGTPI AISVRNKDHR SSAYDEMART YRPSHADYTY DAKYGIRAWA GGGRASARET IGRVAAGAVA RAVLKQAFPD MEVVAWVDQV HHVKASVDWG AVTASAIESN IVRTADPSAA EAMIAAIKEA RDSGNSLGGV VKCVVRGCPP GLGDPVFDKL DATLAHAMMS IPATKAFAVG SGFEAADMTG LEHNDPFYMQ GCRVRTTTNH SGGIQGGISN GEDILMRIGF KPTATLMIDQ QTVNRDGEDA RLKGRGRHDA CVLPRAVPIV EAMAWLCLCD HYLRQRCQRA L
|
| |