Gene Information |
Locus tag | Amuc_1995 |
Symbol | tnaA |
ID | 6274118 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 2422591 |
End bp | 2424054 |
Gene Length | 1464 bp |
Protein Length | 487 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642614055 |
Product | tryptophanase |
Protein accession | YP_001878587 |
Protein GI | 187736475 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3033] Tryptophanase |
TIGRFAM ID | |
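The coordinate and length fields above agree with each other, which a little arithmetic confirms. The following is a minimal Python sketch, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(2422591, 2424054)
print(length)                     # 1464
print(protein_length_aa(length))  # 487
```

The calculation does not depend on the strand: a gene on the minus strand spans the same number of bases between its start and end coordinates.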
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.383478 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 0.337663 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAATTCAG AACCATCCAA CGTTGTCAAA TTTTACAATG GGGAACAAAT TCCCCTGGAA CTGCATAAAG TCCGCGTGGT GCAGAAGCTG CATCTTGTCC CCGTGGAACG CCGCCTGGAA GCCGCACGGG AAGCCGGGTT CAACACCTTC CAGCTCAGCA CGAACGATGT CTATCTGGAC ATGCTGACGG ATTCCGGCGT CAACGCCATG AGCGACAACC AGATCGCCGC CATGTTCCGG GCGGATGACG CCTATGCCGG TTCCCAGAGC TTTGACCGCC TGAAGCAGGC TGTCCGGGAT GTCTTCGGCA AGGAATACCT TCTGCCCGCC CACCAGGGGC GCGCCTGTGA AAACATCATT GCCCGCACCT TCGTGAAGCC CGGGGACGTG GTTCCAATGA ACTACCATTT CACCACCACG CACGCCCATA TTGACCTGAA CGGCGGCAAG ATTGAGGAAC TGGTGGCTGA TGAAGCCGTC AATCCGGTCA GCACCAATCC CTTCAAGGGC AATCTGGACC CCGGCAAGCT GCGGGACTGC ATTGCCCGTC ACGGGGCGGG CAAAATCCCC TTCGTGCGCA TGGAAGCCTC AACCAACCTG ATTGGCGGCC AGCCCTTCTC CATTGCCAAC ATGCGGGAAA TCCGCGGCAT TTGCGATGAA TTCGGCATCA TGCTGGTGCT GGACGCCTCC CTGATCGGGG AAAACGCCTA CTTCATCAAG ATGCGTGAAG ACGAGTTCAG GGATGCGTCC TGCGCGGACA TCCTGAAGAC CATGTGCGGA CTGGCGGATC TGGTGTATTT TTCCGCCCGC AAGGTTTCCT CCTCCCGCGG CGGCGGCATC TGCACGAACG ACCGGGCCAT TGCCAAGAAA ATGGAGCATC TCGTTCCCCT CTTTGAAGGC TTTTTGACTT ATGGCGGCAT CTCCGTGCGG GAAATTGAGG CCATGGCCGT AGGCCTGTAT GAAACCACGG ATTTGACTGT GATTTCCCAG AGCCCCTCCT TCATTGAATA TTTCATCGGC CAGATGGTGG ACATGGGCAT TCCCTGCGTA ACCCCCGCTG GCGGTCTGGG CGCCCATATT GACGCAGGGC GTTTCCTGCC CCATATCCCG CAGGAGGACT ATCCCGCCGG GGCTCTGGCG GCGGCTTTCT TCATCGCCTC CGGCGTGCGC GGCATGGAAC GCGGCACGCT TTCCAGCGTC CGCGACGAAA AAGGCAAGGA CATTCTGGCG GATGTGGAAC TGCTCCGCCT GGCCTTCCCG CGCCGTGTTT TCACGTTATC CCAGGTGAAA TATGTGGCGG ACCGCATGAA GTGGCTCTAT GACAACCGCG ATTTGATTGG CGGCCTGGAA TTTGTGGAGG AACCGCCCGT CCTGCGCTTC TTCATGGGCA AGCTCCGCGC CAAGAGCGAC TGGCCTGAAA AACTGGCGGC CAAATACCGC CGGGACTTCG GGGAAAGCCT GTAA
Protein sequence | MNSEPSNVVK FYNGEQIPLE LHKVRVVQKL HLVPVERRLE AAREAGFNTF QLSTNDVYLD MLTDSGVNAM SDNQIAAMFR ADDAYAGSQS FDRLKQAVRD VFGKEYLLPA HQGRACENII ARTFVKPGDV VPMNYHFTTT HAHIDLNGGK IEELVADEAV NPVSTNPFKG NLDPGKLRDC IARHGAGKIP FVRMEASTNL IGGQPFSIAN MREIRGICDE FGIMLVLDAS LIGENAYFIK MREDEFRDAS CADILKTMCG LADLVYFSAR KVSSSRGGGI CTNDRAIAKK MEHLVPLFEG FLTYGGISVR EIEAMAVGLY ETTDLTVISQ SPSFIEYFIG QMVDMGIPCV TPAGGLGAHI DAGRFLPHIP QEDYPAGALA AAFFIASGVR GMERGTLSSV RDEKGKDILA DVELLRLAFP RRVFTLSQVK YVADRMKWLY DNRDLIGGLE FVEEPPVLRF FMGKLRAKSD WPEKLAAKYR RDFGESL
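Both sequences are printed in 10-character blocks, so the spacing has to be removed before any computation. The sketch below shows one way to do that, estimate the GC fraction, and translate the coding sequence. It relies on translation table 11 assigning codons to amino acids exactly as the standard code does (the two differ only in permitted start codons, which are not handled here). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout of the genetic code table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases of the gene sequence, as printed in the record.
print(translate("ATGAATTCAG AACCATCC"))  # MNSEPS
```

Applied to the full gene sequence, `translate` can be compared against the listed protein sequence, and `gc_fraction` against the reported GC content. Neither comparison has been run here.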