Gene Amuc_1995 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1995 
SymboltnaA 
ID6274118 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2422591 
End bp2424054 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content59% 
IMG OID642614055 
Producttryptophanase 
Protein accessionYP_001878587 
Protein GI187736475 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3033] Tryptophanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.383478 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value0.337663 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTCAG AACCATCCAA CGTTGTCAAA TTTTACAATG GGGAACAAAT TCCCCTGGAA 
CTGCATAAAG TCCGCGTGGT GCAGAAGCTG CATCTTGTCC CCGTGGAACG CCGCCTGGAA
GCCGCACGGG AAGCCGGGTT CAACACCTTC CAGCTCAGCA CGAACGATGT CTATCTGGAC
ATGCTGACGG ATTCCGGCGT CAACGCCATG AGCGACAACC AGATCGCCGC CATGTTCCGG
GCGGATGACG CCTATGCCGG TTCCCAGAGC TTTGACCGCC TGAAGCAGGC TGTCCGGGAT
GTCTTCGGCA AGGAATACCT TCTGCCCGCC CACCAGGGGC GCGCCTGTGA AAACATCATT
GCCCGCACCT TCGTGAAGCC CGGGGACGTG GTTCCAATGA ACTACCATTT CACCACCACG
CACGCCCATA TTGACCTGAA CGGCGGCAAG ATTGAGGAAC TGGTGGCTGA TGAAGCCGTC
AATCCGGTCA GCACCAATCC CTTCAAGGGC AATCTGGACC CCGGCAAGCT GCGGGACTGC
ATTGCCCGTC ACGGGGCGGG CAAAATCCCC TTCGTGCGCA TGGAAGCCTC AACCAACCTG
ATTGGCGGCC AGCCCTTCTC CATTGCCAAC ATGCGGGAAA TCCGCGGCAT TTGCGATGAA
TTCGGCATCA TGCTGGTGCT GGACGCCTCC CTGATCGGGG AAAACGCCTA CTTCATCAAG
ATGCGTGAAG ACGAGTTCAG GGATGCGTCC TGCGCGGACA TCCTGAAGAC CATGTGCGGA
CTGGCGGATC TGGTGTATTT TTCCGCCCGC AAGGTTTCCT CCTCCCGCGG CGGCGGCATC
TGCACGAACG ACCGGGCCAT TGCCAAGAAA ATGGAGCATC TCGTTCCCCT CTTTGAAGGC
TTTTTGACTT ATGGCGGCAT CTCCGTGCGG GAAATTGAGG CCATGGCCGT AGGCCTGTAT
GAAACCACGG ATTTGACTGT GATTTCCCAG AGCCCCTCCT TCATTGAATA TTTCATCGGC
CAGATGGTGG ACATGGGCAT TCCCTGCGTA ACCCCCGCTG GCGGTCTGGG CGCCCATATT
GACGCAGGGC GTTTCCTGCC CCATATCCCG CAGGAGGACT ATCCCGCCGG GGCTCTGGCG
GCGGCTTTCT TCATCGCCTC CGGCGTGCGC GGCATGGAAC GCGGCACGCT TTCCAGCGTC
CGCGACGAAA AAGGCAAGGA CATTCTGGCG GATGTGGAAC TGCTCCGCCT GGCCTTCCCG
CGCCGTGTTT TCACGTTATC CCAGGTGAAA TATGTGGCGG ACCGCATGAA GTGGCTCTAT
GACAACCGCG ATTTGATTGG CGGCCTGGAA TTTGTGGAGG AACCGCCCGT CCTGCGCTTC
TTCATGGGCA AGCTCCGCGC CAAGAGCGAC TGGCCTGAAA AACTGGCGGC CAAATACCGC
CGGGACTTCG GGGAAAGCCT GTAA
 
Protein sequence
MNSEPSNVVK FYNGEQIPLE LHKVRVVQKL HLVPVERRLE AAREAGFNTF QLSTNDVYLD 
MLTDSGVNAM SDNQIAAMFR ADDAYAGSQS FDRLKQAVRD VFGKEYLLPA HQGRACENII
ARTFVKPGDV VPMNYHFTTT HAHIDLNGGK IEELVADEAV NPVSTNPFKG NLDPGKLRDC
IARHGAGKIP FVRMEASTNL IGGQPFSIAN MREIRGICDE FGIMLVLDAS LIGENAYFIK
MREDEFRDAS CADILKTMCG LADLVYFSAR KVSSSRGGGI CTNDRAIAKK MEHLVPLFEG
FLTYGGISVR EIEAMAVGLY ETTDLTVISQ SPSFIEYFIG QMVDMGIPCV TPAGGLGAHI
DAGRFLPHIP QEDYPAGALA AAFFIASGVR GMERGTLSSV RDEKGKDILA DVELLRLAFP
RRVFTLSQVK YVADRMKWLY DNRDLIGGLE FVEEPPVLRF FMGKLRAKSD WPEKLAAKYR
RDFGESL