Gene Amuc_0140 details


Gene Information

Locus tag: Amuc_0140
Symbol:
ID: 6274801
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 172287
End bp: 173468
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 57%
IMG OID: 642612185
Product: tyrosyl-tRNA synthetase
Protein accession: YP_001876765
Protein GI: 187734653
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0162] Tyrosyl-tRNA synthetase
TIGRFAM ID: [TIGR00234] tyrosyl-tRNA synthetase
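
The start, end, gene length and protein length values above are consistent with each other. The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the gene length counts the terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_len_bp // 3 - 1


length = gene_length(172287, 173468)
print(length)                  # 1182
print(protein_length(length))  # 393
```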


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.485656
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 73
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTATAG ATGAGCAATT AGACATATTG ATGGGCGGTA CCGCCGTCGT GATCAGCCGC 
GAAGAGCTGA AGGAGCGTCT CAAGCTGGGC CGCCCCCTGC GCGTGAAGCT GGGCGTGGAC
CCTACTGCGC CGGACATCCA CCTGGGCCAT ACCGTGGCTA TTGAGAAATT GCGCCAGTTC
CAGGAACTTG GCCACCAGGC TGTTTTGCTC ATCGGGGATT TCACCGCCAC GATCGGCGAC
CCTTCCGGCC GTTCCGTGAC CCGCCCCCCC CTTTCCCGTG AACAGGTGCT GGAGAATGCG
GAGACATATA CCAAGCAGGC GTTCAAGATT CTGGACCGTG ACAAGACGGA GATCGTGTAT
AATGGGGACT GGTTCCGCAA GATGACGTAT GAGGAGGTGC TGAAGCTTAA TTCCCGCGTG
ACCATGCAGC AGATGCTGGC CCGGGAGGAT TTCAAGGCCC GTGTGGAGGG AGGTAAGGAG
GTGCGCCTGC ATGAGATGCA GTATCCGATT ATGCAGGGCT GGGATTCCGT GGAAATCCGT
GCGGACGTGG AACTGGGCGG GACGGACCAG CTTTTCAACA TCCTGGTGGG CCGCGACCTT
CAGAAGGAGG AAGGCATGTT GCCGCAGATC GCCATGACGA TGCCTCTTCT GGAAGGTCTG
GACGGCGTTC GGAAGATGTC CAAGTCCTAC GGGAATTACG TGGGCGTGGA TGAGTCTCCG
GAGATGATGT TCGGCAAGAT GATGAGCGCC AGCGACGAAC TGATGGACCG TTATTACCTG
GTGCTGCTGG GTGAGAAGCG GGACATGGGA TTGCATCCGA TGGAAGCCAA AAAGCTCCTG
GCCTGGAAAA TCACGGCACG CTATCATGAT TCCGCCGCTG CGGATGCCGC GCGTTCTGAC
TGGGAAACCC GTTTTTCCAA GAGGGATTTG GCTGCCGCGG ATTTGCCGGA AGTGGAGATT
GCCTCCCTGC CTGCCGACAT GAATGCCCTG GCCCTGGTTT CCTTCCTGTT TGAGAATGTT
TTCCAGGTGA AAAAATCCAA TGGCGTTCTC CGCAAGGAGC ATTTCACGCC CGGCGCTATC
CAGTTGAATG ATGTGAAAAT GACAGACCCC TCCGCCGTTT TGGAACTGGC TCCGGGCAGC
ATCCTGCGCC TGAGCAAGAA GCATGCTGTG CGTTTCAAAT AG
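
The gene sequence is printed in space-separated blocks of ten bases, so the grouping has to be stripped before any length or composition check. The sketch below shows one way to do that and to compute a GC percentage for comparison with the GC content reported in the record. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks that group bases into blocks of ten."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, as printed in the record.
first_line = "ATGACTATAG ATGAGCAATT AGACATATTG ATGGGCGGTA CCGCCGTCGT GATCAGCCGC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full 1182 bp sequence through the same two functions would give a value to compare against the reported GC content.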
 
Protein sequence
MTIDEQLDIL MGGTAVVISR EELKERLKLG RPLRVKLGVD PTAPDIHLGH TVAIEKLRQF 
QELGHQAVLL IGDFTATIGD PSGRSVTRPP LSREQVLENA ETYTKQAFKI LDRDKTEIVY
NGDWFRKMTY EEVLKLNSRV TMQQMLARED FKARVEGGKE VRLHEMQYPI MQGWDSVEIR
ADVELGGTDQ LFNILVGRDL QKEEGMLPQI AMTMPLLEGL DGVRKMSKSY GNYVGVDESP
EMMFGKMMSA SDELMDRYYL VLLGEKRDMG LHPMEAKKLL AWKITARYHD SAAADAARSD
WETRFSKRDL AAADLPEVEI ASLPADMNAL ALVSFLFENV FQVKKSNGVL RKEHFTPGAI
QLNDVKMTDP SAVLELAPGS ILRLSKKHAV RFK
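
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The record lists translation table 11, whose codon-to-amino-acid assignments match the standard code (the tables differ only in permitted start codons). The sketch below assumes an ATG start, as in this gene, and does not handle alternative start codons. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in TCAG order; shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # drop the ten-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 12 bases of the gene sequence above.
print(translate("ATGACTATAG ATGAGCAATT AGACATATTG"))  # MTIDEQLDIL
print(translate("CGTTTCAAAT AG"))                     # RFK
```

Both fragments reproduce the corresponding ends of the protein sequence shown above.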