Gene Amuc_1383 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1383 
Symbol 
ID6274625 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1649138 
End bp1651966 
Gene Length2829 bp 
Protein Length942 aa 
Translation table11 
GC content60% 
IMG OID642613440 
Productalanyl-tRNA synthetase 
Protein accessionYP_001877988 
Protein GI187735876 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0013] Alanyl-tRNA synthetase 
TIGRFAM ID[TIGR00344] alanine--tRNA ligase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0477098 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.00263773 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATGACCG CCACCGAGAT ACGCCAAAGC TTTCTGGACT TTTTCCGCGA AAAACAGCAC 
ACGGTCGTGC CTTCCGCTTC TTTGATGCCC CAGAGCCCCG GTTTGTTGTT TACAAATGCC
GGCATGAATC AGTTTGTCCC GTATTTCCTG GGCGTATGGA CTCCCCCGTG GACGCCCGCC
CGCGCTACGG ATACCCAGAA GTGCATCCGC GCAGGCGGCA AGCACAATGA CCTGGAGGAT
GTGGGGTATG ACTCCTACCA CCACACGTTT TTTGAAATGC TGGGGAACTG GTCCTTCGGG
GATTATTTCA AGAGGGAAGC TATCCGCTGG GCCTGGGAGC TGGTCGTGGA GCGGTGGGGA
TTCCCGGCGG AACGCCTGTA CGCCACCGTG TACGCGCCGG ACAAGAGCAA GGGCGACCCC
GGAGAGTTTG ACCGGGAAGC TTGGGATTTC TGGGCTGAGC TGTTCCGTTC CCGAGGGCTG
GACCCGGACG TGCATATCGT GCACGGGAAT GTGAAGGATA ATTTCTGGAT GATGGGGGAA
ACCGGCCCCT GCGGCCCCTG TTCCGAGCTG CACGTGGACC TGACCCCGGA GGGGAATACG
AAGGGAAGCC TGGTAAACAA GGATTCCGAC CAGTGCATAG AGATATGGAA CCTGGTGTTT
ATCCAGTACA ATGCGGAGAG CGACGGCTCC ATGCGCAATC TTCCGGCATG TCATGTGGAT
ACCGGCATGG GGTTTGAGCG CGCGTGCTCC ATCATGCAGT GCACGAACGG ATTCAAGGAT
TTTTCCCGCA AACCGTCCAA TTACGCCACG GATGTATTCC GCCCCCTGTT TGACCGCCTG
GAAGTTTTGA GCGGACGGAA GTACGCGGAC GTGTATCCGG CGCCCGGTTC CAAAAGGGTG
GATGCGGAGG ACGGGACCCT TCAGGAGGCG ATTGCCTTCC GCGTGATTGC CGATCATCTG
CGCACGCTCA GTTTTTCCAT TGCGGACGGC ATTCTGCCGG GCAACAATGG CCGTAATTAC
GTGCTGCGCC GCATTCTGCG CCGTGCCGTG CGCTATGGGC GCCGCCTGGG CTTTACCCAG
CCGTTTTTGG CGGAACTGGT GGATACGCTG GTGGAGTCCT TCGGACAGGT GTTCCCGGAA
CTGGCCGCCC GCGCCACTAC CGTGAAGGAG GTTTTGAACC GTGAAGAGGC CAGTTTTAAT
GAGACGCTGG ACCGCGGCCT GGAATTGTTT GACGCGGAAA CGGCTTCCGC CGGAAAGGTG
AGCGGCGAGT TCGCCTTCAA GCTGTATGAT ACGTACGGGT TCCCCATTGA CCTGACCGCC
CTGCTGGCGG AGGAACGCGG CCTGGATATT GATATGGAGC GGTTCAACAG GCTGATGGAG
GAACAGCGGG AACGCGCCCG GGCCGCCCGC AAGAGCGAGG TGGTGCGCGC CCTGGATTTG
AAGACGGACG CCGTGACGGA GTTTACGGGG TACGATGTGG ACGAATGCGC CGCTACGGTG
CTGGAAGTGA GCCGCCAGGG GGATTCCCTG TTCATCATCA CGGACAAGAC TCCGTTTTAC
GCGGAAATGG GCGGGCAGGT GTCCGATGCC GGGTTGATTG AAATCGGCGG GGAAAGCTAC
CATGTGATGG CCGTCCAGCA GATAGGGAAT GCCCGAGCCC ATGTGGTGGA GGCCCGTCCC
GGGCTGGAGG TGAAGCCCGG CGACCGCGTG CATTTGAGCA TTGACGCGGA ACGCCGCCGC
CGCATTGAGG CGCATCACAC CGCCACGCAT CTTCTTCACT GCGCTCTGCA TCAGGTGGTC
AGCCCGGATG CGGCCCAGCA GGGGTCCTTT GTTTCGGAAG ACCGGCTGCG CTTTGACTTT
AACAGCAGCG CCGTTTCTCC GGACCAGCTC CGCCTGATTG AAGAGAAGGT GAACGGCTGG
ATTGAGGAGT CTCTTCCCGT GCACTGCACG GAACGCGCTT ATGCGGACGT GAAGGGCAAT
GCCGCGATTG CCCAGTTCTT CGGCGACAAG TACGGGGATG TGGTGCGCGT GGTTCAGGTG
GGCGGATGCA GGGATGGGCT GGACGGGGTT TCCATGGAAT TCTGCGGCGG AACTCATATT
GCCAATACGA AGAATATCGG CCTGTTCAAG ATTAAGAGCG AGGGTGCCAT CGCTTCCGGC
GTGCGCCGCA TTGAGGCGAT GACTGGGGAC GCTGCTCTGG AAATGATACG GCAGCATGTT
GTTGCCAAGA GCCTGGAAAT CGCCAAGGCG GTGGAGAAGA TCAAGGAAGT TAATTACGAG
TTGGCGGACA TGGGGCTGGA ACAGGTGCCT GTCCCCACGA TTGAAGGCAA GCCGGGGCTG
ACGGCCCTGG GGGCTTCCGA TATCCGGACG GTAAATGATT CCCTGGCGCG TTTCGACGCC
TCCGTGGAGC ATTTCAAACA GACGGCTCTG GATGCGGAGA AGAAGCTTAA AAAAGCCCGC
GCCGGGCAGT CCGCCGCCAA GGCAGACGCC CTGCTGAATG AGTGGCTTTC CGATGCGCCT
TCTTCCCTGA TCCAGGTGGC GGAGGGCGCC GGGGAATTGC TTCAGGAACT GCTGAACGGG
TTGAAAAAGC GCCAGTATGC GGGCGCCGCC TTCCTGCTGT GCGTGGACAG TTCTTCCTTG
CTCCTGGGCG CTTATTGTGG CAAAGATGCC ATTGCGGACG GATTGTCCGC CGGAGATATG
ATCCGCGAGG TTGCCGCTCT TGCCGGAGGC AAGGGAGGCG GCCGTGCGGA TCAGGCCCGC
GGTTCCGCTC CGCAGGATGC CGATCCTCAG GCCCTGGCTG CCGCCGCCCG CAATATTATT
AACGGATAA
 
Protein sequence
MMTATEIRQS FLDFFREKQH TVVPSASLMP QSPGLLFTNA GMNQFVPYFL GVWTPPWTPA 
RATDTQKCIR AGGKHNDLED VGYDSYHHTF FEMLGNWSFG DYFKREAIRW AWELVVERWG
FPAERLYATV YAPDKSKGDP GEFDREAWDF WAELFRSRGL DPDVHIVHGN VKDNFWMMGE
TGPCGPCSEL HVDLTPEGNT KGSLVNKDSD QCIEIWNLVF IQYNAESDGS MRNLPACHVD
TGMGFERACS IMQCTNGFKD FSRKPSNYAT DVFRPLFDRL EVLSGRKYAD VYPAPGSKRV
DAEDGTLQEA IAFRVIADHL RTLSFSIADG ILPGNNGRNY VLRRILRRAV RYGRRLGFTQ
PFLAELVDTL VESFGQVFPE LAARATTVKE VLNREEASFN ETLDRGLELF DAETASAGKV
SGEFAFKLYD TYGFPIDLTA LLAEERGLDI DMERFNRLME EQRERARAAR KSEVVRALDL
KTDAVTEFTG YDVDECAATV LEVSRQGDSL FIITDKTPFY AEMGGQVSDA GLIEIGGESY
HVMAVQQIGN ARAHVVEARP GLEVKPGDRV HLSIDAERRR RIEAHHTATH LLHCALHQVV
SPDAAQQGSF VSEDRLRFDF NSSAVSPDQL RLIEEKVNGW IEESLPVHCT ERAYADVKGN
AAIAQFFGDK YGDVVRVVQV GGCRDGLDGV SMEFCGGTHI ANTKNIGLFK IKSEGAIASG
VRRIEAMTGD AALEMIRQHV VAKSLEIAKA VEKIKEVNYE LADMGLEQVP VPTIEGKPGL
TALGASDIRT VNDSLARFDA SVEHFKQTAL DAEKKLKKAR AGQSAAKADA LLNEWLSDAP
SSLIQVAEGA GELLQELLNG LKKRQYAGAA FLLCVDSSSL LLGAYCGKDA IADGLSAGDM
IREVAALAGG KGGGRADQAR GSAPQDADPQ ALAAAARNII NG