Gene Information |
Locus tag | Amuc_1383 |
Symbol | |
ID | 6274625 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 1649138 |
End bp | 1651966 |
Gene Length | 2829 bp |
Protein Length | 942 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642613440 |
Product | alanyl-tRNA synthetase |
Protein accession | YP_001877988 |
Protein GI | 187735876 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0013] Alanyl-tRNA synthetase |
TIGRFAM ID | [TIGR00344] alanine--tRNA ligase |
|
|
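The coordinate and length fields in the table above are consistent with each other, and the short check below makes that arithmetic explicit. It is only a sketch: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported protein length excludes the stop codon. Neither convention is stated in the record itself. The variable names are illustrative.

```python
# Values copied from the Gene Information table.
start_bp = 1649138
end_bp = 1651966
reported_gene_length = 2829      # bp
reported_protein_length = 942    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, gene_length == reported_gene_length)           # 2829 True
print(protein_length, protein_length == reported_protein_length)  # 942 True
```

Under these assumptions the 2829 bp gene holds 943 codons, of which 942 encode amino acids.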
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0477098 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.00263773 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATGACCG CCACCGAGAT ACGCCAAAGC TTTCTGGACT TTTTCCGCGA AAAACAGCAC ACGGTCGTGC CTTCCGCTTC TTTGATGCCC CAGAGCCCCG GTTTGTTGTT TACAAATGCC GGCATGAATC AGTTTGTCCC GTATTTCCTG GGCGTATGGA CTCCCCCGTG GACGCCCGCC CGCGCTACGG ATACCCAGAA GTGCATCCGC GCAGGCGGCA AGCACAATGA CCTGGAGGAT GTGGGGTATG ACTCCTACCA CCACACGTTT TTTGAAATGC TGGGGAACTG GTCCTTCGGG GATTATTTCA AGAGGGAAGC TATCCGCTGG GCCTGGGAGC TGGTCGTGGA GCGGTGGGGA TTCCCGGCGG AACGCCTGTA CGCCACCGTG TACGCGCCGG ACAAGAGCAA GGGCGACCCC GGAGAGTTTG ACCGGGAAGC TTGGGATTTC TGGGCTGAGC TGTTCCGTTC CCGAGGGCTG GACCCGGACG TGCATATCGT GCACGGGAAT GTGAAGGATA ATTTCTGGAT GATGGGGGAA ACCGGCCCCT GCGGCCCCTG TTCCGAGCTG CACGTGGACC TGACCCCGGA GGGGAATACG AAGGGAAGCC TGGTAAACAA GGATTCCGAC CAGTGCATAG AGATATGGAA CCTGGTGTTT ATCCAGTACA ATGCGGAGAG CGACGGCTCC ATGCGCAATC TTCCGGCATG TCATGTGGAT ACCGGCATGG GGTTTGAGCG CGCGTGCTCC ATCATGCAGT GCACGAACGG ATTCAAGGAT TTTTCCCGCA AACCGTCCAA TTACGCCACG GATGTATTCC GCCCCCTGTT TGACCGCCTG GAAGTTTTGA GCGGACGGAA GTACGCGGAC GTGTATCCGG CGCCCGGTTC CAAAAGGGTG GATGCGGAGG ACGGGACCCT TCAGGAGGCG ATTGCCTTCC GCGTGATTGC CGATCATCTG CGCACGCTCA GTTTTTCCAT TGCGGACGGC ATTCTGCCGG GCAACAATGG CCGTAATTAC GTGCTGCGCC GCATTCTGCG CCGTGCCGTG CGCTATGGGC GCCGCCTGGG CTTTACCCAG CCGTTTTTGG CGGAACTGGT GGATACGCTG GTGGAGTCCT TCGGACAGGT GTTCCCGGAA CTGGCCGCCC GCGCCACTAC CGTGAAGGAG GTTTTGAACC GTGAAGAGGC CAGTTTTAAT GAGACGCTGG ACCGCGGCCT GGAATTGTTT GACGCGGAAA CGGCTTCCGC CGGAAAGGTG AGCGGCGAGT TCGCCTTCAA GCTGTATGAT ACGTACGGGT TCCCCATTGA CCTGACCGCC CTGCTGGCGG AGGAACGCGG CCTGGATATT GATATGGAGC GGTTCAACAG GCTGATGGAG GAACAGCGGG AACGCGCCCG GGCCGCCCGC AAGAGCGAGG TGGTGCGCGC CCTGGATTTG AAGACGGACG CCGTGACGGA GTTTACGGGG TACGATGTGG ACGAATGCGC CGCTACGGTG CTGGAAGTGA GCCGCCAGGG GGATTCCCTG TTCATCATCA CGGACAAGAC TCCGTTTTAC GCGGAAATGG GCGGGCAGGT GTCCGATGCC GGGTTGATTG AAATCGGCGG GGAAAGCTAC CATGTGATGG CCGTCCAGCA GATAGGGAAT GCCCGAGCCC ATGTGGTGGA GGCCCGTCCC GGGCTGGAGG TGAAGCCCGG CGACCGCGTG CATTTGAGCA TTGACGCGGA ACGCCGCCGC CGCATTGAGG CGCATCACAC CGCCACGCAT CTTCTTCACT GCGCTCTGCA TCAGGTGGTC 
AGCCCGGATG CGGCCCAGCA GGGGTCCTTT GTTTCGGAAG ACCGGCTGCG CTTTGACTTT AACAGCAGCG CCGTTTCTCC GGACCAGCTC CGCCTGATTG AAGAGAAGGT GAACGGCTGG ATTGAGGAGT CTCTTCCCGT GCACTGCACG GAACGCGCTT ATGCGGACGT GAAGGGCAAT GCCGCGATTG CCCAGTTCTT CGGCGACAAG TACGGGGATG TGGTGCGCGT GGTTCAGGTG GGCGGATGCA GGGATGGGCT GGACGGGGTT TCCATGGAAT TCTGCGGCGG AACTCATATT GCCAATACGA AGAATATCGG CCTGTTCAAG ATTAAGAGCG AGGGTGCCAT CGCTTCCGGC GTGCGCCGCA TTGAGGCGAT GACTGGGGAC GCTGCTCTGG AAATGATACG GCAGCATGTT GTTGCCAAGA GCCTGGAAAT CGCCAAGGCG GTGGAGAAGA TCAAGGAAGT TAATTACGAG TTGGCGGACA TGGGGCTGGA ACAGGTGCCT GTCCCCACGA TTGAAGGCAA GCCGGGGCTG ACGGCCCTGG GGGCTTCCGA TATCCGGACG GTAAATGATT CCCTGGCGCG TTTCGACGCC TCCGTGGAGC ATTTCAAACA GACGGCTCTG GATGCGGAGA AGAAGCTTAA AAAAGCCCGC GCCGGGCAGT CCGCCGCCAA GGCAGACGCC CTGCTGAATG AGTGGCTTTC CGATGCGCCT TCTTCCCTGA TCCAGGTGGC GGAGGGCGCC GGGGAATTGC TTCAGGAACT GCTGAACGGG TTGAAAAAGC GCCAGTATGC GGGCGCCGCC TTCCTGCTGT GCGTGGACAG TTCTTCCTTG CTCCTGGGCG CTTATTGTGG CAAAGATGCC ATTGCGGACG GATTGTCCGC CGGAGATATG ATCCGCGAGG TTGCCGCTCT TGCCGGAGGC AAGGGAGGCG GCCGTGCGGA TCAGGCCCGC GGTTCCGCTC CGCAGGATGC CGATCCTCAG GCCCTGGCTG CCGCCGCCCG CAATATTATT AACGGATAA
|
Protein sequence | MMTATEIRQS FLDFFREKQH TVVPSASLMP QSPGLLFTNA GMNQFVPYFL GVWTPPWTPA RATDTQKCIR AGGKHNDLED VGYDSYHHTF FEMLGNWSFG DYFKREAIRW AWELVVERWG FPAERLYATV YAPDKSKGDP GEFDREAWDF WAELFRSRGL DPDVHIVHGN VKDNFWMMGE TGPCGPCSEL HVDLTPEGNT KGSLVNKDSD QCIEIWNLVF IQYNAESDGS MRNLPACHVD TGMGFERACS IMQCTNGFKD FSRKPSNYAT DVFRPLFDRL EVLSGRKYAD VYPAPGSKRV DAEDGTLQEA IAFRVIADHL RTLSFSIADG ILPGNNGRNY VLRRILRRAV RYGRRLGFTQ PFLAELVDTL VESFGQVFPE LAARATTVKE VLNREEASFN ETLDRGLELF DAETASAGKV SGEFAFKLYD TYGFPIDLTA LLAEERGLDI DMERFNRLME EQRERARAAR KSEVVRALDL KTDAVTEFTG YDVDECAATV LEVSRQGDSL FIITDKTPFY AEMGGQVSDA GLIEIGGESY HVMAVQQIGN ARAHVVEARP GLEVKPGDRV HLSIDAERRR RIEAHHTATH LLHCALHQVV SPDAAQQGSF VSEDRLRFDF NSSAVSPDQL RLIEEKVNGW IEESLPVHCT ERAYADVKGN AAIAQFFGDK YGDVVRVVQV GGCRDGLDGV SMEFCGGTHI ANTKNIGLFK IKSEGAIASG VRRIEAMTGD AALEMIRQHV VAKSLEIAKA VEKIKEVNYE LADMGLEQVP VPTIEGKPGL TALGASDIRT VNDSLARFDA SVEHFKQTAL DAEKKLKKAR AGQSAAKADA LLNEWLSDAP SSLIQVAEGA GELLQELLNG LKKRQYAGAA FLLCVDSSSL LLGAYCGKDA IADGLSAGDM IREVAALAGG KGGGRADQAR GSAPQDADPQ ALAAAARNII NG
|
| |
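As a spot check on how the gene sequence relates to the protein sequence, the sketch below translates the first three 10-base blocks and the last nine bases of the gene sequence shown above. It assumes translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which start codons are permitted), so a standard codon table is used. The `translate` helper and the constant names are illustrative and do not come from the record.

```python
from itertools import product

# Standard codon table in the conventional TCAG ordering.
# Assumption: table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is removed first because the record prints the sequence
    in space-separated blocks of 10 bases.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 9 bases, copied from the Gene sequence field.
gene_start = "ATGATGACCG CCACCGAGAT ACGCCAAAGC"
gene_end = "AACGGATAA"

print(translate(gene_start))  # MMTATEIRQS
print(translate(gene_end))    # NG
```

Both fragments agree with the start (`MMTATEIRQS`) and end (`NG`) of the protein sequence listed above. Translating the full gene would need the two sequence lines joined into one string before the call.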