Gene Information |
Locus tag | Amuc_1946 |
Symbol | |
ID | 6275140 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 2364322 |
End bp | 2365239 |
Gene Length | 918 bp |
Protein Length | 305 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642614006 |
Product | dihydrodipicolinate synthetase |
Protein accession | YP_001878540 |
Protein GI | 187736428 |
COG category | [E] Amino acid transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase |
TIGRFAM ID | |
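The coordinate and length fields in this record are consistent with one another, which a little arithmetic confirms. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2364322, 2365239)
print(length)                     # 918
print(protein_length_aa(length))  # 305
```

Both results match the Gene Length and Protein Length fields listed above.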
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.191392 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.0845217 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACACGT CATTGAAACT CCACGGCCTG GTGGCCGCCG TACACACTCC GTTCAAGGCG GACGGTTCCC TGAACCCTTC CAGCGTTGAC GCCCAGGCCA AGTTGCTTGC CTCCCAGGGC ATCAAGCTTG CTTTCATTAC CGGCAGCACC GGGGAATCCT CCTCCATGCA GCTTGAAGAA CGCAAGGAAA TCTATTCTGC CTGGAAGGAA GCCTCCGCCA AGCATGGCGT GGAAGTTATC GCCCATACCG GTTCCAACAG CGTCTGGGAC GCCCGGGAAC TGGCCTCTTT TGCCCAGGAA TGCGGATTCG TGGCCACCAG TTCCCTGGCC CCGTCCTACT ACAAGCCCGG CACTGTTCAG CGCCTGGTGG AATGCTGCGC CTTCGCCGCC TCCGGCGCTC CCGACCTGCC CTATTATTAC TACGATATCC CCGTGCTGAC GGGCGTACGC TTCAATCCGG TGGATTTCAT CAGGCTGGCC AAGGAACAGA TTCCAAATTT CGCAGGCATC AAATTCACCA ATCCGGATCT GGCCCTGTAC CAGACCACGC TGAATTACGA CGAGACCGTG GATATTCCCT GGGGTGTGGA CGAATGGTTT ACGGGCGCCC TTTCCGTGGG GGCCAAGGGC GCTGTGGGGA GCTCCTTCAA CTTTGCTCCG GCCCTGTACC AGAAACTCAT GAAAGCCTTT GCGGAAGGCG ATGTGGAAAC GGCGCGCGAC TGCCAGTGGA AATCCGTTCA GATGATCAAT ATCCTGGCCT CCAAGGGCTA TATGGGCTGC GCCAAGGCTC TGATGGGCTG GCTGGGCGTC GATCTTGGCC CCGCCCGACT TCCGCAGGGC AACCCGACCG CAGATCAGCT GAAGGAACTC CGTTCCGAAC TGGAAGGCAT CGGCTTCTTC CAGTGGGCTT TAAACTGA
|
Protein sequence | MDTSLKLHGL VAAVHTPFKA DGSLNPSSVD AQAKLLASQG IKLAFITGST GESSSMQLEE RKEIYSAWKE ASAKHGVEVI AHTGSNSVWD ARELASFAQE CGFVATSSLA PSYYKPGTVQ RLVECCAFAA SGAPDLPYYY YDIPVLTGVR FNPVDFIRLA KEQIPNFAGI KFTNPDLALY QTTLNYDETV DIPWGVDEWF TGALSVGAKG AVGSSFNFAP ALYQKLMKAF AEGDVETARD CQWKSVQMIN ILASKGYMGC AKALMGWLGV DLGPARLPQG NPTADQLKEL RSELEGIGFF QWALN
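The gene and protein sequences can be related by translating the nucleotide sequence codon by codon. The sketch below is illustrative only: it assumes the space-separated 10-base blocks are purely a display convention, and it builds the codon table from the standard amino acid assignments, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not handle). The names `translate`, `gc_fraction`, and `clean` are hypothetical helpers, not part of any database tool.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence in this record.
prefix = "ATGGACACGT CATTGAAACT CCACGGCCTG"
print(translate(prefix))  # MDTSLKLHGL
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein sequence and GC content fields to be checked in the same way.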