Gene Amuc_1946 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1946 
Symbol 
ID6275140 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2364322 
End bp2365239 
Gene Length918 bp 
Protein Length305 aa 
Translation table11 
GC content59% 
IMG OID642614006 
Productdihydrodipicolinate synthetase 
Protein accessionYP_001878540 
Protein GI187736428 
COG category[E] Amino acid transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.191392 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.0845217 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACACGT CATTGAAACT CCACGGCCTG GTGGCCGCCG TACACACTCC GTTCAAGGCG 
GACGGTTCCC TGAACCCTTC CAGCGTTGAC GCCCAGGCCA AGTTGCTTGC CTCCCAGGGC
ATCAAGCTTG CTTTCATTAC CGGCAGCACC GGGGAATCCT CCTCCATGCA GCTTGAAGAA
CGCAAGGAAA TCTATTCTGC CTGGAAGGAA GCCTCCGCCA AGCATGGCGT GGAAGTTATC
GCCCATACCG GTTCCAACAG CGTCTGGGAC GCCCGGGAAC TGGCCTCTTT TGCCCAGGAA
TGCGGATTCG TGGCCACCAG TTCCCTGGCC CCGTCCTACT ACAAGCCCGG CACTGTTCAG
CGCCTGGTGG AATGCTGCGC CTTCGCCGCC TCCGGCGCTC CCGACCTGCC CTATTATTAC
TACGATATCC CCGTGCTGAC GGGCGTACGC TTCAATCCGG TGGATTTCAT CAGGCTGGCC
AAGGAACAGA TTCCAAATTT CGCAGGCATC AAATTCACCA ATCCGGATCT GGCCCTGTAC
CAGACCACGC TGAATTACGA CGAGACCGTG GATATTCCCT GGGGTGTGGA CGAATGGTTT
ACGGGCGCCC TTTCCGTGGG GGCCAAGGGC GCTGTGGGGA GCTCCTTCAA CTTTGCTCCG
GCCCTGTACC AGAAACTCAT GAAAGCCTTT GCGGAAGGCG ATGTGGAAAC GGCGCGCGAC
TGCCAGTGGA AATCCGTTCA GATGATCAAT ATCCTGGCCT CCAAGGGCTA TATGGGCTGC
GCCAAGGCTC TGATGGGCTG GCTGGGCGTC GATCTTGGCC CCGCCCGACT TCCGCAGGGC
AACCCGACCG CAGATCAGCT GAAGGAACTC CGTTCCGAAC TGGAAGGCAT CGGCTTCTTC
CAGTGGGCTT TAAACTGA
 
Protein sequence
MDTSLKLHGL VAAVHTPFKA DGSLNPSSVD AQAKLLASQG IKLAFITGST GESSSMQLEE 
RKEIYSAWKE ASAKHGVEVI AHTGSNSVWD ARELASFAQE CGFVATSSLA PSYYKPGTVQ
RLVECCAFAA SGAPDLPYYY YDIPVLTGVR FNPVDFIRLA KEQIPNFAGI KFTNPDLALY
QTTLNYDETV DIPWGVDEWF TGALSVGAKG AVGSSFNFAP ALYQKLMKAF AEGDVETARD
CQWKSVQMIN ILASKGYMGC AKALMGWLGV DLGPARLPQG NPTADQLKEL RSELEGIGFF
QWALN