Gene Amuc_0720 details

Gene Information

Locus tag: Amuc_0720
Symbol:
ID: 6273856
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 849108
End bp: 850397
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 55%
IMG OID: 642612772
Product: diaminopimelate decarboxylase
Protein accession: YP_001877338
Protein GI: 187735226
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0019] Diaminopimelate decarboxylase
TIGRFAM ID: [TIGR01048] diaminopimelate decarboxylase
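
The gene and protein lengths in this record follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 849108
end_bp = 850397

gene_length = end_bp - start_bp + 1   # both endpoints are counted
codons = gene_length // 3             # includes the terminal stop codon
protein_length = codons - 1           # the stop codon is not translated

print(gene_length, protein_length)    # 1290 429
```

Both values agree with the Gene Length and Protein Length fields of the record.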


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00473762
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 57
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATTCAT TCGCTTACAA AAACGGCACG CTCTACTGTG AAAACGTCAA CCTTCAGGAA 
CTGGCGGACA AGGAAAGCAC ACCCCTGTAC GTTTATAGCA AACAAACCAT TTTAAACCAC
TTTCACCGCC TCAGGGAAGC TCTGGCACCG CTTAACGCGG AAGTGGCCTA CGCCGTCAAA
GCCTGCTCCA ACATCGCCAT CCTGAACCTC ATGGCCCGCA ACGGGGCGGG ATTCGACATC
GTCTCCGGCG GAGAACTCTT CCGTGTCCTC AAAGCCGGGG GAGATCCGTC CAAATGCACT
TATGCCGGCG TAGGAAAAAC CGAGCAGGAA ATCCGTTATG CCCTGGCCCA GGGCATTTAT
TGCTTCAATG TGGAATCCGA AGCGGAACTG CGGGCCATTA ACGCCATTGC CGCCTCCATG
GGAGTCAAGG CTCCGGTGGC CGTGCGCGTC AATCCCAACG TAGAGGCGGG AACGCACAAA
TACATTACTA CCGGCAAGGC TGAAAATAAA TTCGGCGTGG ACTTCGAACG CATTGAATCT
CTTTATGAGA TGGCGGCCCG CGAACTTCCG AACCTTCATC TGAAAGGGTT GCAAATGCAT
ATCGGCTCCC AGCTTACCCA GGCGAAACCC TTCCTAGAAG CCGTCAGGAA AGTCGCCCCC
CTGGCTGCTT CCCTGAAAGA AAAACACGGC ATTGAATTTT TCTCCATCGG AGGAGGAATT
GGCATCGTTT ACCAGGGCAC GCTGGATTCC GGCGTTCAGG AATGGTGGAA TGAGGACTGC
GCCCAGCTCA CGCTGAGCAC TTACGCCCAG GCCGTCGTCC CCACCCTGCA ACCTCTGGGA
TTACACATCA TTGTGGAACC GGGCCGTCTT ATCGTAGGAA ATGCGGGAGC ACTCATCACG
CGTTGCCTGT ATGAAAAAAA CGGGAAAGCC AAAACCTTTA AAATTGTGGA TGCAGGGATG
AACGACCTCA TCCGCCCCGC CCTCTACCAG GGCTATCATG AAATTATCCC GGTCAGAGAA
CACCCCTCCG GATCCTGCGT CACAGCGGAT GTTGTGGGTC CCATTTGTGA ATCCGGAGAC
TTCCTCGCCC AAAACAGGGA CATGCCGGAC GTGCGCCAGG GAGAACTCCT GGCCGTACTG
TCCGCCGGAG CCTATGGTTT TTCCATGTCT TCCAACTACA ATTCACGGCC TATGGCGGAA
GAAGTCCTGG TGGACGGGGA CCAATGGAAC GTCATTCGCA GCCGCCAAAG CTGGGAAGAC
CTCATCCGGG GAGAATCCAT TCCGGAATAA
 
Protein sequence
MHSFAYKNGT LYCENVNLQE LADKESTPLY VYSKQTILNH FHRLREALAP LNAEVAYAVK 
ACSNIAILNL MARNGAGFDI VSGGELFRVL KAGGDPSKCT YAGVGKTEQE IRYALAQGIY
CFNVESEAEL RAINAIAASM GVKAPVAVRV NPNVEAGTHK YITTGKAENK FGVDFERIES
LYEMAARELP NLHLKGLQMH IGSQLTQAKP FLEAVRKVAP LAASLKEKHG IEFFSIGGGI
GIVYQGTLDS GVQEWWNEDC AQLTLSTYAQ AVVPTLQPLG LHIIVEPGRL IVGNAGALIT
RCLYEKNGKA KTFKIVDAGM NDLIRPALYQ GYHEIIPVRE HPSGSCVTAD VVGPICESGD
FLAQNRDMPD VRQGELLAVL SAGAYGFSMS SNYNSRPMAE EVLVDGDQWN VIRSRQSWED
LIRGESIPE
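
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal Python sketch, not the method used to generate this record. It assumes the standard codon assignments, which translation table 11 shares with the standard code for all sense codons, and it needs no special start-codon handling because this gene begins with ATG. The names `translate`, `first_row`, and `last_row` are illustrative.

```python
from itertools import product

# Codon table in the usual NCBI ordering (bases T, C, A, G).
# Table 11 uses these same amino acid assignments; it differs from the
# standard code only in which codons may serve as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":          # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)

# First and last rows of the gene sequence, copied from the record.
first_row = "ATGCATTCAT TCGCTTACAA AAACGGCACG CTCTACTGTG AAAACGTCAA CCTTCAGGAA"
last_row = "CTCATCCGGG GAGAATCCAT TCCGGAATAA"

print(translate(first_row))  # MHSFAYKNGTLYCENVNLQE
print(translate(last_row))   # LIRGESIPE
```

The two printed strings match the start and end of the protein sequence in the record. The last row can be translated on its own because the preceding 1260 bases are a multiple of three, so it stays in frame.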