Gene Amuc_1730 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1730 
Symbol 
ID6274695 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2111470 
End bp2112465 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content60% 
IMG OID642613793 
Productdelta-aminolevulinic acid dehydratase 
Protein accessionYP_001878329 
Protein GI187736217 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0113] Delta-aminolevulinic acid dehydratase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.665288 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones73 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAATC TACCGATACG GCCTCGCCGC AACAGAAAAT CCGCCAACAT CCGGGGCCTT 
ATCCGGGAAA CCTCCCTTTC CCCGGAACAC CTCATCTATC CCGTCTTCGT CCATGAAGGG
ACCGGGAACC AGCCCATCCC CTCCCTGCCG GGCTGCACGC GCTGGAGCGT CAAGGGACTG
GTGGAAGAAG CCAAACGCCT GATGGATCTG GGCATCCGGA CGCTGGATCT TTTTCCAGCC
ATTCCGGATG AGAAAAAAAC GCCGGACGCC TGCGAAGCCT GCAACCCTGA CGGCCTCATC
CCGCGCACCA TTTACGCGCT CAAAAGCGAA GTGCCGGGCA TCACCGTCAT GACGGACGTA
GCCCTGGATC CCTACAATTC CGACGGCCAT GACGGCCTGG TGGAATTCCG CTCCGACGGA
ACGATGGAAA TCCTTAACGA CGATTCCGTG GAGGTTCTCT GCCGCCAGGC ATTGTGCCAC
GCGGATGCCG GGGCGGATAT CGTCTCCCCC AGCGACATGA TGGACGGCCG CGTGGCCGCC
ATCCGCGCCA CCCTGGATTC CGAAGCCCTG GACGATGTCT CCATCATGGC CTATACCGCC
AAGTACGCCA GCGCTCTGTA CGGGCCGTTC CGCGGAGCTC TGGAGAGCGC ACCCAAGGAA
GGGGATAAAA AAACCTACCA GATGGATCCC GGCAACATCC GGGAAGCCCT TAGGGAAGCG
CAGCTGGATG AAGCTGAGGG AGCGGACATC CTGATGGTGA AACCCGCCAC ATTGTATCTG
GACGTTATGG CCGCCATGCG GAAGCAGGTG ACCCTTCCCA TAGCCGCCTA CCATGTCAGC
GGGGAGTACC TGATGATCAA GTCCGCCGCA GCTTCCGGCT GGCTGGATGA ACGGGAAACA
GTTCTGGAGA CTCTAATATC CATCCGGCGC GCCGGGGCCG ATATGATCCT TACGTATTAC
GCCCCCCAGG CCGCCGAATG GCTTCAACAA CGCTGA
 
Protein sequence
MMNLPIRPRR NRKSANIRGL IRETSLSPEH LIYPVFVHEG TGNQPIPSLP GCTRWSVKGL 
VEEAKRLMDL GIRTLDLFPA IPDEKKTPDA CEACNPDGLI PRTIYALKSE VPGITVMTDV
ALDPYNSDGH DGLVEFRSDG TMEILNDDSV EVLCRQALCH ADAGADIVSP SDMMDGRVAA
IRATLDSEAL DDVSIMAYTA KYASALYGPF RGALESAPKE GDKKTYQMDP GNIREALREA
QLDEAEGADI LMVKPATLYL DVMAAMRKQV TLPIAAYHVS GEYLMIKSAA ASGWLDERET
VLETLISIRR AGADMILTYY APQAAEWLQQ R