Gene Amuc_0474 details

Gene Information

Locus tag: Amuc_0474
Symbol:
ID: 6274557
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 562895
End bp: 564181
Gene length: 1287 bp
Protein length: 428 aa
Translation table: 11
GC content: 59%
IMG OID: 642612524
Product: Three-deoxy-D-manno-octulosonic-acid transferase domain protein
Protein accession: YP_001877093
Protein GI: 187734981
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1519] 3-deoxy-D-manno-octulosonic-acid transferase
TIGRFAM ID:
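
The start and end coordinates, gene length, and protein length reported above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 562895
END_BP = 564181
REPORTED_GENE_LENGTH_BP = 1287
REPORTED_PROTEIN_LENGTH_AA = 428


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


computed_gene_len = gene_length(START_BP, END_BP)
computed_protein_len = protein_length(computed_gene_len)
print(computed_gene_len, computed_protein_len)  # prints: 1287 428
```

Under these assumptions both computed values agree with the reported gene length and protein length.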


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value: 0.547592
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAAAGCCT TCCTGTTTTC CTTCTGTTAC AATATCCTGT ATACCGTCGG GTGGCTGGTA 
ACGCTGCCTT CCTATCTGCT TAAGCAGAAG AGGAGAGGCG GATTCGGCAC GGGGCTTCTG
GAGCGTTTCG GGCTGTACCG GGTCTCCTAT AACCGGGAGC CGAAAGGGGT GTTGTATGTG
CATGCCGTGA GCGTGGGTGA AGTGGTGCTG GCTCTCAAGT TTCTCCGGGC GTGGCTCCGG
GAACGCGGAG GTTCCGCCGT GCTGGCTACC AGTACGGCTA CGGGGCATGC TACGGCAGTG
AGTGCGCAGA TTCCCGGCGT GCGCGTGATT TATGCCCCCT TTGACCTTCT GGGGCTGCCG
GGGAGGTGTT TTGACCGTTT TGAACCGGAG GCCATTGTCC TGGTGGAGGC CGAGTTGTGG
CCGAATTTTG CCCGTGCAGC CAAGGTGCGC GGCATACCCA TGGCGATGAT TAATGCCCGT
ATGTCCGCCA GGTCCGAGAG CCGCTACCGC GCATTTAAAT GGATTTCAAA GTATTATTTT
TCCTTCCTTG ACGCAATGGG CGTACAGGAC AAGGGGGATG TCCGGAGGTT TGAATCCGTG
GGCGTGCGTT CCTCCATCAT CCACGTGACG GGAAGCATCA AATTTGACCA GCAAATGGCG
GAGCGCAGGG AGGCTAATGC GGAGTTTGCC TCCATTTTGG ACAAACTGAA ACGCGGCAAG
CCTGTAGTGC TGGCCGCCAG CACGCATGAC GGGGAGGAAG TTCTGATCGC AGAAGCGGCG
CGCAAGGCGG GAGGCTTCCC CCTGATCGTG CCCAGGCATG CGGAACGCCG CCATGCCGTG
GTGCGGGAAC TGGAGGCCCA GGGCTGGCAG TGCGTGCTGC GGACTGATGG CGAGATTCCG
GAAACCCTGA AGGATCATGT TTGTTACATC GCGGATACGA CGGGGGAATT GAGGGACTGG
ACCGCGCTGG CGGATGTAGC CGTGATCGGA AAAAGCTTTT TGGCGGACGG TGGACAGAAT
CCGGCTGAAG CCGTCGCCTG TGGCGTGCCC GTGCTGACGG GGCCCCATAT GGAAAATTTC
GATGCACTGG TCCAGCTGCT TGAAGGAGTG GACGGAATTG CCAGGTGTGA TGAAAACCGC
CTGGCGGACG TGCTCAAGGA GATGCTGGAC AACCCCCTGC TGGCTCATGC CCAGTCTTCC
CGCGCCCAGG TGGCGTTGAA GGCCCATTTC GGCGCTACGG CCCGGACCAT TCGCATGATC
TGCATCATGC TCAAGATTCC TGTTTGA
 
Protein sequence
MKAFLFSFCY NILYTVGWLV TLPSYLLKQK RRGGFGTGLL ERFGLYRVSY NREPKGVLYV 
HAVSVGEVVL ALKFLRAWLR ERGGSAVLAT STATGHATAV SAQIPGVRVI YAPFDLLGLP
GRCFDRFEPE AIVLVEAELW PNFARAAKVR GIPMAMINAR MSARSESRYR AFKWISKYYF
SFLDAMGVQD KGDVRRFESV GVRSSIIHVT GSIKFDQQMA ERREANAEFA SILDKLKRGK
PVVLAASTHD GEEVLIAEAA RKAGGFPLIV PRHAERRHAV VRELEAQGWQ CVLRTDGEIP
ETLKDHVCYI ADTTGELRDW TALADVAVIG KSFLADGGQN PAEAVACGVP VLTGPHMENF
DALVQLLEGV DGIARCDENR LADVLKEMLD NPLLAHAQSS RAQVALKAHF GATARTIRMI
CIMLKIPV
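
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence codon by codon. The following is a minimal sketch, not the method the database itself uses. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ only in the permitted start codons, and this gene starts with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical. Only short excerpts of the listing are used here; pasting the full gene sequence in works the same way.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks and the final line of the gene sequence above.
first_blocks = "ATGAAAGCCT TCCTGTTTTC CTTCTGTTAC"
last_line = "TGCATCATGC TCAAGATTCC TGTTTGA"

print(translate(first_blocks))  # prints: MKAFLFSFCY
print(translate(last_line))     # prints: CIMLKIPV
```

The two printed fragments match the first block and the last line of the protein sequence. Passing the complete gene sequence to `gc_percent` gives a value that can be compared with the reported GC content.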