Gene Amuc_0418 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0418 
Symbol 
ID6274836 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp498523 
End bp499581 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content58% 
IMG OID642612468 
ProductAnkyrin 
Protein accessionYP_001877037 
Protein GI187734925 
COG category[R] General function prediction only 
COG ID[COG0666] FOG: Ankyrin repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones74 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACAAA TGACAGGAAC CATGCTGCTT CTGTCCTCAG CAGTCGTTAT GGCCGCTCCC 
TGGGCGCATG CGGAGGAAAA GACAATCCAG CTCACGGAAG CGGAACAACA GGAAATCAAG
ACGGCAAACG AAAAGCTGCT GGGCCTTACC CTCCGTTTCC TGCATGACTC ATGGCCGCTG
GAAATCATGT TTCCCGGAGA AGTGCAGGAG GAATTCCACT CCATTCTTCA ATGCCATCAG
ATGCTGGAGC AATTCCGCCA AACCGGCAAC CTCCTGCTCC AGACGCCGGA CCGTACCACG
CCTCTGCACC TCTGCATTGC CCTGGGATTA AACCGGCTGG CCGTCCGGAT GGTAGAAGCG
GGCGCTCCCG TCAATGCTCA ATCCATTTTC ATGCATGACG GCACAAAAGA GCCGGGAGAC
ACCCCCCTTA CCTGGGCGTG CCTCTCAGGC CTTTATATGA ATTCCACCGC GGAAGAAAGG
CTGCCGCTGG TGCACGCCCT GCTCAAACAC GGCGCCGAGC CGGATCAGCC GGGGCCTTGG
GGCGTCACCC CGTTCATGTA TTCCGCCGCC CTCAATGACT CCGATCCGGG TCAGGAAAAA
ATAGCGCTGG CGCTCCTGGA CGCCGGCTCC CCGGATCTCA AGCGCAGAAT GAACGCTCAG
GCGCGCGGAG TCGGCTTCCT CAGTCTGTCC CCCGCCATTT ACGAACGGCT CATCAAAGCC
GGCTGCGATG TTAACGAACG CTTTTTTGAA AGCAAGCAAT CACCCCTCCA CCTGGTCTGC
ACCAAGGAAA AACCGGCGGA ACGCCTCATT CCTCTCATTG AACTTCTCAT CAACGCGGGA
GCGGATCCCA ACCAGCCGGA CGTGGATGGC CTGACTCCGC TGATGGCCTG CAACTCTCCG
GAAATAGCCG TTTGCCTCAT GAATCACGGC GCCAATCCCT CTCTCCGCAA CGATGACGGC
CAGACAGCCT ATGACTTCCA TATGAAAAAC GGGTATCCCC CCATTGCGGA AGCCATCAAG
CACTGGCAGT CCAAGCAGAA AAAAGGGGAA ACCCGCTAA
 
Protein sequence
MRQMTGTMLL LSSAVVMAAP WAHAEEKTIQ LTEAEQQEIK TANEKLLGLT LRFLHDSWPL 
EIMFPGEVQE EFHSILQCHQ MLEQFRQTGN LLLQTPDRTT PLHLCIALGL NRLAVRMVEA
GAPVNAQSIF MHDGTKEPGD TPLTWACLSG LYMNSTAEER LPLVHALLKH GAEPDQPGPW
GVTPFMYSAA LNDSDPGQEK IALALLDAGS PDLKRRMNAQ ARGVGFLSLS PAIYERLIKA
GCDVNERFFE SKQSPLHLVC TKEKPAERLI PLIELLINAG ADPNQPDVDG LTPLMACNSP
EIAVCLMNHG ANPSLRNDDG QTAYDFHMKN GYPPIAEAIK HWQSKQKKGE TR