Gene Amuc_1858 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1858 
Symbol 
ID6275469 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2259751 
End bp2261733 
Gene Length1983 bp 
Protein Length660 aa 
Translation table11 
GC content37% 
IMG OID642613919 
Producthypothetical protein 
Protein accessionYP_001878453 
Protein GI187736341 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.153141 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value0.576375 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATACAAA GGTCTTATTA TTTCAATGAC CTACAGTCTT TTTGTACCCA AACTCTAGAA 
GAAATTTTAG GGCAGATTGC TCTTCATACC CCATTTTCTT TAGATCAAAA TTCACGTAAT
TCTTTTGTTC ACGAAATTAG ATTACTCCAA AGTGTTCTTA AAAGTATTCC CTCTGGATCC
ATAGCCCTTG AATACACTAT TCCTCGCATC GGAGAGCGAA TAGATGTCGT AATCGCTTGT
GCTGGTATTC TCTATATTCT TGAATTTAAA GTAGGAGAAT CATCTTATCC AAGACATGCT
ATTGATCAAG TGGTAGATTA TGCACTAGCA TTAAAATATT TCCATCAAGA AAGTTATCAA
AAGAAAATTG TTCCTCTGGT AGTCTGCACT CATGCTCCTT CTAAAGAGTT TCAACTTATA
ATGAACCCAG ATGGGGTTTA TCTTCCTATC CTTTGCAATG ATAATACCTT AGGTCCTAAT
TTAACAAAAC TTACTAACAA TCTGTGTGAT GACGAATTCC ATTTTAAACA ATGGTTAATT
TCGCCATACA TGCCTACTCC CACGATCATT GAAGCAGCTC AAGCACTCTA CCGTGGACAT
GGAGTCAAAG AAATTTCACG GAGTAGTGCC GGAGCCTACA ATCTTAGTCT CACAACAAAG
GTACTAAATC GTATTATTGA ACAAAGTAAA CAGTATCATC AAAAATCTAT TTGTTTTGTT
ACTGGCGTAC CTGGTGCCGG AAAAACGCTA GTAGGCCTCA ATATTGCAAA TGAGCGACAT
CAATATGATA AACAAGAACA TGCAGTTTTT CTTTCCGGTA ATGGTCCCTT GGTTGCTGTC
CTGCGAGAAG CTCTAGTACG AGATGAAATT AAACGCTGTA AAGGCAAAAT AACAAAAATA
ACATCTAAAA GAAAAGTTGC TGCCTTTATT CAAAATATCC ATCATTTTCG TGATACTTAT
CTACCTCCTT CCGAACAAGT TCCTGCGGAA AAAGTAACTA TTTTTGACGA AGCACAACGT
GCTTGGACAA AAGAGCAAAC GGCTAAATTT ATGTTAAAAC GCCATGTTCC CTCCTGGAAC
ATGTCTGAAC CAGAATTCCT CATTAGTGTA ATGGATCGCC ATCAAGATTG GGCTGTAATC
ATTTGTCTAA TTGGAGGTGG TCAAGAAATA CATACTGGAG AAGCTGGCCT TTTAGCTTGG
TTTGATGCAC TAAGAAACCA TTTCCCTCAT TGGAATGTTT ATGTGTCCCC CCAAATCTCT
GATGTAGAAT ATACGCAAGG AAAAACACTT GAATCTCTCT TTATGGGATT ACATCTTTAT
CAGGAAAAAA AACTTCATCT TTCTGTCTCA CTTCGTTCTT TTCGGAATGA AAAAGTTTCA
GCATTTGTAA AATCTCTATT GGATGAAAAC TTACCAATAG CTCAACAACT CTATTCAGAA
CTCTCACTTA ATTATCCTAT TGTCATTACA CGTAGCCTAG AAAAAGCTAA ACAATGGGTA
CAAAATCAAT CCCGAGGTAC AGAACGTTAT GGACTCATTT CTAGTTCAGG AGCCAAACGT
CTACGCCAAT TTGGTATTTG GGTACAGAAC GATATTCAGG CAGAAAATTG GTTTTTAAAC
GATAAAGAGG ATGTACGCTC TTCCTATTTT CTAGAAGAAA CAGCAACTGA ATTTGATATT
CAGGGTCTTG AAATTGATTG GGCAATCGTT GCATGGGATG CAGACTTTCG TATAGAAAAA
GGACATTTTA AAGCTTATAA TTTTAAGGGG TCTAGTTGGA AAATAGTTCG TAAGAAAGAT
GCACAACTCT ATCTCAAAAA TACTTACCGT GTTTTATTAA CACGAGCACG CCAAGGGTTC
GTTATTTTTA TTCCAAAAGG ATGTGACGAG GATTTGACTC GTCACTCCTC CTTCTATGAT
GGTATTTATT ATTACCTAAA AGAAATAGGT ATCAAGGAGC TATGCCTTTC TGAAGAACAA
TAG
 
Protein sequence
MIQRSYYFND LQSFCTQTLE EILGQIALHT PFSLDQNSRN SFVHEIRLLQ SVLKSIPSGS 
IALEYTIPRI GERIDVVIAC AGILYILEFK VGESSYPRHA IDQVVDYALA LKYFHQESYQ
KKIVPLVVCT HAPSKEFQLI MNPDGVYLPI LCNDNTLGPN LTKLTNNLCD DEFHFKQWLI
SPYMPTPTII EAAQALYRGH GVKEISRSSA GAYNLSLTTK VLNRIIEQSK QYHQKSICFV
TGVPGAGKTL VGLNIANERH QYDKQEHAVF LSGNGPLVAV LREALVRDEI KRCKGKITKI
TSKRKVAAFI QNIHHFRDTY LPPSEQVPAE KVTIFDEAQR AWTKEQTAKF MLKRHVPSWN
MSEPEFLISV MDRHQDWAVI ICLIGGGQEI HTGEAGLLAW FDALRNHFPH WNVYVSPQIS
DVEYTQGKTL ESLFMGLHLY QEKKLHLSVS LRSFRNEKVS AFVKSLLDEN LPIAQQLYSE
LSLNYPIVIT RSLEKAKQWV QNQSRGTERY GLISSSGAKR LRQFGIWVQN DIQAENWFLN
DKEDVRSSYF LEETATEFDI QGLEIDWAIV AWDADFRIEK GHFKAYNFKG SSWKIVRKKD
AQLYLKNTYR VLLTRARQGF VIFIPKGCDE DLTRHSSFYD GIYYYLKEIG IKELCLSEEQ