Gene Amuc_1547 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1547 
Symbol 
ID6273661 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1858942 
End bp1860729 
Gene Length1788 bp 
Protein Length595 aa 
Translation table11 
GC content56% 
IMG OID642613606 
Producthypothetical protein 
Protein accessionYP_001878149 
Protein GI187736037 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.180047 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAACC TGTTGTTTGC ATTGTTGACC GGTTCCTTTT GTTGCTGTTA TGCCCAACAG 
AAGGCCGCAC CCGTTCCGGA ACCTGAAGTT GTCGCCACTC CGCCGGCTGA TGCGGGGCGC
GGCCTTATCC GTGTGGACTC CCGTGAAATA CGCCATTATT CCGGTACCCG CAAGGAACCG
GATTACCTGG TCAGCAGGGA TAACGGAAAA ACATGGGAAA TGAAGGCCGC TCCGGCTGGC
TACCCTCCCA ACTACGGTGG CATTCCCAAA GAATCTCCAG CTATTGTGCG CAACCCTCTG
ACCAGGGAAT TCATTCGTGT GCAGCCTATC GGGGGCTTTG TATTTCTTTC CAGGGGTGGG
CTGGACGGCA AGTGGCTTGC CGTCACGAAT GACGGCAAAC TGGAAGAAGA CTGGAAAGAC
CCGGAAAAGA GGAAAAACCT GAAAAAACTG GGCGGCATCA TGCGAACCCC CGTTTTTGTG
AACAAGGGCC GCAGGGTGAT CGTGCCGTTC CACAACATGG GCGGCGGCAC CAAGTTCCAT
ATTTCCGATG ACGGGGGGCT GACCTGGCAT GTATCCAGGA ACGGTGTTAC TTCCCCCAGA
CATGAAGCCA GGCCCCCCCA CCAGGGCGTC AGATGGTTCA ACAATGCCGT GGAAGCCACG
GTTTTGGAAA TGAAAGACGG TACGTTGTGG GCGCTTGCCC GCACCTCCCA GGACCAGGCG
TGGCAGGCTT TTTCCAAGGA TTACGGGGAA ACGTGGAGCA AACCGGAGCC TTCCCGCTTT
TTCGGCACCC TGACCATGAA CACGTTGGGA CGCCTGGATG ACGGAACTAT CGTTTCCCTG
TGGACGAATA CAATGGCTCT GCCTGAAAAC GCCACAGCTG GCAACGGAAC GTGGGAGGAT
GTATTCACCA ACCGTGATTC CCACCACATT GCTATGTCCG GGGACGAGGG CAAAACCTGG
TACGGGTTCC GGGAGATTAT CCTGGACGAA CACCGCAACC ATCCCGGCTA TGCTACGCTG
GATGGTCCGG AAGACCGCGG CAAACATCAG AGCGAAATGG TGCAGCTGGA CAAAAACCGC
ATCCTTATTT CCCTGGGGCA GCATAAAAAC CACCGCCGCC TGGTTATTGT GGACCGCCGC
TGGGTAGGGG CCAAGACGCG TGCCACGCAG ACGGGGAAAG ATTTGGATTC CCAGTGGACC
ATTCACACTT ATATCCCCCA GAAAAAAGGC CATTGCAGTT ATAACCGCAA GCCTTCCGCC
GAGTTGGTTC AGGATCCGTC CGGGGGCACG AAGAAGGTGT TGCAAATCAA GCGTCTGGAT
GATCCCGAAC TGGTCAATGA AAAATCCAAT GTGGATTACC GGAACGGCGG AGCTACCTGG
AACTTTCCGA ACGGGACCAC GGGGCTGGTC AAATTCCGCT TCCGTGTAGT GGACGGGGAG
CAGGCGGATG ATTCCGGCCT TCAGGTCTCT CTGACGGACC GGCTGTTTAA TGCCTGTGAT
TCCACTACGA AGGATTATGC CCTGTTTACC TTCCCGATCA GGCTGAAACC TGCGCCCCAT
CTGTTGCTGG GGATGAAAAA AGTGCCTTTC ACGCCCGGCG CGTGGCATGA AATTTCCCTT
CTTTGGCAGG GTGGGCAGGC CGTGGTGTCT CTGGACGGAA AGAAGGCCGG AACGTTGAAA
ATGGCTAATA AGTCCCCCAA TGGAGCCAGT TATATCCATT TCATCAGCAC CGGGTCCCAA
CCGGATGCCG GCATTCTGCT GGATACGGTG AATGCCCGGG TGAAGTAA
 
Protein sequence
MKNLLFALLT GSFCCCYAQQ KAAPVPEPEV VATPPADAGR GLIRVDSREI RHYSGTRKEP 
DYLVSRDNGK TWEMKAAPAG YPPNYGGIPK ESPAIVRNPL TREFIRVQPI GGFVFLSRGG
LDGKWLAVTN DGKLEEDWKD PEKRKNLKKL GGIMRTPVFV NKGRRVIVPF HNMGGGTKFH
ISDDGGLTWH VSRNGVTSPR HEARPPHQGV RWFNNAVEAT VLEMKDGTLW ALARTSQDQA
WQAFSKDYGE TWSKPEPSRF FGTLTMNTLG RLDDGTIVSL WTNTMALPEN ATAGNGTWED
VFTNRDSHHI AMSGDEGKTW YGFREIILDE HRNHPGYATL DGPEDRGKHQ SEMVQLDKNR
ILISLGQHKN HRRLVIVDRR WVGAKTRATQ TGKDLDSQWT IHTYIPQKKG HCSYNRKPSA
ELVQDPSGGT KKVLQIKRLD DPELVNEKSN VDYRNGGATW NFPNGTTGLV KFRFRVVDGE
QADDSGLQVS LTDRLFNACD STTKDYALFT FPIRLKPAPH LLLGMKKVPF TPGAWHEISL
LWQGGQAVVS LDGKKAGTLK MANKSPNGAS YIHFISTGSQ PDAGILLDTV NARVK