Gene Amuc_1684 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1684 
Symbol 
ID6275609 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2043845 
End bp2045851 
Gene Length2007 bp 
Protein Length668 aa 
Translation table11 
GC content55% 
IMG OID642613743 
ProductTonB-dependent receptor 
Protein accessionYP_001878283 
Protein GI187736171 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTTATA CGAAGAAAGC CCTTCAAATG GGCGCAATTG CTGTTGGCCT TGCCGCATTC 
GCAGGCCAGT CCTCTCTTGC CGAGTCCGCT AAAAAGGAAG ACTCCAAGCC CTCCGTCAGG
ACGGAAACCA TGCAGGTTAT GCCGGAACTG ACTATGGCGT CCCACTTTGT GGGCGTGCCG
TACAACCGGT CTGGAGTGTC CGTTTCCATC ATCAATCCGG AAGAATTCCA GAAGGCGGGC
ATTGAAACCC TGACGGGAGC CCTTTCCCAG ACGCCCGGCG TTTTCACGCT GGACGGAGGC
GGCACCTGGC AGCGCGGTTC CGTGAGCAAC ACCGTCATCC GCGGGATGAA CAAGGATACC
TACACCCTGA CCATGGTTGA CGGCATGCGC ATCAGTGATG CCAACATGTC CGGCAACAAG
CTGCTGGGCA TCACCAATCT CTTTACGGTG GGCAATGTGG AAGTGGTGAA AGGAGCGCAG
GGCGCCGTTT TCGGTTCCGG CGCCATTGGA GGCGTTGTCG CGATGGATAC TCCGGAGGGG
GAAGGCGATC CCGTAACCAG GATTTTTGCG GAGGCCGGTT CCTTCGGCTC TTTCAACAGC
TATGTCACTT CCTCCGGCAA GATCAAGAAG CTTTCCTATT TTGTAGGCGT TGGGTTTGAA
ACTACGGAAA ACGATCCGTC AATCTATCCG GCCATTTATG ACAACAGGAC AGGCATGAAC
GATTTCCGCC AGTGGCAGGA AGCCGTGCGC CTGGGCTATG ACATCAATGA CAAGGTGAAG
GTGAGCTTTA CCTATCGCCG CCTGGATTCC TACTTTGAAT ATCCCACGCC GTATGTGGAT
TATAACCAGT GGCCTTCCGT TCCGGAGCCC CATCTGTACA ATACGGAAGA CAAGAACCGC
AGCAACTTGG TGACAGGACG TGTGGATGCG GAAATTTCCA AGCTGTGGTC CACCAGCTTC
ATGGTGGGGC ATTACAACAT GGACTATTCC TGCCATACTC CCGGATTTGA CTTCCAGCCC
AACGTGATGC GCAACCGCCG CTTCCAGGCG GAGTGGCGCA ATGCCCTGAC GTGGAACAAG
GAATGGAAAA CCATCGCCGG CATGGCCTGG GACCGTTCCG ACTACATGAG CGAAAACAAT
TACGTTGCCA AGGATGAATG GCAGAGCACG CTTGCCTTCT TTGCAGAGCA AATGTGGTCA
CCCACGGACA GCTTTGACGC CAGCGTGGCC CTGCGCCTGG AACATGATTC CGTCTGGAAC
AATCATTTTA CGTGGCGTTA TTCCAATTCC TGGAAAGTGA CGGGCAAGGA TTCCCCCACC
CGTATTTTCG GTTCCGTAGG GTCCGGTTTC CGCGCGCCCA CCTGGTTTGA GCAGTATGCG
GCCAATTACG GTTATGTAGG CAACCCTGAT CTGGATGTGT CCAAGTCCCT GGGCGGCGAC
CTGGGTGTGG AACAGCGCCT GGCGGACGGC CATTATGCTT CCGTGACGGG CTTCTGGACC
CGCATTAACG ATGAAATCGG CACCAAGAGC GTAGGTACCT GGCCGAACTC CTATACCACT
TACGCCAATT ATTCCCACTG CACTTCCTAC GGTGTGGAAG TGGCGTTCAA GGGTCAGTTC
AAGGACGCGT GGAACAGCGG CTATTATGCC AATTACACCT TCACGATGCC CAAGCGCGAT
TCCATCGGCA AATACGAGAC CATTCAGATG GCCAATACCG CCCGCCACAC CGTCAACGCA
GAGGTTTATA CCTCCCCGGT TGAAAAGCTT ACGGTTGGCT TCGGCGTAAC GGCCGCCATG
GGACGCACGG ACTACAACTA TGCCCGTCTG GATAATTTCT TTACGGCGCG CCTGTTTGCC
CGTTACCAGG CAACGGACAA TGTGTCCCTC CATGTGCGTC TGGAAAACCT GTTTGACCAG
AAGTTCATCA TGACGAATGA TTATAACTTC GGTCCGCGTG AAGCCCGTGG ATTCGGAATC
TTCGGCGGTG TGACGGTCGA ATTCTAA
 
Protein sequence
MLYTKKALQM GAIAVGLAAF AGQSSLAESA KKEDSKPSVR TETMQVMPEL TMASHFVGVP 
YNRSGVSVSI INPEEFQKAG IETLTGALSQ TPGVFTLDGG GTWQRGSVSN TVIRGMNKDT
YTLTMVDGMR ISDANMSGNK LLGITNLFTV GNVEVVKGAQ GAVFGSGAIG GVVAMDTPEG
EGDPVTRIFA EAGSFGSFNS YVTSSGKIKK LSYFVGVGFE TTENDPSIYP AIYDNRTGMN
DFRQWQEAVR LGYDINDKVK VSFTYRRLDS YFEYPTPYVD YNQWPSVPEP HLYNTEDKNR
SNLVTGRVDA EISKLWSTSF MVGHYNMDYS CHTPGFDFQP NVMRNRRFQA EWRNALTWNK
EWKTIAGMAW DRSDYMSENN YVAKDEWQST LAFFAEQMWS PTDSFDASVA LRLEHDSVWN
NHFTWRYSNS WKVTGKDSPT RIFGSVGSGF RAPTWFEQYA ANYGYVGNPD LDVSKSLGGD
LGVEQRLADG HYASVTGFWT RINDEIGTKS VGTWPNSYTT YANYSHCTSY GVEVAFKGQF
KDAWNSGYYA NYTFTMPKRD SIGKYETIQM ANTARHTVNA EVYTSPVEKL TVGFGVTAAM
GRTDYNYARL DNFFTARLFA RYQATDNVSL HVRLENLFDQ KFIMTNDYNF GPREARGFGI
FGGVTVEF