Gene Information |
Locus tag | Amuc_1684 |
Symbol | |
ID | 6275609 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 2043845 |
End bp | 2045851 |
Gene Length | 2007 bp |
Protein Length | 668 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642613743 |
Product | TonB-dependent receptor |
Protein accession | YP_001878283 |
Protein GI | 187736171 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG4206] Outer membrane cobalamin receptor protein |
TIGRFAM ID | |
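The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single stop codon which is not counted in the protein length. The constant and function names are illustrative and are not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2043845
END_BP = 2045851
GENE_LENGTH_BP = 2007
PROTEIN_LENGTH_AA = 668


def span_length(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp):
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2045851 - 2043845 + 1 gives 2007 bp, and 2007 / 3 - 1 gives 668 aa, which agrees with the Gene Length and Protein Length fields.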
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTTATA CGAAGAAAGC CCTTCAAATG GGCGCAATTG CTGTTGGCCT TGCCGCATTC GCAGGCCAGT CCTCTCTTGC CGAGTCCGCT AAAAAGGAAG ACTCCAAGCC CTCCGTCAGG ACGGAAACCA TGCAGGTTAT GCCGGAACTG ACTATGGCGT CCCACTTTGT GGGCGTGCCG TACAACCGGT CTGGAGTGTC CGTTTCCATC ATCAATCCGG AAGAATTCCA GAAGGCGGGC ATTGAAACCC TGACGGGAGC CCTTTCCCAG ACGCCCGGCG TTTTCACGCT GGACGGAGGC GGCACCTGGC AGCGCGGTTC CGTGAGCAAC ACCGTCATCC GCGGGATGAA CAAGGATACC TACACCCTGA CCATGGTTGA CGGCATGCGC ATCAGTGATG CCAACATGTC CGGCAACAAG CTGCTGGGCA TCACCAATCT CTTTACGGTG GGCAATGTGG AAGTGGTGAA AGGAGCGCAG GGCGCCGTTT TCGGTTCCGG CGCCATTGGA GGCGTTGTCG CGATGGATAC TCCGGAGGGG GAAGGCGATC CCGTAACCAG GATTTTTGCG GAGGCCGGTT CCTTCGGCTC TTTCAACAGC TATGTCACTT CCTCCGGCAA GATCAAGAAG CTTTCCTATT TTGTAGGCGT TGGGTTTGAA ACTACGGAAA ACGATCCGTC AATCTATCCG GCCATTTATG ACAACAGGAC AGGCATGAAC GATTTCCGCC AGTGGCAGGA AGCCGTGCGC CTGGGCTATG ACATCAATGA CAAGGTGAAG GTGAGCTTTA CCTATCGCCG CCTGGATTCC TACTTTGAAT ATCCCACGCC GTATGTGGAT TATAACCAGT GGCCTTCCGT TCCGGAGCCC CATCTGTACA ATACGGAAGA CAAGAACCGC AGCAACTTGG TGACAGGACG TGTGGATGCG GAAATTTCCA AGCTGTGGTC CACCAGCTTC ATGGTGGGGC ATTACAACAT GGACTATTCC TGCCATACTC CCGGATTTGA CTTCCAGCCC AACGTGATGC GCAACCGCCG CTTCCAGGCG GAGTGGCGCA ATGCCCTGAC GTGGAACAAG GAATGGAAAA CCATCGCCGG CATGGCCTGG GACCGTTCCG ACTACATGAG CGAAAACAAT TACGTTGCCA AGGATGAATG GCAGAGCACG CTTGCCTTCT TTGCAGAGCA AATGTGGTCA CCCACGGACA GCTTTGACGC CAGCGTGGCC CTGCGCCTGG AACATGATTC CGTCTGGAAC AATCATTTTA CGTGGCGTTA TTCCAATTCC TGGAAAGTGA CGGGCAAGGA TTCCCCCACC CGTATTTTCG GTTCCGTAGG GTCCGGTTTC CGCGCGCCCA CCTGGTTTGA GCAGTATGCG GCCAATTACG GTTATGTAGG CAACCCTGAT CTGGATGTGT CCAAGTCCCT GGGCGGCGAC CTGGGTGTGG AACAGCGCCT GGCGGACGGC CATTATGCTT CCGTGACGGG CTTCTGGACC CGCATTAACG ATGAAATCGG CACCAAGAGC GTAGGTACCT GGCCGAACTC CTATACCACT TACGCCAATT ATTCCCACTG CACTTCCTAC GGTGTGGAAG TGGCGTTCAA GGGTCAGTTC AAGGACGCGT GGAACAGCGG CTATTATGCC AATTACACCT TCACGATGCC CAAGCGCGAT TCCATCGGCA AATACGAGAC CATTCAGATG GCCAATACCG CCCGCCACAC CGTCAACGCA GAGGTTTATA CCTCCCCGGT TGAAAAGCTT ACGGTTGGCT TCGGCGTAAC GGCCGCCATG 
GGACGCACGG ACTACAACTA TGCCCGTCTG GATAATTTCT TTACGGCGCG CCTGTTTGCC CGTTACCAGG CAACGGACAA TGTGTCCCTC CATGTGCGTC TGGAAAACCT GTTTGACCAG AAGTTCATCA TGACGAATGA TTATAACTTC GGTCCGCGTG AAGCCCGTGG ATTCGGAATC TTCGGCGGTG TGACGGTCGA ATTCTAA
|
Protein sequence | MLYTKKALQM GAIAVGLAAF AGQSSLAESA KKEDSKPSVR TETMQVMPEL TMASHFVGVP YNRSGVSVSI INPEEFQKAG IETLTGALSQ TPGVFTLDGG GTWQRGSVSN TVIRGMNKDT YTLTMVDGMR ISDANMSGNK LLGITNLFTV GNVEVVKGAQ GAVFGSGAIG GVVAMDTPEG EGDPVTRIFA EAGSFGSFNS YVTSSGKIKK LSYFVGVGFE TTENDPSIYP AIYDNRTGMN DFRQWQEAVR LGYDINDKVK VSFTYRRLDS YFEYPTPYVD YNQWPSVPEP HLYNTEDKNR SNLVTGRVDA EISKLWSTSF MVGHYNMDYS CHTPGFDFQP NVMRNRRFQA EWRNALTWNK EWKTIAGMAW DRSDYMSENN YVAKDEWQST LAFFAEQMWS PTDSFDASVA LRLEHDSVWN NHFTWRYSNS WKVTGKDSPT RIFGSVGSGF RAPTWFEQYA ANYGYVGNPD LDVSKSLGGD LGVEQRLADG HYASVTGFWT RINDEIGTKS VGTWPNSYTT YANYSHCTSY GVEVAFKGQF KDAWNSGYYA NYTFTMPKRD SIGKYETIQM ANTARHTVNA EVYTSPVEKL TVGFGVTAAM GRTDYNYARL DNFFTARLFA RYQATDNVSL HVRLENLFDQ KFIMTNDYNF GPREARGFGI FGGVTVEF
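The gene and protein sequences above are related through translation table 11, so one can be re-derived from the other. The following is a minimal sketch that uses only the standard library. The function names are illustrative, and the sketch assumes the sequence is pasted with the 10-base spacing shown in this record. Table 11 assigns the same amino acids to codons as the standard code and differs in which codons may initiate translation; this gene starts with ATG, so no special handling of alternative start codons is included.

```python
# Codon table built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    print(translate("ATGCTTTATA CGAAGAAAGC CCTTCAAATG"))  # MLYTKKALQM
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing spaces) checks the two fields against each other, and `gc_content` on the same input can be compared with the GC content field.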
|
| |