Gene Information |
Locus tag | Amuc_1686 |
Symbol | |
ID | 6274435 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 2048214 |
End bp | 2050556 |
Gene Length | 2343 bp |
Protein Length | 780 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642613745 |
Product | Beta-galactosidase |
Protein accession | YP_001878285 |
Protein GI | 187736173 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1874] Beta-galactosidase |
TIGRFAM ID | |
|
|
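The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the reported gene length) and that the protein length excludes the terminal stop codon. The function names are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information table.
start_bp = 2048214
end_bp = 2050556


def gene_length(start, end):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(length_bp):
    """Residues encoded by a CDS of length_bp, not counting the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 2343
print(protein_length(length_bp))  # 780
```

Both results agree with the "Gene Length" and "Protein Length" rows of the record.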
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00233252 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 0.321684 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTAT CCTTTTTCTC CGTTCTGCTT CTGGCAGGGC ATCTTTGTGC GGCTGCTCCC ATGCCTTTGC CGGAATCCAA TGACGGAGCC AGGCATGTCT TCTCTACTAA TCAGGAAAAT TTTTTGATGG ACGGAAAGCC CGTCAAAATC ATTTCCGGGG AAATGCATTA TCCTCGCGTG CCGCGCCAGC ACTGGAAGGA CAGGTTCCAG CGCATAAAGG CCATGGGCAT GAATACCGTC TGCACTTATC TGTTCTGGAA CGTGCATGAA CCGGAACCCG GCAAATGGGA CTTTTCCGGC AATCTGGATT TTGTGGAATT CATCAAGGAG GCGCAGAAGG CCGGCCTGTG GGTCATTGTG CGTCCCGGGC CCTATGTGTG CGCGGAATGG GAATTCGGCG GATTTCCCGG CTGGCTGCTG AAGGATGAAG ATTTGAAAGT CCGTTCCCAG GATCCCCGCT TCCTGGAACC GGCCATGGCT TATCTTAAAA AAGTCTGTTC CATGCTGGAA CCTCTGCAGA TTACCAAGGG AGGCCCCATC ATCATGGCCC AGGTGGAAAA TGAATACGGC TCCTATGGTT CTGACAAGGA TTACGTGAAA AAGCATCTGG ACGTTATCCG GAAAGAACTT CCGGGAGTTG TTCCCTTCAC GTCGGACGGC CCGAACGACT GGATGATCAA GAACGGCACG CTTCCGGGCG TTGTTCCCGC CATGAATTTC GGCGGCGGAG CCAAGGGCGC TTTTGCGAAT CTGGAGAAGC ACAAGGGCAA AACGCCCCGC ATCAACGGCG AATTCTGGGT GGGCTGGTTT GACCACTGGG GCAAGCCCAA GAATGGCGGC AGTACGGAAG GTTTCAACCG AGACCTGAAG TGGATGCTGG AAAATAACGT TTCCCCCAAC CTATTCATGG CGCATGGGGG GACCTCCTTC GGCTTCATGA ACGGGGCGAA CTGGGAAGGC GCCTACACGC CGGATGTAAC CAATTACGAC TACGGCGCCC CCATTTCCGA AAACGGAACC CTGACGGACC GCTACCGCAC CTTCCGCCAG ACTATTCAGG ATTATTACGG TGATACGTAC AAGCTTCCCG AACCTCCCGC CCAGCCGGAA ATGATGGAGC TGCCTCCCAT CACGTTTACG GAAACAGCCG GCATGTTCTC CCGCCTGCCG CAGCCCGTCA TCCGCAAGGA GCCCGTCCAC ATGGAAGCCT TGGGGCAAAG CCTGGGCTTC ATCCTGTACC GGACAAAGGT GAACGGCCCG GTGAAAGGAG AGCTGAAGAT GAACAACATG CAGGACCGCG CCATCGTTTA CGTGGACGGC AAGAGGCAGG GGGCGGCGGA CCGCCGTTAC AAGCAGGATT CCTGTGACAT TGTCATTCCC TCCGGACTTC ACACGGTGGA CATTTTTGTG GAAAACATGG GCCGCATCAA CTTCGGCGGC CAGATACAGG GCGAGCGCAA GGGCATCCGG GGCCCCATTA CGCTGGACGG CAAAAAGCTG GAAAACTTCC TTATCTACAA CTTCCCGTGC AAGGGGGTGG AGCTTATTCC CTTCTCCGGC AAGAAGCCGG CGGGCGACCA GCCCGTGTTC CACCGCGGGT ATTTCAACGT TTCCAATCCC AAGGATACCT ACTTGGATAT GCGGGACGGC TGGAAGAAAG GCGTCGTGTG GGTGAATGGA CGCAATCTGG GCCGCTTCTG GTTTATCGGC TCCCAGCAGG CTCTTTATTG CCCCGGAGAA TACCTGAAGC CCGGGAAAAA TGAAATCGTG GTGCTGGACG TGGACGGAGG TTCCGGCACG 
GTGAAGGGTG TGAAGGAAGC CATTTATGAA GTCAACAGGG ACCCCGCCAT GGCGGATGTC TTCCGCGTGG GCAAACCTGT GGCCCCCGCT GCCGGCCAGC TGGTGCACAA GGGTTCCTTC GCCAAGGGGG CGGACCAGCA GGAAATCAAA TTCCGCGCTC CTGTCCAGGC CCGTTACATA GCTATTGTCA GTAAAAACGC TCATGACAAC GGCCCCCATG CCGCCATTGC GGAGCTGAAC TTCCTGGATG CCTCCGGCAA TCTGCTCCCC CGCGAACAGT GGTCCGTGGT TTATGCGGAT TCCCATGAAA CGACGGGAGA AGCCGCCCAG GCGGGACTGG TGATGGACAA CCAGCCCACC ACCTACTGGC ATACCAAGTG GCAGGGGGAC AACCCCAGGC ATCCGCACAT GATCGTGCTG GATCTGGGCA AGGTGCAGAA ACTTTCAGGA TTCCGCTACC TGCCGCGCCA GGACCGGGAA AACGGCCGCA TCAAGGACTA TGAAGTCTAT GCGTCTCCCA AGCCGTTCAA GCCTGCCAAG TAA
|
Protein sequence | MKLSFFSVLL LAGHLCAAAP MPLPESNDGA RHVFSTNQEN FLMDGKPVKI ISGEMHYPRV PRQHWKDRFQ RIKAMGMNTV CTYLFWNVHE PEPGKWDFSG NLDFVEFIKE AQKAGLWVIV RPGPYVCAEW EFGGFPGWLL KDEDLKVRSQ DPRFLEPAMA YLKKVCSMLE PLQITKGGPI IMAQVENEYG SYGSDKDYVK KHLDVIRKEL PGVVPFTSDG PNDWMIKNGT LPGVVPAMNF GGGAKGAFAN LEKHKGKTPR INGEFWVGWF DHWGKPKNGG STEGFNRDLK WMLENNVSPN LFMAHGGTSF GFMNGANWEG AYTPDVTNYD YGAPISENGT LTDRYRTFRQ TIQDYYGDTY KLPEPPAQPE MMELPPITFT ETAGMFSRLP QPVIRKEPVH MEALGQSLGF ILYRTKVNGP VKGELKMNNM QDRAIVYVDG KRQGAADRRY KQDSCDIVIP SGLHTVDIFV ENMGRINFGG QIQGERKGIR GPITLDGKKL ENFLIYNFPC KGVELIPFSG KKPAGDQPVF HRGYFNVSNP KDTYLDMRDG WKKGVVWVNG RNLGRFWFIG SQQALYCPGE YLKPGKNEIV VLDVDGGSGT VKGVKEAIYE VNRDPAMADV FRVGKPVAPA AGQLVHKGSF AKGADQQEIK FRAPVQARYI AIVSKNAHDN GPHAAIAELN FLDASGNLLP REQWSVVYAD SHETTGEAAQ AGLVMDNQPT TYWHTKWQGD NPRHPHMIVL DLGKVQKLSG FRYLPRQDRE NGRIKDYEVY ASPKPFKPAK
|
| |
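As a rough illustration of how the protein sequence relates to the gene sequence, the following sketch translates short in-frame fragments copied from the record. It is not the database's own pipeline. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code; table 11 differs in which codons may act as start codons, and that does not matter here because this gene begins with ATG. The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Codon table built in the conventional TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 33 bases of the gene sequence, as shown in the record.
first_bases = "ATGAAATTAT CCTTTTTCTC CGTTCTGCTT"
last_bases = "GCGTCTCCCA AGCCGTTCAA GCCTGCCAAG TAA"

print(translate(first_bases))  # MKLSFFSVLL
print(translate(last_bases))   # ASPKPFKPAK
```

The two outputs match the first and last ten residues of the protein sequence listed above. Passing the full gene sequence to `translate` in the same way would allow a comparison against the full 780-residue protein.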