Gene Amuc_1686 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1686 
Symbol 
ID6274435 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2048214 
End bp2050556 
Gene Length2343 bp 
Protein Length780 aa 
Translation table11 
GC content57% 
IMG OID642613745 
ProductBeta-galactosidase 
Protein accessionYP_001878285 
Protein GI187736173 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1874] Beta-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00233252 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.321684 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTAT CCTTTTTCTC CGTTCTGCTT CTGGCAGGGC ATCTTTGTGC GGCTGCTCCC 
ATGCCTTTGC CGGAATCCAA TGACGGAGCC AGGCATGTCT TCTCTACTAA TCAGGAAAAT
TTTTTGATGG ACGGAAAGCC CGTCAAAATC ATTTCCGGGG AAATGCATTA TCCTCGCGTG
CCGCGCCAGC ACTGGAAGGA CAGGTTCCAG CGCATAAAGG CCATGGGCAT GAATACCGTC
TGCACTTATC TGTTCTGGAA CGTGCATGAA CCGGAACCCG GCAAATGGGA CTTTTCCGGC
AATCTGGATT TTGTGGAATT CATCAAGGAG GCGCAGAAGG CCGGCCTGTG GGTCATTGTG
CGTCCCGGGC CCTATGTGTG CGCGGAATGG GAATTCGGCG GATTTCCCGG CTGGCTGCTG
AAGGATGAAG ATTTGAAAGT CCGTTCCCAG GATCCCCGCT TCCTGGAACC GGCCATGGCT
TATCTTAAAA AAGTCTGTTC CATGCTGGAA CCTCTGCAGA TTACCAAGGG AGGCCCCATC
ATCATGGCCC AGGTGGAAAA TGAATACGGC TCCTATGGTT CTGACAAGGA TTACGTGAAA
AAGCATCTGG ACGTTATCCG GAAAGAACTT CCGGGAGTTG TTCCCTTCAC GTCGGACGGC
CCGAACGACT GGATGATCAA GAACGGCACG CTTCCGGGCG TTGTTCCCGC CATGAATTTC
GGCGGCGGAG CCAAGGGCGC TTTTGCGAAT CTGGAGAAGC ACAAGGGCAA AACGCCCCGC
ATCAACGGCG AATTCTGGGT GGGCTGGTTT GACCACTGGG GCAAGCCCAA GAATGGCGGC
AGTACGGAAG GTTTCAACCG AGACCTGAAG TGGATGCTGG AAAATAACGT TTCCCCCAAC
CTATTCATGG CGCATGGGGG GACCTCCTTC GGCTTCATGA ACGGGGCGAA CTGGGAAGGC
GCCTACACGC CGGATGTAAC CAATTACGAC TACGGCGCCC CCATTTCCGA AAACGGAACC
CTGACGGACC GCTACCGCAC CTTCCGCCAG ACTATTCAGG ATTATTACGG TGATACGTAC
AAGCTTCCCG AACCTCCCGC CCAGCCGGAA ATGATGGAGC TGCCTCCCAT CACGTTTACG
GAAACAGCCG GCATGTTCTC CCGCCTGCCG CAGCCCGTCA TCCGCAAGGA GCCCGTCCAC
ATGGAAGCCT TGGGGCAAAG CCTGGGCTTC ATCCTGTACC GGACAAAGGT GAACGGCCCG
GTGAAAGGAG AGCTGAAGAT GAACAACATG CAGGACCGCG CCATCGTTTA CGTGGACGGC
AAGAGGCAGG GGGCGGCGGA CCGCCGTTAC AAGCAGGATT CCTGTGACAT TGTCATTCCC
TCCGGACTTC ACACGGTGGA CATTTTTGTG GAAAACATGG GCCGCATCAA CTTCGGCGGC
CAGATACAGG GCGAGCGCAA GGGCATCCGG GGCCCCATTA CGCTGGACGG CAAAAAGCTG
GAAAACTTCC TTATCTACAA CTTCCCGTGC AAGGGGGTGG AGCTTATTCC CTTCTCCGGC
AAGAAGCCGG CGGGCGACCA GCCCGTGTTC CACCGCGGGT ATTTCAACGT TTCCAATCCC
AAGGATACCT ACTTGGATAT GCGGGACGGC TGGAAGAAAG GCGTCGTGTG GGTGAATGGA
CGCAATCTGG GCCGCTTCTG GTTTATCGGC TCCCAGCAGG CTCTTTATTG CCCCGGAGAA
TACCTGAAGC CCGGGAAAAA TGAAATCGTG GTGCTGGACG TGGACGGAGG TTCCGGCACG
GTGAAGGGTG TGAAGGAAGC CATTTATGAA GTCAACAGGG ACCCCGCCAT GGCGGATGTC
TTCCGCGTGG GCAAACCTGT GGCCCCCGCT GCCGGCCAGC TGGTGCACAA GGGTTCCTTC
GCCAAGGGGG CGGACCAGCA GGAAATCAAA TTCCGCGCTC CTGTCCAGGC CCGTTACATA
GCTATTGTCA GTAAAAACGC TCATGACAAC GGCCCCCATG CCGCCATTGC GGAGCTGAAC
TTCCTGGATG CCTCCGGCAA TCTGCTCCCC CGCGAACAGT GGTCCGTGGT TTATGCGGAT
TCCCATGAAA CGACGGGAGA AGCCGCCCAG GCGGGACTGG TGATGGACAA CCAGCCCACC
ACCTACTGGC ATACCAAGTG GCAGGGGGAC AACCCCAGGC ATCCGCACAT GATCGTGCTG
GATCTGGGCA AGGTGCAGAA ACTTTCAGGA TTCCGCTACC TGCCGCGCCA GGACCGGGAA
AACGGCCGCA TCAAGGACTA TGAAGTCTAT GCGTCTCCCA AGCCGTTCAA GCCTGCCAAG
TAA
 
Protein sequence
MKLSFFSVLL LAGHLCAAAP MPLPESNDGA RHVFSTNQEN FLMDGKPVKI ISGEMHYPRV 
PRQHWKDRFQ RIKAMGMNTV CTYLFWNVHE PEPGKWDFSG NLDFVEFIKE AQKAGLWVIV
RPGPYVCAEW EFGGFPGWLL KDEDLKVRSQ DPRFLEPAMA YLKKVCSMLE PLQITKGGPI
IMAQVENEYG SYGSDKDYVK KHLDVIRKEL PGVVPFTSDG PNDWMIKNGT LPGVVPAMNF
GGGAKGAFAN LEKHKGKTPR INGEFWVGWF DHWGKPKNGG STEGFNRDLK WMLENNVSPN
LFMAHGGTSF GFMNGANWEG AYTPDVTNYD YGAPISENGT LTDRYRTFRQ TIQDYYGDTY
KLPEPPAQPE MMELPPITFT ETAGMFSRLP QPVIRKEPVH MEALGQSLGF ILYRTKVNGP
VKGELKMNNM QDRAIVYVDG KRQGAADRRY KQDSCDIVIP SGLHTVDIFV ENMGRINFGG
QIQGERKGIR GPITLDGKKL ENFLIYNFPC KGVELIPFSG KKPAGDQPVF HRGYFNVSNP
KDTYLDMRDG WKKGVVWVNG RNLGRFWFIG SQQALYCPGE YLKPGKNEIV VLDVDGGSGT
VKGVKEAIYE VNRDPAMADV FRVGKPVAPA AGQLVHKGSF AKGADQQEIK FRAPVQARYI
AIVSKNAHDN GPHAAIAELN FLDASGNLLP REQWSVVYAD SHETTGEAAQ AGLVMDNQPT
TYWHTKWQGD NPRHPHMIVL DLGKVQKLSG FRYLPRQDRE NGRIKDYEVY ASPKPFKPAK