Gene Amuc_1187 details


Gene Information

Locus tag: Amuc_1187
Symbol:
ID: 6273828
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 1423063
End bp: 1424664
Gene Length: 1602 bp
Protein Length: 533 aa
Translation table: 11
GC content: 56%
IMG OID: 642613238
Product: Alpha-galactosidase
Protein accession: YP_001877793
Protein GI: 187735681
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3345] Alpha-galactosidase
TIGRFAM ID:
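
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes a single terminal stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1423063
END_BP = 1424664


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS: codons minus one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1602 533
```

Under these assumptions the result matches the reported Gene Length (1602 bp) and Protein Length (533 aa).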


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000788687
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 64
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGATTGT ACCATTTTTT ACTCCCTGCC GTTGTCAGCG CTGCCGTATC TGCGTCATTT 
GGGGCAGAGT TCCCTAATCC CTATCCTGCG CCCGCTCCCG GTGTCCGCCT GACTCCAGAG
ATTCCGCTTT CACCCTCCAT TAATGGCGCC CGTATCGTCG GGGCTACCCC CGGTTCCCGC
ATGCTGTTCC AGGTTCCCGT CTCCGGGGAG CGGCCCATGA AAATTCAGGC AACAGGGCTG
CCCCCAGGCC TGAAGATGGA TTCGCGCGGA TTGATTTCGG GTACCGCTCC GTCCGGGAAG
AGGGAATACA AGGTAAATAT CCAGGCTTCC AACAGGCATG GAAAGGACAT GAAGGAGCTG
ATTCTGAAGG TGGGGGACGA ATTGTGCCTG ACTCCGCCCA TGGGCTGGAG CAGCTGGTAT
TCCTACAGTG AGGCCGTAGG GGAGGATAAT GTGCTGAAGA CGGCACGGCT TTTTGTGGAA
CGGGGTCTGG TCAATCATGG CTGGGCCTAT ATCAACATTG ACGACTGCTG GCAGGGCAGG
CGCGGAGGGA AGTATGGCGC CATTCAACCC AATAAGCGTT TTCCTGACAT GAAGGCCATG
TGCGACGCTA TTCACGCCAT GGGCATGAAA GCGGGCATTT ATTCCACGCC TTGGATGGGA
ACGTATGCCG GTTTTATCGG AGGGAGCGCG CCCAACGCTA AGCCGGACTA CGGGGAAATG
GCCATTCCGG AAAAGGAGCG CAAGCAGGAG GATCAAATCT TTGGAAGTTA TCCGGGAGTT
CATCGCAGAA AGGCGGATCA TGTGGGAGCC GTCTGGCTGT TTGACCGTGA CGCTAAACAA
TGGGCGGATT GGGGGTTCGA TTATGTGAAA GTGGATTGGA ATCCCAACGA TGTGTCTACG
ACAAAGCGCA TCCGCAAGGC GCTGGACGAG TCCGGGAGGG ATATCGTGCT CAGCCTGTCC
AATGCCGCCC CGTACGAACA TGTGGAAGAG CTGGGCAAGC TGGCGAATTT ATGGCGGACG
ACGGGGGATA TCCAGGATCA CTGGGGCAGC GTCAGCGGCA TCGGTTTTTC CCAGGAACGC
TGGCAGAAGC ATATGCGCCC GGGACATTGG AATGATCCGG ACATCCTCCA GATCGGGAAG
CTGGGCAAAC CCAACCAGCC CAACACCACG TTTGTCCAGA CGCGGCTGAC TCCGGATGAA
CAGTACACCC ATGTGACCCT GTGGTGCCTG CTGTCCGCTC CGCTCATCGT CTCCTGTGAT
TTGGAGCATA TTGATTCGTT TACGATGGGA CTGCTTACCA ATGATGAGGT GATAGCGGTG
GATCAGGATC CGGCTGCCCG TCCCGCCCGC AAAGCGTGGC ACCAGGGGAA TTTCCAGGTG
TGGATGAAGG AGTTGTCCGA CGGTTCCGTG GCGGCTGGCT TTTTCAATAC CGGGAAGGAG
AAAGGAATTT TGAAGGTGAA TCTGAAGGAG CTGGGGCTTT CCGGAGCGTA TGAGGCAAGG
GACCTCTGGA AACGCGCTGA CCAGGGGACC GTACAGGGAG ATATGGCGGT AGAATTGAAC
GGGCATGGAG CATCCATGTT CCGGTTCAGC AAAAAGAAGT AA
 
Protein sequence
MRLYHFLLPA VVSAAVSASF GAEFPNPYPA PAPGVRLTPE IPLSPSINGA RIVGATPGSR 
MLFQVPVSGE RPMKIQATGL PPGLKMDSRG LISGTAPSGK REYKVNIQAS NRHGKDMKEL
ILKVGDELCL TPPMGWSSWY SYSEAVGEDN VLKTARLFVE RGLVNHGWAY INIDDCWQGR
RGGKYGAIQP NKRFPDMKAM CDAIHAMGMK AGIYSTPWMG TYAGFIGGSA PNAKPDYGEM
AIPEKERKQE DQIFGSYPGV HRRKADHVGA VWLFDRDAKQ WADWGFDYVK VDWNPNDVST
TKRIRKALDE SGRDIVLSLS NAAPYEHVEE LGKLANLWRT TGDIQDHWGS VSGIGFSQER
WQKHMRPGHW NDPDILQIGK LGKPNQPNTT FVQTRLTPDE QYTHVTLWCL LSAPLIVSCD
LEHIDSFTMG LLTNDEVIAV DQDPAARPAR KAWHQGNFQV WMKELSDGSV AAGFFNTGKE
KGILKVNLKE LGLSGAYEAR DLWKRADQGT VQGDMAVELN GHGASMFRFS KKK
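
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration rather than the database's own pipeline: it builds the codon table by hand, using the fact that translation table 11 shares its amino acid assignments with the standard code (the two differ in which codons may act as starts, which does not matter here because this gene begins with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical. Only the first and last blocks of the gene sequence are used as examples.

```python
from itertools import product

# Codon table laid out in TCAG order; amino acid assignments are those of
# the standard code, which table 11 shares for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(dna.count(base) for base in "GC") / len(dna)


# First 30 bases and the final line of the gene sequence in this record.
print(translate("ATGAGATTGT ACCATTTTTT ACTCCCTGCC"))  # MRLYHFLLPA
print(translate("GGGCATGGAG CATCCATGTT CCGGTTCAGC AAAAAGAAGT AA"))  # GHGASMFRFSKKK
```

Both fragments agree with the corresponding ends of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would be the way to check the whole protein and the reported GC content.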