Gene Amuc_0771 details


Gene Information

Locus tag: Amuc_0771
Symbol:
ID: 6274413
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 906706
End bp: 908637
Gene Length: 1932 bp
Protein Length: 643 aa
Translation table: 11
GC content: 56%
IMG OID: 642612822
Product: Beta-galactosidase
Protein accession: YP_001877387
Protein GI: 187735275
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1874] Beta-galactosidase
TIGRFAM ID:
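
The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below shows the arithmetic. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 906706
end_bp = 908637

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: an unspliced CDS whose final codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "1932 643"
```

Both results match the Gene Length and Protein Length fields of the record.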


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 71
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGAAGC AATATGGTTT CAGTTGGTCT GCCGCACTGA TGGCCGTCGG AGTGGGAGCA 
TTGGCATGGG CTGGCCCTGA GGCTGTCCAG AATGTACAGA AACCCGCTCT TTCCGGAGGA
CCCCCCGTTG TGTTCGGGTT CGGCGGGGAA GGGAACCAGG AGTTCATGCT GAACGGGAAA
CCCTTCCAGA TCCGCGGCGC GGAGATGCAT CCCCAGCGCA TTCCCCGGGA ATACTGGAGG
CACCGCATCA GGACCGCCAA GGCTATGGGG CTGAATACTA TTGCATTTTA TGTGTTCTGG
AATGACCATG AGCAGCCGGA CGGCAGCTTT GACTTTAAGA CGGGCAATCG GGATCTGGAA
GGGTTCCTCA AGTTATGCCA GGAGGAAGGA ATGTGGGTTT TATTCCGCCC CGGCCCCTAT
GCATGCGGGG AATGGGATCT GGGTGGGCTG CCTCATTATT TGCTGAAGGA TCCCAAGGCC
AAGTTGAGAA CTACGGAAGA CGCCAAATTC ATGAAGGCGC AGACGCGTTA TCTGGAGGCC
GTGGCTCGTG TGGCGGAGCC TTTTTTAGCC AAAAACGGGG GCCCCATTCT GATGACCCAG
CTTGAAAACG AATACGGGAG CTACCAGCGT AAAGACCGCA AGTATATGGA ATGGCTGAAG
GCGTTCTGGA GCAGGAAGGG TTTTGGCCCC TTTTACACCT CCGACGGCGC GGGAGAACAT
TTTCTGAAAG GCGTGGTGCT TCCAGGCGTG GCCGTAGGGC TGGATCCGGG GCTGAACGAC
GGCCATTGGG CAGTGGCTAA TAAATGCAAT CCGGGAGTTC CCGTTTTTTC CTCGGAAACA
TATCCGGGTT GGCTGCGGCA CTGGGGGGAG GGGAATTGGG CTCCCACCCC TGGAGTGGTC
AACCACGTCC GCTGGTTTAT GGACAAGGGG CGTTCCTTCA GCTTGTTCGT TTTCCACGGA
GGCACCAATT TCGGATTCTC GGCCGGAGCC AACAACGGAG GGCCGGGAAA ATACCAGCCG
GACCTGACGA GTTATGATTA CGGTTCTCCC GTGGATGAGC AGGGGCGGAT GAATGAATAT
TATGCCCAGA TGAGGGAAAT CATTTTGAAA AAGTTGCCTC CCGAAGCCGC TGTGCCGGAA
CCTCCCGCAG ACATTCCGGC CATGGAAATT CCGGAGTTCA CGCCCGCAGT GCATGCCGGC
CTTTGGGAGA ACCTGCCCAA GCCTTTCCGG TCCAAGTTCC CGCAGCCTCC CTATTTTGAA
CAATGGAACC AGAACCAGGG TATTGCCGTT TACAGAACGG CCGTTCCGTC AGGACCGCCT
GAAACGCTGG AATTTACCAA TGTCAATGAC TATGCCCAGG TGTATCTGGA TGGAGAGCTG
GTCGGCACGC TGGATCGGCG GCTGGGGCAG AAGAGCGTGA AACTGCCGGA GCGCAGGAAG
CCGGGGACGC TGGAAGTTCT GGTAGAGGCC ATGGGACATA TTAATTTTCA TATCAGCATG
GAGAGTGACC GCAAGGGGAT TTACGGTCCT GTGAAGCTGG GAACGCGGGA GTTAAAGAAC
TGGACGGTGA GGGCGCTTCC CCTGAAAGCT GATTCCATTG TGCGGGCTCC CAAAGGAAAG
GGGCCTTCCC AGAAACGGGA AGGGGCGCAT TTCCGGGCCG TTGTAAATAT TGAAGAGCCT
CAGGACACGT TTCTGGATAT GTCCCGCTAT GTCAAGGGGT ATGTATGGGT GAACGGAATC
AACGTGGGGC GCTATTGGAA TGTGGGACCT CAGTTAAGGC TGTATGTCCC GGCCCCATTC
CTGAAAAAAG GGGAGAATGT GATTGATATT CTGGACCTGC ACGAAAAGGA GCCCAAGCCT
GTCCGCGGCA TGAAGGAACG CAACAAGGAA CCCGGAAAGA TAAATACCAA AAACCTGGAC
AACCAGTGGT AA
 
Protein sequence
MMKQYGFSWS AALMAVGVGA LAWAGPEAVQ NVQKPALSGG PPVVFGFGGE GNQEFMLNGK 
PFQIRGAEMH PQRIPREYWR HRIRTAKAMG LNTIAFYVFW NDHEQPDGSF DFKTGNRDLE
GFLKLCQEEG MWVLFRPGPY ACGEWDLGGL PHYLLKDPKA KLRTTEDAKF MKAQTRYLEA
VARVAEPFLA KNGGPILMTQ LENEYGSYQR KDRKYMEWLK AFWSRKGFGP FYTSDGAGEH
FLKGVVLPGV AVGLDPGLND GHWAVANKCN PGVPVFSSET YPGWLRHWGE GNWAPTPGVV
NHVRWFMDKG RSFSLFVFHG GTNFGFSAGA NNGGPGKYQP DLTSYDYGSP VDEQGRMNEY
YAQMREIILK KLPPEAAVPE PPADIPAMEI PEFTPAVHAG LWENLPKPFR SKFPQPPYFE
QWNQNQGIAV YRTAVPSGPP ETLEFTNVND YAQVYLDGEL VGTLDRRLGQ KSVKLPERRK
PGTLEVLVEA MGHINFHISM ESDRKGIYGP VKLGTRELKN WTVRALPLKA DSIVRAPKGK
GPSQKREGAH FRAVVNIEEP QDTFLDMSRY VKGYVWVNGI NVGRYWNVGP QLRLYVPAPF
LKKGENVIDI LDLHEKEPKP VRGMKERNKE PGKINTKNLD NQW
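
The protein sequence corresponds to the gene sequence read codon by codon. As a minimal sketch, the Python snippet below translates the first and last lines of the gene sequence and reproduces the matching ends of the protein sequence. It assumes the codon-to-amino-acid assignments of translation table 11, which to our understanding match the standard genetic code (the tables differ in their permitted start codons). The `translate` helper and the variable names are hypothetical and are not part of the record.

```python
from itertools import product

# Codon assignments in TCAG order; '*' marks a stop codon.
# Assumption: translation table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces used for grouping
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGATGAAGC AATATGGTTT CAGTTGGTCT GCCGCACTGA TGGCCGTCGG AGTGGGAGCA"
last_line = "AACCAGTGGT AA"

print(translate(first_line))  # prints "MMKQYGFSWSAALMAVGVGA"
print(translate(last_line))   # prints "NQW"
```

Both outputs match the start and the end of the protein sequence listed above. The last line begins on a codon boundary because the preceding 1920 bases are a multiple of three.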