Gene Amuc_1077 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1077 
Symbol 
ID6274031 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1287274 
End bp1288482 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content60% 
IMG OID642613128 
Productglycosyl transferase group 1 
Protein accessionYP_001877684 
Protein GI187735572 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.0000145174 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCAACGCC CACGCATTCT ACATATTTTC AGCCGTTACG GCGAGGTTGG GGGGGAAGAA 
ATCTGCTTCC ATGCCATTAC GGAGGCTTTG GGCGCCATAG CGGATGTCAC ACCCTTTGTC
TATTCAACGG AGGAGCTGTT CCATAGCCCC CACGGCGCCC TGACGAAAAT GGGTTATTTG
CTCCACAACA GGGACGTGGA GCAAAAACTG CGCGAATGCC TGCGTGAAAA CCGCTATGAC
GCATGGATCA TCCACAATAC GTTCCCGGCC ATGTCCCCCT GCGTCTATGA ACTGGCCCTG
CATCAACCCG CTCCCGTCAT CCACTACATG CACAATTACC GCTCCGGCTG CCTCAACGGA
GTATTTTACC GGGACGGAGC GCCCTGCTTT TCCTGCCAAG GCGGCAACTA TTTTCCCGGC
ATCATGCACG CCTGTTGGAG GAAAAACGCC GCGTACTCAT CCCTGGCCGC CGCCGTCCTG
TATAAAACGC GCCGCATGGG AGCCTGGAGC CGCTTTTCCT CCTACATTGC CATCAGCCGG
CGCCAGCGGG AACTTCTCAT CCAAACCGGA ATACCGGAGG ATAAAATCAG GGTTATTCCA
CATTTCATCC GGCAAAACCC CGCCCCTTCC GCCGGCCCGC CCCGCCGGGA CGTCCTTTAC
GCCGGACGCC TGACGCAGGA AAAAGGAGTC CTGCAACTGG TTCAGGCGTG GGAACTCCTA
TCCCCCCCCG GCCGCATTCT CTACCTGATG GGAGACGGCC CCCTGCGCGG AGAACTGGAG
CGTTATATCT CTTCCCGCCA TCTTGAATCC ATCCGCCTGA CCGGGTTCAT TCCCCATGAG
GAACAAGGAG CCGTCCGCGC CGCCTGCGGC CTCTCCGTAG CGCCCTCCCT CTGGGAGGAA
ACCTTCGGTA TGGTTGTCCT GGAATCATGG CTCCACGGCA CGCCCGTCAT CGTTACCCCG
AACGGCGGCC TGCCGGAGCT CATCACCCAC GGCAGGAATG GCTGGATTGC ACAGGAACCT
TCCGTGGAAT CCCTGGCGGA GACGCTGCAC ACCGCCCTGA AGCAAGAAGA ACGCTGGCCG
GCCATGGGCG CGCACGGGCA ACAACTTTTG TCCTCCACAT ACTCCCCCGC CGCATGGCTC
CGGTCCATGG AAGCCCTTCT TGGCGAGCTC CGCGTTTTCC ATTCATCTTC CACCCCACCA
ACATCATGA
 
Protein sequence
MQRPRILHIF SRYGEVGGEE ICFHAITEAL GAIADVTPFV YSTEELFHSP HGALTKMGYL 
LHNRDVEQKL RECLRENRYD AWIIHNTFPA MSPCVYELAL HQPAPVIHYM HNYRSGCLNG
VFYRDGAPCF SCQGGNYFPG IMHACWRKNA AYSSLAAAVL YKTRRMGAWS RFSSYIAISR
RQRELLIQTG IPEDKIRVIP HFIRQNPAPS AGPPRRDVLY AGRLTQEKGV LQLVQAWELL
SPPGRILYLM GDGPLRGELE RYISSRHLES IRLTGFIPHE EQGAVRAACG LSVAPSLWEE
TFGMVVLESW LHGTPVIVTP NGGLPELITH GRNGWIAQEP SVESLAETLH TALKQEERWP
AMGAHGQQLL SSTYSPAAWL RSMEALLGEL RVFHSSSTPP TS