Gene Amuc_2090 details


Gene Information

Locus tag: Amuc_2090
Symbol:
ID: 6275638
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 2541168
End bp: 2542343
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 60%
IMG OID: 642614152
Product: glycosyl transferase group 1
Protein accession: YP_001878680
Protein GI: 187736568
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
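
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS whose last codon is a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the terminal stop codon


length = gene_length_bp(2541168, 2542343)
print(length, protein_length_aa(length))  # 1176 391
```

Under these assumptions the result matches the record: 1176 bp and 391 aa.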


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 61
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACATCC TCTTTTTGCT CGGAAAATTC CCTTCCATAG GCGGCGTGGA AACCGTCACC 
GCCATTTTGG CGAATGAATT CTCCGCCCGG GGCCATGCGG TTCATGTGGT TTCCTTTGAA
CAGGTAACGG AAAAGCCCTC CCCAGCGCTG GACGAACGGG TCACGCTGCA CCGGCTGAGC
TACCCCGTTT CCAGCCGGTC CAACCGGAAT GCCCTGCGGG ACATCCTGGC CACCTGCCGC
ATTGACGTCA TCATCAACCA GTGGTGCCTG CCCTTCCACG TCACCAGGCT GTGCCGGAAA
GCCATGAGGG GCCTGCCCTG CCGCCTGCTG GCCGTCCACC ATAACGCCCC GGACTGCAAC
GCACGGCTGG AAGGCCTCAG GATGCGCATG GCCCGGACGG GAAACCCGGT GAACAGGGCG
TCCCTGCGCC TTCTGCTGAA AGGCTGCGCC ATGGCTACCG GGGCCAGCCT GCGTTACGTT
TACGCCCACA GCGACCGTTA CATCCTCCTT TCAGACAGTT TCCACCAGGC CTTCCGGAAC
ATCACCGGAC TGAAAGACAC CGGAAAACTT CTGACGATTC CCAACCCCAT TACCGTGGAA
AACCCGGAAT TCCGCTATGA ACCGGGCCTC AAGAAAAAGG AGGTTCTTTT TGTCGGACGG
CTGGAACCCA ACCAGAAACG GGTCTCCCGC GTGCTGGAAA CGTGGGCGCT GCTGGAACCC
TGCTTCCCGG ACTGGACTCT CCGCCTGGTG GGTGACGGGC CGGAAAAACG CTCCCTTCAG
GAATTCTGCG AGGAACACCG CCTGAAGCAC GTCTCCTTTG AAGGCTTCCA AAATCCTGCC
CCGTATTACG AACAGGCCTC CCTGCTCTTT TTAACCTCGG AATATGAAGG ACTTCCTCTC
GCTATGGTGG AAGCTATGTC CTTCGGCGTC TCCCCCATTG TTTACGGAAG CTTCTCCGCC
GCCTATGACC TGGTGGACCA CGGAAAAGAC GGCTGCATCC TGCCCGCGGC CGGCGGTTTC
CAGGCGCATC GGATGGCGGA AATGGCCGCA GGGCTGATGC GGGAACCGGC CGCCCTGCGC
GCCATGGCGA GGAACGCCAT AGCCAAAAGC CGGAAATTCA CGCGGGAACA TATCATTCCC
CAGTGGGAAA AAGCTTTCCT CCCAGACGCC TCCTGA
 
Protein sequence
MNILFLLGKF PSIGGVETVT AILANEFSAR GHAVHVVSFE QVTEKPSPAL DERVTLHRLS 
YPVSSRSNRN ALRDILATCR IDVIINQWCL PFHVTRLCRK AMRGLPCRLL AVHHNAPDCN
ARLEGLRMRM ARTGNPVNRA SLRLLLKGCA MATGASLRYV YAHSDRYILL SDSFHQAFRN
ITGLKDTGKL LTIPNPITVE NPEFRYEPGL KKKEVLFVGR LEPNQKRVSR VLETWALLEP
CFPDWTLRLV GDGPEKRSLQ EFCEEHRLKH VSFEGFQNPA PYYEQASLLF LTSEYEGLPL
AMVEAMSFGV SPIVYGSFSA AYDLVDHGKD GCILPAAGGF QAHRMAEMAA GLMREPAALR
AMARNAIAKS RKFTREHIIP QWEKAFLPDA S
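
The gene and protein sequences above can be cross-checked against each other. The following is a minimal sketch, not the database's own method: it strips the 10-base grouping spaces, computes a GC fraction, and translates codons until the first stop. Translation table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons; because this gene begins with ATG, the sketch does not handle alternative starts. All function names are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a grouped sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAACATCC TCTTTTTGCT CGGAAAATTC"))  # MNILFLLGKF
```

Passing the full gene sequence to `translate` should give the listed protein if the record is internally consistent, and `gc_fraction` can be compared against the reported GC content.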