Gene Amuc_0943 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0943 
Symbol 
ID6274222 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1123289 
End bp1124344 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content56% 
IMG OID642612997 
Productglycosyl transferase family 2 
Protein accessionYP_001877556 
Protein GI187735444 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.799144 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones73 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCGAA CCCGCCCGCC ACAGCCCCAG GTTTCCGTCA TCGTTCCCGT TTACAATCAG 
GAACAATATC TTGAAGAGTG CATCAGCAGC ATCCGCCGGC AAACCCTTGC GGAGTGGGAA
TGCATCATCG TCAACGACGG GTCCTCCGAC TCGTCAGGGG AAATCGCCCG GCGCTTTTCA
GAGGAGGACT CAAGAATCCT CTGCCTCGAA CAGGAAAACA GGGGCGTTTC CTCCGCCCGC
AATCTGGGCA TGCGGCACGC TTCCGGGCGC TATTTGTGTT TTGTTGACGG TGACGATTTC
ATCGACGCGG CCTTTCTCAA ACATCTTCTG GACGCCTCGG ACCGCGGAGC AAGCGATTTG
ACCGTAGCGG GAAAGCTGTT CTGCGACAGG TTTCCGCTGG ACAAAATCCC CGCCCTCCCC
ACCTGCGGCA TATTTCTGCG CCGGGAGTTC CCCTTGAAAA ACAATCTGGA ATTCCCGGAA
GGCATTCACC CCTGTGAGGA CGGCCTCTTC TCGCATTTCG TGCTCGCGCT GACAGAAAAA
ATTTCCTTCT GTCCGGAGGC CGTTTACCAT TACCGCCAGC ATGAACAGGG CAACCACCAC
CAGATACGGA AAAGAACCGC CGACATCCTG CCCATGATCC CCCGGTGGCT TTCCCTGATT
GAAGAGTTTT ATGAACAACG CCATCTCTGG AAAAGAAAAG CCGGCCATCT CGTCCGCTTT
ATTGAGCATG AACCATTTGA ACTGAGGCTG CTTGGCATGC CTTTCTCCCC GCCGGAGCAG
GAAATACTTT ACAGCATCAT CCGGGACTTC CTGAACGCCC ATTGCACGGC CGCCGAGTGC
CGGAGGGCCT CCCTGCATCT TCCTTTCCGC CTGTTGTTGC AATCCTCCGG CTTTTCAGAC
TTCGGAAGAA GGCTCCGGAG GGCCGGCAAA AACACCGGAA TCCGCCGGAA GCTCCTCCAT
TTCTGCCCTG TCCCCTCATG GAGGAGGAAT GGCAGGGCAC AGCTGCGGCA AGTACGCGAA
CAGCTGGAGG AAATACGCAG GAATATCACG TTTTAA
 
Protein sequence
MTRTRPPQPQ VSVIVPVYNQ EQYLEECISS IRRQTLAEWE CIIVNDGSSD SSGEIARRFS 
EEDSRILCLE QENRGVSSAR NLGMRHASGR YLCFVDGDDF IDAAFLKHLL DASDRGASDL
TVAGKLFCDR FPLDKIPALP TCGIFLRREF PLKNNLEFPE GIHPCEDGLF SHFVLALTEK
ISFCPEAVYH YRQHEQGNHH QIRKRTADIL PMIPRWLSLI EEFYEQRHLW KRKAGHLVRF
IEHEPFELRL LGMPFSPPEQ EILYSIIRDF LNAHCTAAEC RRASLHLPFR LLLQSSGFSD
FGRRLRRAGK NTGIRRKLLH FCPVPSWRRN GRAQLRQVRE QLEEIRRNIT F