Gene Amuc_0643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0643 
Symbol 
ID6274155 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp755860 
End bp757449 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content49% 
IMG OID642612695 
Producthypothetical protein 
Protein accessionYP_001877261 
Protein GI187735149 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGTGGT CTGATGATTT CAGTCTGCCG GAGAACGGAG GCCACCTGGA AAAAAGCTAC 
GGATTCCAAA TCCGTACCCA TATAAAAATG TTTCGTTGTA ATAGGGTAAA TGGATGTTGT
ATTGCATGGT GGTCTATTTT CTCATTTCTG ATTTGTTATT ATTCCAGTTA TGTGTCTCCC
TATTATGAAA TTCCCGTAAG TTGTGATCCC ATTATTTATC ATCTCATGGG ACGCGGAATG
ATGGAAGGGT TGATGCCTTA TCGGGATTTG TTTGATCAAA AGGGGCCCTT CATTTTTTTG
ATATATGGCT TTTCCTACGC CTTGTGTGGT TCTTACTGGC TTGTATTCGT TCTTGAGGTA
TTCGCAGTGA CGGCCTCCAT GATCTTTTGT TACAAGGCGG CACTTCTCTT CGTATCTGGA
AAAAAAGCGT TTATCGTTTC GCTGCTCATC CTTTATTCCA TGTGCAGTTT CCACTATTAT
GCAGGAGGAG GTCATCCCTC CGAATTTATT CTGCCTTTCC AGTTTGCCGC GATTTATGGA
GTGTGCCGCC TGTATCGGGA TCCCGATCGC TTCGTAAGAA CGGGCGTGGT GTTTGGCGCG
GGCATCGGGA TGGCGCTTCT GTTGAAATTC AACTTGGCCG CTTTTTGGTT CGTTCCCTTC
CTGTATGTCC TGTATCGTGC GCGGATGACC GGGAAGGCAT GGATTTTTAC AGGAAGCGTG
GGAGGAACGC TCCTGTTGAT GCTGACGCCC TGCCTCTGGT ATTTTTACAG CCGCGGCGCG
TTGCTGGATT TATATGAAGG ATACATTGTT TTTAATGCCG GTTACGGCGC CGATGCGGTA
TCGTTCAAGG AGATGTGCCG GAATTATTCC CAATGGGTGA AAAGGGATAT CATGTATCCT
AGCATGTTGA TGTACATCGT GGGAGGAGCG GGGGTGATTT TAAGCCGGAT GCCCGCCAGG
GAAAAGATTT ATTATACAGC CGCTTTTCTG ATTACCTGCA TGGGCGTCCA GGGAAATGGA
AAGACGCATT TCAACCATTA CGCCCAGACA CTTATTCCCT TCGCTGCTGT GGGATACCTG
GTGGTGGCCA GGATGATCCA TGTGGAAGAG GTTGTTTCCC GAAAGTGGCG CCGATGGCTT
TTTCCTCTGG TTGTTGCGGG AGTGATGGCT TGCACCTTCC TGAATTGCGT GGATTTTTCA
TGCAGAAGCC GCGCCCGCGA ATGGATGTCC GCTTTTGCTC TTCCAGATAC CAGTGATGAA
CCGGAGGGGG AGAATGCTTG TGCAGTCCTT GGGCGGCATG CCGTATGGTT TTACTGTGTC
AATGGTTTGG TGCCGCCCAT ACGCACGTTT ACGATTTGCG CTTCCGACAA ACAGGGTGCA
GAACGGGAAA GGATGCGTCA GTATCAGGAA ATCCGCCGCG GAAAGATAGA CTATGTGGTG
ATAGCGGGAG AATACTTCAA GCCGGACAAC ACGGAATCTC CCTCCCTGAG GGCTATGCGG
GATTTGCTAA GTACGGAATA TTGCCGGGAA CGATCCTTTT TCAGAGAAGT GACGAACTTG
GATATTTACC GCCGTGTGAA GAGGGATTAA
 
Protein sequence
MRWSDDFSLP ENGGHLEKSY GFQIRTHIKM FRCNRVNGCC IAWWSIFSFL ICYYSSYVSP 
YYEIPVSCDP IIYHLMGRGM MEGLMPYRDL FDQKGPFIFL IYGFSYALCG SYWLVFVLEV
FAVTASMIFC YKAALLFVSG KKAFIVSLLI LYSMCSFHYY AGGGHPSEFI LPFQFAAIYG
VCRLYRDPDR FVRTGVVFGA GIGMALLLKF NLAAFWFVPF LYVLYRARMT GKAWIFTGSV
GGTLLLMLTP CLWYFYSRGA LLDLYEGYIV FNAGYGADAV SFKEMCRNYS QWVKRDIMYP
SMLMYIVGGA GVILSRMPAR EKIYYTAAFL ITCMGVQGNG KTHFNHYAQT LIPFAAVGYL
VVARMIHVEE VVSRKWRRWL FPLVVAGVMA CTFLNCVDFS CRSRAREWMS AFALPDTSDE
PEGENACAVL GRHAVWFYCV NGLVPPIRTF TICASDKQGA ERERMRQYQE IRRGKIDYVV
IAGEYFKPDN TESPSLRAMR DLLSTEYCRE RSFFREVTNL DIYRRVKRD