Gene Amuc_0633 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0633 
Symbol 
ID6274179 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp744587 
End bp745624 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content53% 
IMG OID642612685 
Productglycosyl transferase family 2 
Protein accessionYP_001877251 
Protein GI187735139 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.810954 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value0.460894 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGAAAC CGTTCTTCAG CATTATTATC CCGGCCTATA ATCTGGAGAA TTATATTGCT 
GCCACCCTTC AGTCAGTACT GGTTCAAACA TTTCAGGATT TTGAGATCAT CATCGTGGAT
GACGGTTCTT CCGATGAGAC TGTTTCCATC ATCCAATCTT TTCATGACCC CAGAATTCGC
CTGGTTTCCC AAGTTAACGG CGGCGTATCG CGAGCGCGAA ACGCAGGGAT GAAGAAGGCC
GTGGGGGCTT ACATCGCTTT CCTGGACGGA GACGATTACT GGTATCCCGA GCATCTGGAG
CTGGCAGCCG ATTTTTTCAA CCGTCATCCG GAGATATTGG CCTATGCCAA CCGCTACATG
AGGGATGAAC TGGAGGCCAT CCCGCCGCGC CCTCCATCTT ATCCCGAATC TATCCGGAGA
TTGGGGATAC GGGGAGTGCT TTTCATGAAT TCCAGCAGCG TAATCCTGAA TTCGTCTCTT
GCGTCCCGGC TTCCCCCCTG GGAAGAAGCG ATGCCCTATG GGGAAGACGG CCTGTACTGG
ACACGGTGCA TGCGGGGGAC AGGCCTGATC GGGCTGGGAG GCTCCGTCAC CTCCATCTAC
AGGCAGAGAG CTTCTTCCGC CATGCATGAC GAGCATTACC AGCATGTCTC CCTCCACTCG
CTCATTGCGC CTCTGCTGAA TGAGCTTGAA GCCATGAAAA ATCCCAAATG GCAATTTGCC
GTCCATTATC TGGTCATCAG GGAATTGCAT CCCAAAAGAC TGTTATCGCT CAACGCAGAG
GAGCGGATTT CCCTGACGGG CAGGATCAGG AAAATCATGC ACCCATGCCT GAACCGGCCG
TTTTTGGACT CCTATATGAA AGCGTGTTCC GCAAGGGCAG GCATGGAACA GTCATTTTCC
GCGCTCATGG ACAGAACCAT GTTCTCCTGC AAATGGCTGG ACCGCCTGGA AAGGATGGGC
CGCTCCCTGT TTTTCCGGCT GCAAACCAAT AACGGAATGG GGGGCAAACA CCAAGATCCA
GTCCGTTCAC GCTCATGA
 
Protein sequence
MQKPFFSIII PAYNLENYIA ATLQSVLVQT FQDFEIIIVD DGSSDETVSI IQSFHDPRIR 
LVSQVNGGVS RARNAGMKKA VGAYIAFLDG DDYWYPEHLE LAADFFNRHP EILAYANRYM
RDELEAIPPR PPSYPESIRR LGIRGVLFMN SSSVILNSSL ASRLPPWEEA MPYGEDGLYW
TRCMRGTGLI GLGGSVTSIY RQRASSAMHD EHYQHVSLHS LIAPLLNELE AMKNPKWQFA
VHYLVIRELH PKRLLSLNAE ERISLTGRIR KIMHPCLNRP FLDSYMKACS ARAGMEQSFS
ALMDRTMFSC KWLDRLERMG RSLFFRLQTN NGMGGKHQDP VRSRS