Gene Information |
Locus tag | Amuc_1077 |
Symbol | |
ID | 6274031 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 1287274 |
End bp | 1288482 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642613128 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001877684 |
Protein GI | 187735572 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
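The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. It assumes inclusive 1-based coordinates and an unspliced CDS whose final codon is a stop codon; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes inclusive coordinates and that the last codon is a stop codon.
    """
    gene_len = end_bp - start_bp + 1          # inclusive interval
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1           # drop the stop codon
    return gene_len, protein_len


# Values taken from the Start bp / End bp rows of this record.
print(cds_lengths(1287274, 1288482))  # (1209, 402)
```

The result agrees with the Gene Length (1209 bp) and Protein Length (402 aa) rows.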
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.0000145174 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAACGCC CACGCATTCT ACATATTTTC AGCCGTTACG GCGAGGTTGG GGGGGAAGAA ATCTGCTTCC ATGCCATTAC GGAGGCTTTG GGCGCCATAG CGGATGTCAC ACCCTTTGTC TATTCAACGG AGGAGCTGTT CCATAGCCCC CACGGCGCCC TGACGAAAAT GGGTTATTTG CTCCACAACA GGGACGTGGA GCAAAAACTG CGCGAATGCC TGCGTGAAAA CCGCTATGAC GCATGGATCA TCCACAATAC GTTCCCGGCC ATGTCCCCCT GCGTCTATGA ACTGGCCCTG CATCAACCCG CTCCCGTCAT CCACTACATG CACAATTACC GCTCCGGCTG CCTCAACGGA GTATTTTACC GGGACGGAGC GCCCTGCTTT TCCTGCCAAG GCGGCAACTA TTTTCCCGGC ATCATGCACG CCTGTTGGAG GAAAAACGCC GCGTACTCAT CCCTGGCCGC CGCCGTCCTG TATAAAACGC GCCGCATGGG AGCCTGGAGC CGCTTTTCCT CCTACATTGC CATCAGCCGG CGCCAGCGGG AACTTCTCAT CCAAACCGGA ATACCGGAGG ATAAAATCAG GGTTATTCCA CATTTCATCC GGCAAAACCC CGCCCCTTCC GCCGGCCCGC CCCGCCGGGA CGTCCTTTAC GCCGGACGCC TGACGCAGGA AAAAGGAGTC CTGCAACTGG TTCAGGCGTG GGAACTCCTA TCCCCCCCCG GCCGCATTCT CTACCTGATG GGAGACGGCC CCCTGCGCGG AGAACTGGAG CGTTATATCT CTTCCCGCCA TCTTGAATCC ATCCGCCTGA CCGGGTTCAT TCCCCATGAG GAACAAGGAG CCGTCCGCGC CGCCTGCGGC CTCTCCGTAG CGCCCTCCCT CTGGGAGGAA ACCTTCGGTA TGGTTGTCCT GGAATCATGG CTCCACGGCA CGCCCGTCAT CGTTACCCCG AACGGCGGCC TGCCGGAGCT CATCACCCAC GGCAGGAATG GCTGGATTGC ACAGGAACCT TCCGTGGAAT CCCTGGCGGA GACGCTGCAC ACCGCCCTGA AGCAAGAAGA ACGCTGGCCG GCCATGGGCG CGCACGGGCA ACAACTTTTG TCCTCCACAT ACTCCCCCGC CGCATGGCTC CGGTCCATGG AAGCCCTTCT TGGCGAGCTC CGCGTTTTCC ATTCATCTTC CACCCCACCA ACATCATGA
|
Protein sequence | MQRPRILHIF SRYGEVGGEE ICFHAITEAL GAIADVTPFV YSTEELFHSP HGALTKMGYL LHNRDVEQKL RECLRENRYD AWIIHNTFPA MSPCVYELAL HQPAPVIHYM HNYRSGCLNG VFYRDGAPCF SCQGGNYFPG IMHACWRKNA AYSSLAAAVL YKTRRMGAWS RFSSYIAISR RQRELLIQTG IPEDKIRVIP HFIRQNPAPS AGPPRRDVLY AGRLTQEKGV LQLVQAWELL SPPGRILYLM GDGPLRGELE RYISSRHLES IRLTGFIPHE EQGAVRAACG LSVAPSLWEE TFGMVVLESW LHGTPVIVTP NGGLPELITH GRNGWIAQEP SVESLAETLH TALKQEERWP AMGAHGQQLL SSTYSPAAWL RSMEALLGEL RVFHSSSTPP TS
|
| |
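The gene and protein sequences above can be cross-checked with a minimal translation sketch. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TGA even though the gene lies on the minus strand) and uses the standard codon assignments, which translation table 11 shares for amino acids; table 11 differs mainly in its permitted start codons. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq):
    """Remove the whitespace used to group the sequence in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence in this record.
prefix = "ATGCAACGCC CACGCATTCT ACATATTTTC"
print(translate(prefix))  # MQRPRILHIF
```

Passing the full Gene sequence field to `translate` and `gc_percent` gives values to compare against the Protein sequence and GC content rows.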