Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_0943 |
Symbol | |
ID | 6274222 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 1123289 |
End bp | 1124344 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642612997 |
Product | glycosyl transferase family 2 |
Protein accession | YP_001877556 |
Protein GI | 187735444 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.799144 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 73 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCGAA CCCGCCCGCC ACAGCCCCAG GTTTCCGTCA TCGTTCCCGT TTACAATCAG GAACAATATC TTGAAGAGTG CATCAGCAGC ATCCGCCGGC AAACCCTTGC GGAGTGGGAA TGCATCATCG TCAACGACGG GTCCTCCGAC TCGTCAGGGG AAATCGCCCG GCGCTTTTCA GAGGAGGACT CAAGAATCCT CTGCCTCGAA CAGGAAAACA GGGGCGTTTC CTCCGCCCGC AATCTGGGCA TGCGGCACGC TTCCGGGCGC TATTTGTGTT TTGTTGACGG TGACGATTTC ATCGACGCGG CCTTTCTCAA ACATCTTCTG GACGCCTCGG ACCGCGGAGC AAGCGATTTG ACCGTAGCGG GAAAGCTGTT CTGCGACAGG TTTCCGCTGG ACAAAATCCC CGCCCTCCCC ACCTGCGGCA TATTTCTGCG CCGGGAGTTC CCCTTGAAAA ACAATCTGGA ATTCCCGGAA GGCATTCACC CCTGTGAGGA CGGCCTCTTC TCGCATTTCG TGCTCGCGCT GACAGAAAAA ATTTCCTTCT GTCCGGAGGC CGTTTACCAT TACCGCCAGC ATGAACAGGG CAACCACCAC CAGATACGGA AAAGAACCGC CGACATCCTG CCCATGATCC CCCGGTGGCT TTCCCTGATT GAAGAGTTTT ATGAACAACG CCATCTCTGG AAAAGAAAAG CCGGCCATCT CGTCCGCTTT ATTGAGCATG AACCATTTGA ACTGAGGCTG CTTGGCATGC CTTTCTCCCC GCCGGAGCAG GAAATACTTT ACAGCATCAT CCGGGACTTC CTGAACGCCC ATTGCACGGC CGCCGAGTGC CGGAGGGCCT CCCTGCATCT TCCTTTCCGC CTGTTGTTGC AATCCTCCGG CTTTTCAGAC TTCGGAAGAA GGCTCCGGAG GGCCGGCAAA AACACCGGAA TCCGCCGGAA GCTCCTCCAT TTCTGCCCTG TCCCCTCATG GAGGAGGAAT GGCAGGGCAC AGCTGCGGCA AGTACGCGAA CAGCTGGAGG AAATACGCAG GAATATCACG TTTTAA
|
Protein sequence | MTRTRPPQPQ VSVIVPVYNQ EQYLEECISS IRRQTLAEWE CIIVNDGSSD SSGEIARRFS EEDSRILCLE QENRGVSSAR NLGMRHASGR YLCFVDGDDF IDAAFLKHLL DASDRGASDL TVAGKLFCDR FPLDKIPALP TCGIFLRREF PLKNNLEFPE GIHPCEDGLF SHFVLALTEK ISFCPEAVYH YRQHEQGNHH QIRKRTADIL PMIPRWLSLI EEFYEQRHLW KRKAGHLVRF IEHEPFELRL LGMPFSPPEQ EILYSIIRDF LNAHCTAAEC RRASLHLPFR LLLQSSGFSD FGRRLRRAGK NTGIRRKLLH FCPVPSWRRN GRAQLRQVRE QLEEIRRNIT F
|
| |