Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_0939 |
Symbol | |
ID | 6274226 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 1118530 |
End bp | 1119645 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642612993 |
Product | glycosyl transferase family 8 |
Protein accession | YP_001877552 |
Protein GI | 187735440 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1442] Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 75 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTCC CCGATGCCAT GACAACCCCG CCCGTTCCCG CATCCCCGGA GAAATCACGC ATCCCCGTCA TGTTTTCCGC CACCGGCGGC TGGGGCCTGC CCCTGGGCGT AGCCATTCAC ACCCTTTGCC TTCACGCCAG TTCCGGACGC TTTTATGACA TTCACATCGT TCACGACGGA ATGGACGCGC GAATAATACA GGAGCTGAAC CAGGTTGCCG CCCCCTTTCC GCAGGTTTCC CTTTCTTTCC TGCAACTTCC GGAAGAATTC CGCCATCTCT TTCAAAACGG CAACAAGGAC CGCTACTCCC CCCTTGCGTA TGCCCGCCTG ATGGCCGGCA GCCTGTTCCC GCAGTACGGC AGGATCGTTT ATCTGGACGC AGATGTCCTG CTGGCCGGAG ACGTAGCCGA ACTGTATTTT TCCGATTTGC GGGGAGCTTC CGTAGCGGCG GCCGGAGACG GCCTGGCCCT CTGGAGCATT GAAAAAGGAA CGATGCACCC CCATCTGGAA TATATGGGCA ACTACCTTTC CTTCCCCCTT TCCTACTGCA ATTCCGGCGT CCTGGTGCTG GATCTGGACC AGATGCGCCG CCGCAACCTG GAACACCGGC TGCTCCAACA GCTCCGGAGC CGCCCGGACC CCTTCCCCTA TCCGGACCAG GACATCTTAA ATATCGCCCT GCACGGAGAC ATGACGACGC TGCCTCCGGA ATGGAACTTC CAATTCCTGT CCTGGACCTG GGATGAAGAA AAAACACGGC TCCTGCGCGG AACCGAATTT GAAAACGTTC CGACCATATC CTGCGGGCGT TCATGGAAAC TGCTGCACAT GGTAGGCCCG GAAAAACCAT GGCGGCTCCC TGACACCCCC GGAACCATGG GGCAGTTCCA CTGGATCCTG TACTCCTTTT TCTGGTGGCC GGAAGCAAAG AGGCTTCCCG TGTTCCGGGA GGAACTGGAT GCGATTTCCC AGGGGCTGGC CCCGCTCCTC CAGCGCCATA TCCGCGGCCA GCAATGGAAA CTGTTCTTCT CCCGGGGCCA TATTTTCCGG AAACGCCGGG ACAAGATCAG GTGGCTGAAA AAATTGCTGT CCATTCTTGA CGGCAGAAAA CCGTAA
|
Protein sequence | MKFPDAMTTP PVPASPEKSR IPVMFSATGG WGLPLGVAIH TLCLHASSGR FYDIHIVHDG MDARIIQELN QVAAPFPQVS LSFLQLPEEF RHLFQNGNKD RYSPLAYARL MAGSLFPQYG RIVYLDADVL LAGDVAELYF SDLRGASVAA AGDGLALWSI EKGTMHPHLE YMGNYLSFPL SYCNSGVLVL DLDQMRRRNL EHRLLQQLRS RPDPFPYPDQ DILNIALHGD MTTLPPEWNF QFLSWTWDEE KTRLLRGTEF ENVPTISCGR SWKLLHMVGP EKPWRLPDTP GTMGQFHWIL YSFFWWPEAK RLPVFREELD AISQGLAPLL QRHIRGQQWK LFFSRGHIFR KRRDKIRWLK KLLSILDGRK P
|
| |