Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_0637 |
Symbol | |
ID | 6274169 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 750079 |
End bp | 751239 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642612689 |
Product | glycosyl transferase group 1 |
Protein accession | YP_001877255 |
Protein GI | 187735143 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 0.948946 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGCCT ATGAAATGGC CTGCAGCCTG AGCTCCCTGG ACTGGGATAT CCACGTCCTC ACCCGCCCGC TGGATTCACC GGATGAGTCC TTCCGCTCCC ACCAATACAG CCCTCTCACA ACCCTGCCGG ACACCCTGAC GAAGGACAGT GCCCGTTTAA GGGAATACCT GGACGGGTTT CTGAAAGATT TCCGGCCGGA TGTTATTATC TACCACTCCT GGGCGGACTG GTGCCGGGAA GAATTGCTGG ACGCGGCACG GGATTCAGGC ATTCCGTTTT TCCTCCGCTC CCATGGTGCA GCTACCAATT TCCGCTCTTT TTTCCGCTTC AATTACCCTC CTTTCTTCGG CCTGAAAAAA TGGCTCTGCT CCTTTTTTCA AGTACGCAGG GATATCCTCA ACGTATGCCG GAAATCACCG TTAAACCGTC TCGTTTTTCT CGATCCTTAC GGGACACTGT TCAAGAGCTT TGATTATTAC TGCGCATCCA GAAGCAAACT TGCCCATTAC TCCTGCATTC CCAACACATT CCCGGCTCTG AAAAGAACCG CTCCTTTTTT CCGGGAAAAA TACGGACTTT CCTCCGCCCC CGTTTTTACC TGCCCTGCCG GCGCCAGCAT GAGGAAACGG CAGCTTCTCT TCATCCGCCA TGTGAAACGC TCCCGTCTGC GGCATATCAT TTTTCTTTTT CTGATTCCCC AGCACAATGC CTACGCGGAA CAAATGGAAC AAGCCATCGG GGATGACCCC AGATTCAGGA TTCTCTACAG GCTCCCCCGT TTGGAAGTAG AAGCCGCCAT TATGGAAAGC GATGCCGTTT TCCTTTACTC TTATCAGGAA CAGCAGCCCC TCTCCATTCT GGAGGCGATG TCATGCGGCG TTCCCTGGTT CGCTCCGGAC GCAGGAGCTC TTTCCACCCT GGAGGGGGGA ATCGTCCTGA AAAACACTTC CCCCTCCGTG CTGGAAAAAG CCGTGGAATC GTTGACGGAC GAAAAAACAC GCAAACTACT GGGGAGTAAA GGCCGCCGGC AATGGGAAGC CTGTTTCGCC CCCGACGCAG TAAACCAGGA ATGGGAGCAA CTGCTTTTTT CCTCCATCCG CCCGGAAGGA AAGCCTCCGT TCGCGTCTTC CATCGTCCGG GAGCATTTAC CCACCTGCTA A
|
Protein sequence | MAAYEMACSL SSLDWDIHVL TRPLDSPDES FRSHQYSPLT TLPDTLTKDS ARLREYLDGF LKDFRPDVII YHSWADWCRE ELLDAARDSG IPFFLRSHGA ATNFRSFFRF NYPPFFGLKK WLCSFFQVRR DILNVCRKSP LNRLVFLDPY GTLFKSFDYY CASRSKLAHY SCIPNTFPAL KRTAPFFREK YGLSSAPVFT CPAGASMRKR QLLFIRHVKR SRLRHIIFLF LIPQHNAYAE QMEQAIGDDP RFRILYRLPR LEVEAAIMES DAVFLYSYQE QQPLSILEAM SCGVPWFAPD AGALSTLEGG IVLKNTSPSV LEKAVESLTD EKTRKLLGSK GRRQWEACFA PDAVNQEWEQ LLFSSIRPEG KPPFASSIVR EHLPTC
|
| |