Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1074 |
Symbol | |
ID | 6274034 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 1282440 |
End bp | 1284098 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642613125 |
Product | sulfatase |
Protein accession | YP_001877681 |
Protein GI | 187735569 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.0111181 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACCTT TGAAAACCAT CATCGCCGGA ACTCTGGCCC TGCTGGCGGC AGCTCCCCTC TCAGCTCAAA CCAAGGCTGA GGAAAATAAA AAACCGAATA TCCTCTTTAT CATTACGGAC GACCACGCCT ACCAGACGCT GGGCACCGGC AATAATGATT CCCCCGTGGC CCTGCCCAAT TTCAACAAAC TGGGACGCCA AGGCATGGTT TTTGACCGCA GCTACTGCGC CAACTCCCTG TGCGGCCCCT CCCGCGCCTG CATCCTGACC GGCAGGCATT CCCACATGAA CGGTTTTGTC TTCAACGGAC AAAGACCGCT GGACGGCTCC CAGCCCACTT ACCCGAAAAT GCTGCAGAAG GCCGGATACC AGACGGGCCT TTTCGGCAAA TGGCATCTGG AATCGGACCC CACCGGGTTC GACACGTGGG AAATCTTCCC CGGCCAGGGC AGCTACTACA ATCCGGACTT TATCAGCCTC AAGCCGGACG GCAAACGCCA GACAAAGCGT TTTCCCGGAT ATGCCACGGA CGTGGTCACG GACAAATCCA TCCAGTGGCT GGGAAACCGG GACAAGAACA AACCTTTCCT GCTCGTTGTG GGCCACAAGG CTCCCCACCG CGCCTGGTGC CCTGCTCTGC GCCACCTGGG CAAGGTGGAC ACTTCCAGCA TGACGCCGCC CGCCAACTTC CATGACGACT ATGCCAACCG TCCGGAATTC CTGAAGAAAA ACCAGCAGAC AGTCGCCAAT CACATGGCGA TTTATTCCGA CCTCAAAGTG CTTAAGGACC AGGTTCCGGA AGAAATGCGC AAAAGCATCG TTTCCCCCGG TTACGGCTGG GACCTGGGCG AGTTGAACCG CATGACTCCG GAAGAAAAGA AAACCTGGAC GGACTATTAC GCCAAGCGCA CCAAATCCCT GGTGGACGGC ATGAAATCCG GAAAACTGAA GGACCCGAAA GCGTTTGCGG AATGGAAGTG GCATGCCTAC ATGGAGGATT ATCTGGGATG CCTTCTGTCC GTGGACGACA GCATCGGCCG CCTTATGGAA TATCTGGACA AAGAGGGGAT TGCGAAAGAC ACGCTGGTCA TCTACTGCGG AGACCAGGGG TTCTACATGG GAGAACACGG CATGTACGAC AAGCGCTGGA TTTTTGAAGA ATCCCTCCGC ATGCCCCTCA TCATGAGATG GCCCGGCAAA ATTCCCGCGG GCATCCGCAA CAACACCATG GTGCAGAATA TCGACTACGC TCCCACCATC GTTTCCGCGG CAGGGGCGGA CACCCCGGAA AACATGAATA CCTTCCAGGG CGTATCCCTG CTTCCCACCG CTTTCACGGG CAAAACTCCC GACAACTGGA GGGATGCCAT TTACTACTGT TTTTACGAAA ATCCCGGCGA ACACAACGCC CCGCGCCACG ACGGCATCCG GACGGACCGC TACACGCTTT CCTACATCTG GACCAGCGAC GAATGGATGC TCTTTGACAT GAAAAAGGAT CCCATGCAAA TGAAAAACGT CATTGACGAT CCTGCCTACA AGACTACGGT GGAACAGCTC AAGAAGCGTT ACCACGAACT GCGCAAAACC TATAAAGTTC CGGAAAACAG CCCCGGAGGC AAAGGAACGC CTATCCCCAA ATTCGACGCT TCCTGGTAA
|
Protein sequence | MRPLKTIIAG TLALLAAAPL SAQTKAEENK KPNILFIITD DHAYQTLGTG NNDSPVALPN FNKLGRQGMV FDRSYCANSL CGPSRACILT GRHSHMNGFV FNGQRPLDGS QPTYPKMLQK AGYQTGLFGK WHLESDPTGF DTWEIFPGQG SYYNPDFISL KPDGKRQTKR FPGYATDVVT DKSIQWLGNR DKNKPFLLVV GHKAPHRAWC PALRHLGKVD TSSMTPPANF HDDYANRPEF LKKNQQTVAN HMAIYSDLKV LKDQVPEEMR KSIVSPGYGW DLGELNRMTP EEKKTWTDYY AKRTKSLVDG MKSGKLKDPK AFAEWKWHAY MEDYLGCLLS VDDSIGRLME YLDKEGIAKD TLVIYCGDQG FYMGEHGMYD KRWIFEESLR MPLIMRWPGK IPAGIRNNTM VQNIDYAPTI VSAAGADTPE NMNTFQGVSL LPTAFTGKTP DNWRDAIYYC FYENPGEHNA PRHDGIRTDR YTLSYIWTSD EWMLFDMKKD PMQMKNVIDD PAYKTTVEQL KKRYHELRKT YKVPENSPGG KGTPIPKFDA SW
|
| |