Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_0565 |
Symbol | |
ID | 6275592 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 664189 |
End bp | 665817 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642612614 |
Product | sulfatase |
Protein accession | YP_001877183 |
Protein GI | 187735071 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 0.776698 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATATTCC ATATTCCTTT CATGTGCTCT ACCCTGCGTT CCTCCATCAT GTCCAGCATA GCCTTCTGGT TGCTTACCCT TGTCCCTTCA CTGGCGGACA GGCCTAATAT CGTCCTTATC CTGGCTGATG ACATGGGATG GTCCGACCCG GGCTGCTACG GTTCGGAAAT ACCCACTCCA GCCCTGGATA CCCTGGCCAG ACAGGGGATG CTGGCAACCC GGCTCTACAC CGCTTCCCGC TGTTCACCCT CCCGGGCCTC CATCATGACC GGCTGCGAAC CTCACAAGGT GGACGTAGGG CTCCTGGATG ACGACAGCGG GCGCCCCGGC TACCGCGGAC GCCTGAACCC GGGTATCCCC ACCCTGCCGG AACTTCTGAA AAAAGCGGGA TACCGTACCT ATCTTTCCGG GAAATGGCAT CTTGGAAAAG TTCGGGGATC CTACCCATGG GACCGCGGCT TCGACCGTTC CCGCGGTTTG CTGGGTGGAG CGGCAGATTA CTACAGGCCC ATGCCGGACA GCCCCTTCGG TGAAAACGGG AAACTGCTCC GTCCGGAGGA TCTGCCGGAG GATTTCTACA TGACGGACGA TATCACCAAA ACGGCTCTGG CCTATATTGG CGATGCCGCC AAAAGCAGGC AGCCCTTCTT CCTTTACGTG GCTTATACGG CTCCGCATAC ACCCCTCCAG GCCCCCCGGA GGGAAATAGA AAAAATGCTC CCGTTCTACA ACGGCAAATC CCCCCATGCC ATTGCTTCCA AAAGACTGGA AAAACAAAAG CTGCTGGGAA TCGTCCCTCC TGCCGCCAAA CTGGGCATGG CCGGCAAATT CAATCCAGAA GGCTATGAAA AAACTTCCGC AAAGCGGAAG GATTATATTG CCGAATGCAT GGCCACCTAC GCCGCCCAAA TCGTTATCAT GGACCGCGGC ATAGGCCGCA TTCTCGCGTC CCTAGAACGT CACCGCCTCA GTGACAATAC CATCGTCATG TTTTTATCAG ACAATGGCGC AACAGCGGAA ATGCCCCAGA ACAATAAAAA CAAGAAGACT ACCCTCCCCA CAGGCCCGCT GGGAGAAGTC GGATGCAGGG ACGGATACGG CCCCATGTGG GCGGCTGTGT CCAATACCCC TTACCGCCAG TATAAAATAG AAACCTTTGA CGGAGGGCTG TCCGCCCCCT TCATCATTCG CTACCCTTCC AAAATACGTC CGGGATCGCG CTACCACTCG CCTTTCCTGC TTCAGGACAT CGCCCCAACC TGCCTTGCGT GGGCCGCTCT CCCAATTCCG GCCCATATGG ACGGCAAGCC GCTCAACACC TACTGGAATA ATCCTCCGGA ACTTCCTCCG TCCAAGGTGT GGGACTTCAT TCCCAATACC TGCCCTCCCC GCACTATCTT CTGGGAACAT CAGAGGAACC GGGCGGCCCT GACAAGTCAA TTCAAGCTGG TGGCCCCCAA CCGCGGCCCC TGGCAGGTGT ACGACATCAG GGACAGGACG GAACAGAACA ATCTGGCATC CAGGCACCAG ACGCTTGTAG AACAATTGTC CGCACAGTAC AGGAAATGGG CGGCGGAAAA CCATGCCGAA CGACACAGCC CGGCAGAAAA ACGGGCATAC GCGCCCTAA
|
Protein sequence | MIFHIPFMCS TLRSSIMSSI AFWLLTLVPS LADRPNIVLI LADDMGWSDP GCYGSEIPTP ALDTLARQGM LATRLYTASR CSPSRASIMT GCEPHKVDVG LLDDDSGRPG YRGRLNPGIP TLPELLKKAG YRTYLSGKWH LGKVRGSYPW DRGFDRSRGL LGGAADYYRP MPDSPFGENG KLLRPEDLPE DFYMTDDITK TALAYIGDAA KSRQPFFLYV AYTAPHTPLQ APRREIEKML PFYNGKSPHA IASKRLEKQK LLGIVPPAAK LGMAGKFNPE GYEKTSAKRK DYIAECMATY AAQIVIMDRG IGRILASLER HRLSDNTIVM FLSDNGATAE MPQNNKNKKT TLPTGPLGEV GCRDGYGPMW AAVSNTPYRQ YKIETFDGGL SAPFIIRYPS KIRPGSRYHS PFLLQDIAPT CLAWAALPIP AHMDGKPLNT YWNNPPELPP SKVWDFIPNT CPPRTIFWEH QRNRAALTSQ FKLVAPNRGP WQVYDIRDRT EQNNLASRHQ TLVEQLSAQY RKWAAENHAE RHSPAEKRAY AP
|
| |