Gene Information |
Locus tag | Amuc_1655 |
Symbol | |
ID | 6275705 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 2000025 |
End bp | 2001689 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642613714 |
Product | sulfatase |
Protein accession | YP_001878255 |
Protein GI | 187736143 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
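The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the listed values are consistent with that convention) and that the CDS ends in a single stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
START_BP = 2000025
END_BP = 2001689


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp):
    """Residue count for an unspliced CDS: total codons minus one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1665 554
```

Both results agree with the Gene Length (1665 bp) and Protein Length (554 aa) rows.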
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.789399 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.0842498 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGCCG TTTGCGGAGT ATTGTTCCTG TTTGCCGGGG CATCCGCTCT GGGCAATGGT TCCAACACTG TTTCCGGAAA GAAGCCCAAT ATCATCGTTT TCCTGGTAGA TGACATGGGA TGGCAGGATA CTTCCCACCC GTTCTGGTCC GACAGCCAGG GCAACCCGAA GAAAACCTTT TTGAACAGGC GCTACCGGAC GCCGAATATG GAGAAGCTGG CTTCCCAGGG CATGACGTTT ACGGATGCAT ATGCCCATCC CCTTTGCACT CCTTCCCGGG TGAGCCTGAT GTCCGGCATG AATCCGGCGC GGCACCGCGT GACCTGCTGG GTACGGGAAC AGAACGGAAC GACGGACGCC AACAGCAGGA GCCTCCTGCC TCCGGACTGG GCGTTGAACG GCCTTCAGCC TATAGGCACT CCCGCCAGGG GAACGACAAA ACGCCCCATT TCCGGGGAGG ATATGCGCTA TCACATGACG CGTCCTTTTG CCACGGCGGC TACACTGCCG GAGATGCTGA AGAAGTGCGG TTATGTTACC GTCCATTGCG GGAAGGCCCA TTTCGGCACG CAGGGAACTC CGGGTTCCAA TCCGTTGAAT ATGGGATTTG ATTATAATAT CGCAGGTACG GAGATCGGCC ATCCGGCGGA TTACCGCGGT TCCAGGCATT ACGGAAAGGG GTTTAACCAT GTGCGCGGAC TGGATGAGAA TAATTATTAC CAGGACGATG TATTTTTGAC GGAGGCCCTG ACGCGGGAGG CCATTAAACG CCTGGAAGCC ATCAGGACCA ATCCCAGGGA GGCTGGCAAG CCCTTTTATC TGTACATGGC CCATTACGCT TTGCATTCTC CGCTGGATGA GCGCGCTTAC GACAAGAGGT TTGCGGATGC CTACAAAAAC CCGGAGGACG GCCACAAGTG GTCCCGGACG GAGAAACATT ATTCCGGGCT GATCGAGGGG ATGGACAAAA GCCTGGGGGA TATCATGAAG TATCTCAGGG AACATCATCT GGAAAAAAAT ACCGTACTGG TGTTTATGTC GGATAACGGA GGCCTGGCCA TCTCCGGCAG ACTGGGCAAT GAAGAGGCCA ATTACCCTCT TTCCTTCGGC AAGGGGTCAT GCATGGAAGG CGGTATCCGG GAGCCTATGA TTGTTTCCTG GCCGGGCGTG ACGAAGGGCG GTTCAAGGTG TGCCGTTCCG GTGGTTATTG ACGATTTTTT CCCAACTCTT CTGGATATCG CCGGATGCCG GAACGTAGAA GTTCCGCAGA AGCTTGACGG CTTAAGCCTG GTTCCTTTGC TCAAAGGCGG CCGGTTTCCT GAAGACCGCC CCCTTTTGTT CCACCAGCCG AATAATTGGG GGGAAGGCAG CCGGCAGGCG CCTCAGTATA CTTCTTCCAC CGCATTGCGC CAGGGGGATT GGAAATTGAT TTACCGCCAC CTGACCCAGA GCTTTGAGCT GTACCATTTA AGGAAAGATA TCGGCGAGAA GGAAAACCTG GCTTCCAGGG AGCCGCGGAA AACCAGGGAA ATGGCTGTTG TCATGGGCAG GTTGCTCCGG GAGAGGAAAG CGCAGATGCC AACCTATAAG AAGGACAATG AGCTGGGCGC TCCTGCCGGG AGTTCCGTTC CGTGGCCCGA CCAGGTGAAG GGGAATGGTT TTTGA
|
Protein sequence | MSAVCGVLFL FAGASALGNG SNTVSGKKPN IIVFLVDDMG WQDTSHPFWS DSQGNPKKTF LNRRYRTPNM EKLASQGMTF TDAYAHPLCT PSRVSLMSGM NPARHRVTCW VREQNGTTDA NSRSLLPPDW ALNGLQPIGT PARGTTKRPI SGEDMRYHMT RPFATAATLP EMLKKCGYVT VHCGKAHFGT QGTPGSNPLN MGFDYNIAGT EIGHPADYRG SRHYGKGFNH VRGLDENNYY QDDVFLTEAL TREAIKRLEA IRTNPREAGK PFYLYMAHYA LHSPLDERAY DKRFADAYKN PEDGHKWSRT EKHYSGLIEG MDKSLGDIMK YLREHHLEKN TVLVFMSDNG GLAISGRLGN EEANYPLSFG KGSCMEGGIR EPMIVSWPGV TKGGSRCAVP VVIDDFFPTL LDIAGCRNVE VPQKLDGLSL VPLLKGGRFP EDRPLLFHQP NNWGEGSRQA PQYTSSTALR QGDWKLIYRH LTQSFELYHL RKDIGEKENL ASREPRKTRE MAVVMGRLLR ERKAQMPTYK KDNELGAPAG SSVPWPDQVK GNGF
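The relationship between the two sequences can be spot-checked by translating a few codons. The sketch below is illustrative only: it builds the codon table by hand rather than using a library, and it relies on the fact that NCBI translation table 11 assigns the same amino acids to the 64 codons as the standard code (the tables differ in permitted start codons, which does not matter here because this gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical. Only the first three and the last two blocks of the gene sequence are used.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (first base varies slowest,
# third base fastest). '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate an in-frame DNA string; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three and last two blocks of the gene sequence in this record.
first_blocks = "ATGTCCGCCG TTTGCGGAGT ATTGTTCCTG"
last_blocks = "GGGAATGGTT TTTGA"

print(translate(first_blocks))  # MSAVCGVLFL
print(translate(last_blocks))   # GNGF*
```

The first output matches the first block of the protein sequence, and the second matches its final four residues followed by the stop codon, which is not counted in the 554 aa protein length.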