Gene Information |
Locus tag | Amuc_1972 |
Symbol | |
ID | 6274897 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 2394351 |
End bp | 2395376 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642614034 |
Product | HhH-GPD family protein |
Protein accession | YP_001878566 |
Protein GI | 187736454 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.224545 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.000205487 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACTGACG GCACCCCCCA TTTTAACATC CATGCTTTCC GGAACGCGCT GGTAGAATGG TTCAGGCGCG AAGGGAGGGA TTACCCGTGG CGGCGGACAA CGGATCCGTG GCACATCCTT GTTTCCGAGC TGATGCTGCA GCAGACTACC ATTCCCACCG TTTTGGGAAG ATATGACAGA TGGATGCGCC AGTTCCCCAC TCCGGCGCAT CTGGCTGCCG TGGACGAACA GACGGCCCTG CGCTCCTGGG AAGGGTTGGG CTATTACCGC CGCGTGCGTT CCCTCCAGGC CATCGCCAGG GAAATCGTCA ACGAATTCGG AGGGCGGTTC CCGGACAATG CGGAAGGGCT GAAACGCCTG CCGGGCATCG GCCCCTACAC GTCGGGAGCG CTCCTCTCCT TCGCCTTCAA CAAGGCGGCT CCCATTGTAG ACGCCAATGT CGCGCGCGTC CTGGCCCGCA TTGACAATTA TTCCGTTCCC GTGGATTCCA CGGAAGGCCA GAAATACCTG TGGAGCCGCG CGGAAAGCCT GGTGGATCCG GAACATGCCC GTGAATTCAA TTCAGCCATC ATGGAACTTG GGCAAACCTG CTGCAGCATC AGTTCTCCAG ACTGCCTGCT GTGCCCCGTG CGCCCCTTCT GCTCTGCGGA ACGGCCGGAA ACGCTTCCCG TCAAAAATCC CAAGCCTCAA GTCACCCGGG TGGAACACCA CGATATTCTT TACATCCGCG GCAAATCCGT CCTGCTGGCC AAATGCCCGG AAGGCAAGCG CCATGCCGGC ATGTACCGCT TCCCCCAGAG GGAGGACGAG CACACCCTTT CCCTGCCCCA TGTCCTGAAA CAAACCTACA GCATTACCCG CTACAGGGTG ACCCGCTACA TCCACCATGT GACGGATACG CCTCTCCTCA GGGAAGGAGA GGAATTCGTG CCGTTGGACA AAATCCACGG GCTGCCCATG GCCTCACCGG ACCGCAAGGC ACTGAACTCC CCCACCCTCG GCAAACTGCT GGACCATATC AGATGA
|
Protein sequence | MTDGTPHFNI HAFRNALVEW FRREGRDYPW RRTTDPWHIL VSELMLQQTT IPTVLGRYDR WMRQFPTPAH LAAVDEQTAL RSWEGLGYYR RVRSLQAIAR EIVNEFGGRF PDNAEGLKRL PGIGPYTSGA LLSFAFNKAA PIVDANVARV LARIDNYSVP VDSTEGQKYL WSRAESLVDP EHAREFNSAI MELGQTCCSI SSPDCLLCPV RPFCSAERPE TLPVKNPKPQ VTRVEHHDIL YIRGKSVLLA KCPEGKRHAG MYRFPQREDE HTLSLPHVLK QTYSITRYRV TRYIHHVTDT PLLREGEEFV PLDKIHGLPM ASPDRKALNS PTLGKLLDHI R
|
| |
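
The length fields in the record above can be cross-checked against the coordinates with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that Gene Length includes the stop codon. Both assumptions are consistent with the numbers in this record, but neither is stated by the source. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string; whitespace between blocks is ignored."""
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = cleaned.count("G") + cleaned.count("C")
    return 100.0 * gc_count / len(cleaned)


# Values copied from the record for Amuc_1972.
length = gene_length(2394351, 2395376)
print(length)                           # compare with "Gene Length"
print(expected_protein_length(length))  # compare with "Protein Length"
```

Passing the full Gene sequence field (with its space-separated 10-base blocks) to `gc_percent` gives a value to compare against the rounded GC content listed in the record.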