Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_0551 |
Symbol | |
ID | 6275295 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 651511 |
End bp | 652485 |
Gene Length | 975 bp |
Protein Length | 324 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642612601 |
Product | phage SPO1 DNA polymerase-related protein |
Protein accession | YP_001877170 |
Protein GI | 187735058 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1573] Uracil-DNA glycosylase |
TIGRFAM ID | [TIGR00758] uracil-DNA glycosylase, family 4 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.0957804 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGCCG GCCTTCCCCA ACACCTCACG CTGGATTATC TGCGCGCCCT GCTATCCCGG GGCGTGGAGA AAACCACCGT CACGGAGGAA GCCAGAATGG TCCTGAGAAA ATGGGTCATG GACGCGCGCC GGATATCCGG AAGCCCCCTC CCCGCTTCCG TTTACGCCAA GCCTGCCGCA AATGAAACAC GGCCGAAACA GCCCACCCCG GAGCAGACGC CTCCAGATTC CGTGGATGAA GCATCCTTCG GCAATGAACT CCGGGATATC CTCAACGGAG TACAGCCGCA CAGAACGGAG GAGGATGCTC CGGTACCCCG CCATATTTCC TTTGATCTGG AAGGTGAAAC GGAGGAAGAA AAACTGTCCT CCCTGCGGGA GCTGGTCGTT AACTGGCCGC CGCTCCGGAA CATGGATTCC TTGAGGGAAA CGCCCGTGTT TTCCTCCGGC AATCCCAGGG CGGACATCAT GATGGTAACG GACGCCCCCG GCCTGTATGA AGAAAAACAG GGGGTTCCCC TGGCCGGGCC TTCCGGACAA AAGCTGGACG CCATGCTGAA AGCCATGGGG CTTTCCCGTT CCGATATTTA TCTGACCCAT CTGGTCAAAT ACCGTCCGGC CCTCCCCCGG CAGCTTACCA ATAACCGCCC GCCTACAGAC CGGGAGATAG AAATTTCCCT GCCCATTCTC CGGGAGGAAA TTATGCTGGT GCGCCCGAAA GTAGTGGTGG CCCTGGGAGC AATCTCCGCC CGCGGCATCC TCCAGTCAGG AGAGACGCCT CTTTCCGCCC TGAGAGGCAC CTTCCACACA GCTTTCAACA CGCCCGTGCG CGTTACTTAC AATCCCAGTT ATCTTCTCCG CACGGAAGAT ATTTCAGAAA AGCGAAAGGT TTGGGAGGAT ATGCTGTGTG TCATGGAACA GGCAGGCCTG CCCATCTCCG AAAAACAACG TTCCTATTTC CTGCCCAAAA AGTAA
|
Protein sequence | MSAGLPQHLT LDYLRALLSR GVEKTTVTEE ARMVLRKWVM DARRISGSPL PASVYAKPAA NETRPKQPTP EQTPPDSVDE ASFGNELRDI LNGVQPHRTE EDAPVPRHIS FDLEGETEEE KLSSLRELVV NWPPLRNMDS LRETPVFSSG NPRADIMMVT DAPGLYEEKQ GVPLAGPSGQ KLDAMLKAMG LSRSDIYLTH LVKYRPALPR QLTNNRPPTD REIEISLPIL REEIMLVRPK VVVALGAISA RGILQSGETP LSALRGTFHT AFNTPVRVTY NPSYLLRTED ISEKRKVWED MLCVMEQAGL PISEKQRSYF LPKK
|
| |