Gene Amuc_0551 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0551 
Symbol 
ID6275295 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp651511 
End bp652485 
Gene Length975 bp 
Protein Length324 aa 
Translation table11 
GC content58% 
IMG OID642612601 
Productphage SPO1 DNA polymerase-related protein 
Protein accessionYP_001877170 
Protein GI187735058 
COG category[L] Replication, recombination and repair 
COG ID[COG1573] Uracil-DNA glycosylase 
TIGRFAM ID[TIGR00758] uracil-DNA glycosylase, family 4 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.0957804 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGCCG GCCTTCCCCA ACACCTCACG CTGGATTATC TGCGCGCCCT GCTATCCCGG 
GGCGTGGAGA AAACCACCGT CACGGAGGAA GCCAGAATGG TCCTGAGAAA ATGGGTCATG
GACGCGCGCC GGATATCCGG AAGCCCCCTC CCCGCTTCCG TTTACGCCAA GCCTGCCGCA
AATGAAACAC GGCCGAAACA GCCCACCCCG GAGCAGACGC CTCCAGATTC CGTGGATGAA
GCATCCTTCG GCAATGAACT CCGGGATATC CTCAACGGAG TACAGCCGCA CAGAACGGAG
GAGGATGCTC CGGTACCCCG CCATATTTCC TTTGATCTGG AAGGTGAAAC GGAGGAAGAA
AAACTGTCCT CCCTGCGGGA GCTGGTCGTT AACTGGCCGC CGCTCCGGAA CATGGATTCC
TTGAGGGAAA CGCCCGTGTT TTCCTCCGGC AATCCCAGGG CGGACATCAT GATGGTAACG
GACGCCCCCG GCCTGTATGA AGAAAAACAG GGGGTTCCCC TGGCCGGGCC TTCCGGACAA
AAGCTGGACG CCATGCTGAA AGCCATGGGG CTTTCCCGTT CCGATATTTA TCTGACCCAT
CTGGTCAAAT ACCGTCCGGC CCTCCCCCGG CAGCTTACCA ATAACCGCCC GCCTACAGAC
CGGGAGATAG AAATTTCCCT GCCCATTCTC CGGGAGGAAA TTATGCTGGT GCGCCCGAAA
GTAGTGGTGG CCCTGGGAGC AATCTCCGCC CGCGGCATCC TCCAGTCAGG AGAGACGCCT
CTTTCCGCCC TGAGAGGCAC CTTCCACACA GCTTTCAACA CGCCCGTGCG CGTTACTTAC
AATCCCAGTT ATCTTCTCCG CACGGAAGAT ATTTCAGAAA AGCGAAAGGT TTGGGAGGAT
ATGCTGTGTG TCATGGAACA GGCAGGCCTG CCCATCTCCG AAAAACAACG TTCCTATTTC
CTGCCCAAAA AGTAA
 
Protein sequence
MSAGLPQHLT LDYLRALLSR GVEKTTVTEE ARMVLRKWVM DARRISGSPL PASVYAKPAA 
NETRPKQPTP EQTPPDSVDE ASFGNELRDI LNGVQPHRTE EDAPVPRHIS FDLEGETEEE
KLSSLRELVV NWPPLRNMDS LRETPVFSSG NPRADIMMVT DAPGLYEEKQ GVPLAGPSGQ
KLDAMLKAMG LSRSDIYLTH LVKYRPALPR QLTNNRPPTD REIEISLPIL REEIMLVRPK
VVVALGAISA RGILQSGETP LSALRGTFHT AFNTPVRVTY NPSYLLRTED ISEKRKVWED
MLCVMEQAGL PISEKQRSYF LPKK