Gene Amuc_0561 details


Gene Information

Locus tag: Amuc_0561
Symbol:
ID: 6275519
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 660401
End bp: 661639
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 60%
IMG OID: 642612610
Product: sun protein
Protein accession: YP_001877179
Protein GI: 187735067
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0144] tRNA and rRNA cytosine-C5-methylases
TIGRFAM ID: [TIGR00563] ribosomal RNA small subunit methyltransferase RsmB
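
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the protein length excludes the stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(660401, 661639)
print(length_bp, protein_length(length_bp))
```

With the values from this record, the computed lengths agree with the listed Gene Length (1239 bp) and Protein Length (412 aa).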


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value: 0.494153
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAGAACA ATCCCCCCTC CCCCCGCCAG ACAGCACTAA ATTGCCTGAG GAGCTGGCAT 
GCAGGCCGCT CCTTCGCGGA AACCCTCGTG GACCGGGAAT GTTCACGAAC CGCGCTTCCA
TCGGCAGACA GGCACCTGGT GCAGGCTTTG GTCTTCAGCG TATTGCGCAA CCAGACCTGG
CTGGACCACG TCATCGGAAC CCTCCGGAAA GGCAGGCTGG ACGTGGAAGC GCGTCTTATT
CTCCAACTGG GGTTGAGCCA GCTTTTTCTG CTGGGCATGG CGGACCACGC CGCTGTGTAT
GAAACCGTGA ATCTCGCGTC CGTACGCCTG AGAGGACTGG TAAACGCTAT CCTGCGCAAC
GCTTTGCGGC GGGAGAAAAC CATTCTGGAG GAACGGGAAA AACTTCCGCT TTCCATTCAT
TATTCCACCC CCGCGTGGCT GGTACGGAGA TGGACGGAAC AAATGGGGCC GCAAATGGCC
CGCGACCTGC TCCGCTGGAA CAATACCACG CCGCGCCTGT ATGTGCGCGC CAATCCTCTG
ATGCCCATGA AAAATATTCC GGCCTCCCTC GCCCCGCTGG ACCGCGCGCC CGGCTGGTTC
TCCGTGGAAG GCCTTCTGCC GCTGGAGGAA ATTAAAACAG GCTCCCTTTA CGTAGCGGAT
CCTTCCACCC GTTATTCCAT TGATTTGCTG GCCCCACAGC CCGGAGAGGA AATTCTGGAC
GCCTGCGCCG CCCCCGGCGG CAAATCCGCC GCCATCATCG CCGCTACCGG AGGCAAAGCC
CACCTGACCG CCACGGATCT CCACGAACAC CGGCTGCCCA CCCTGAAGGA AAACCTGGAC
AGGCAGGGTT CTTCCTTCGT CAGGACGGCG CAGGCGGACT GGTCCCTTCC CTGCCGCACG
GAATGGAAGG GCCGCTTTGA CGCCGTGCTT CTGGACGTTC CCTGTTCCAA CACCGGAGTC
ATCCAACGCC GCGTGGACGT GCGCTGGCGC CTGACTCCGG AGGAAATCCG TCGCCTGACC
GCACTCCAGA AGACCATCCT GGAAAATGCC TCCCGCGCCG TCAAACCGGG CGGCAGACTG
GTTTATTCCA CCTGTTCCAT TGACGCGGAG GAAGACGGAC TGCTGATCAG GGACTTTTTG
CAGAACCATC CGGAATGGAC GCTGAAAGAA GAAAAACTTA TCCTTCCCCA CGAGGAAAAA
TCGGACGGCG CGTATGCGGC CCTTTTGATC TGTGCTTGA
 
Protein sequence
MKNNPPSPRQ TALNCLRSWH AGRSFAETLV DRECSRTALP SADRHLVQAL VFSVLRNQTW 
LDHVIGTLRK GRLDVEARLI LQLGLSQLFL LGMADHAAVY ETVNLASVRL RGLVNAILRN
ALRREKTILE EREKLPLSIH YSTPAWLVRR WTEQMGPQMA RDLLRWNNTT PRLYVRANPL
MPMKNIPASL APLDRAPGWF SVEGLLPLEE IKTGSLYVAD PSTRYSIDLL APQPGEEILD
ACAAPGGKSA AIIAATGGKA HLTATDLHEH RLPTLKENLD RQGSSFVRTA QADWSLPCRT
EWKGRFDAVL LDVPCSNTGV IQRRVDVRWR LTPEEIRRLT ALQKTILENA SRAVKPGGRL
VYSTCSIDAE EDGLLIRDFL QNHPEWTLKE EKLILPHEEK SDGAYAALLI CA
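
The protein sequence can be reproduced from the gene sequence using translation table 11. The following is a minimal sketch rather than the database's own pipeline. It assumes the gene sequence is given 5' to 3' on the coding strand, treats only ATG, GTG and TTG as initiators (a subset of the table 11 start codons), and uses hypothetical helper names (`clean`, `translate`, `gc_percent`). The sense-codon assignments of table 11 are the same as those of the standard code.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base T, C, A, G; then second; then third).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: only these three table 11 initiators are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS; an initiator codon at position 0 is read as Met."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above (60 nt).
first_line = "GTGAAGAACA ATCCCCCCTC CCCCCGCCAG ACAGCACTAA ATTGCCTGAG GAGCTGGCAT"
print(translate(first_line))
```

Applied to the first 60 nucleotides of the gene sequence, `translate` returns `MKNNPPSPRQTALNCLRSWH`, matching the first 20 residues of the protein sequence. The initial GTG is read as methionine because it is the start codon. Passing the full gene sequence to `gc_percent` gives a value to compare with the GC content field.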