Gene Amuc_0247 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0247 
Symbol 
ID6275266 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp304260 
End bp305948 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content54% 
IMG OID642612295 
Productribosomal protein S1 
Protein accessionYP_001876871 
Protein GI187734759 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0539] Ribosomal protein S1 
TIGRFAM ID[TIGR00717] ribosomal protein S1 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACAA CTGAACTGGC GGAACTTATT GACAGCAAGT TCCGCGAATT GCGTGAAGGT 
TCCATTGTTA CCGGAACCAT CCAAGAAATC CGTCCCCAAG TCGTTTTGGT GGACATCGGC
TACAAGTCCG AAGGCGCTAT TTCCATTTCC GAGTTTGAAG ACGAGGAAAT CGAAGTGGGG
GACCAAATTG AAGTCCTTTT GGAACGCCTC GAAAACGACG AAGGCATCGT CGTCCTTTCC
AAGGAAAAGG CCGCCCATAA GCAGAACTGG GATAAGATCG TGGGCGTGTA CCGCGATGGC
GGCCTGGTTA AGGGTAAAGT GAAGAGCGTC GTCAAGGGCG GTCTTATGGT CAATGTTGGC
GTGGAAGCTT TCCTGCCCGG TTCCCAGGTG GATATTATTC CTCCTCGCGA CCTGAACGAG
TATGTTGGAA AAGTTTACGA ATTTAAGATC GTCAAGGTAA ATGACGACCG TAAAAATATC
GTCCTTTCCC GCCGTGAGGT GATTGAAGCC GAACGCGCCG ACCAGCGCCA GCGCTTCCTT
GAAACCGTCA AGGAAGGCGA CAAGGTGGAA GGTATCGTGA AGAATATCAC GGACTTCGGC
GCTTTTGTCG ACCTCCGCGG CATGGACGGC CTGCTCCATA TCACGGATAT GAGCTGGGGC
CGCGTGAACC ATCCGAGCGA AATGCTCCAT ATCGGTCAGT CCCTGGAAGT CGTGATTCTG
GAAGTGGATC GCGAAAAGGA ACGCGTTTCC CTGGGCCTGA AGCAGATGAC AGACAACCCC
TGGGCGGATA TCGAACGCAA ATACCCGATC AATTCCCATG TCAAGGGCCG CGTGACCAAG
CTCCTGCCTT ACGGCGCCTT TGTGGAATTG GAAAAGGGCG TGGAAGGCCT AGTGCACGTT
TCCGAATTGT CCTGGGTCAA GAGAATCACC CGTCCGAGCG ATGTATTGAA GCTGGACCAG
GAAATCGAAG CCGTGGTTCT TTCCATTTCT GTGAAGGAAC AGAAGATTTC CCTCGGTGTC
CGCCAGTTGG AAGACAATCC CTGGGCGGAT ATCGAATCCC GTTTCCCGAT TGGTACCGTC
ATCAAGGGCC AGGTTCGCAA CCTTACTCCC TACGGCGCTT TTGTGGGACT GGAAGAAGGC
ATCGACGGCA TGATCCACGT GTCCGATATG AGCTGGACCC GCAAGATCAA TCATCCCTCC
GAAGTTCTCA AGAAGGGCGA CGAAGTGGAA GCCATCGTTT TGGAAATCAA GAAGGAGGAT
CAGCGCGTCT CCCTTGGTAT CAAGCAGCTT GAGTCCGATC CGTGGGAATC CATCAATGAC
CGCTTCAAGG TGGGCGATAT GGTGACTGGC CAGGTGGCCA AGATTGCCAG CTTCGGCGCC
TTTGTGAATC TGGACGGCGA TATTGACGGC CTGATTCATA TCTCCCAGTT GAGCGAAGAC
CATGTGGAAC GCGTGAAGGA TGTGATCAAG GTGGGTGATG AAATCACTGC CCGCGTGATC
AAAGTGGACA GCATCGAACG CCGTATCGGC CTTTCCATCA AGGCCGTCAA TTACGACACC
GAACAGCTCC GCCGCGAAAC CGCTTCCTTT GAAGCCCTCC GCCCGAGCAG CGATATGGTG
GGTCTGGAAC ACGCCTTCAA TCTGGCTACC CGTGAAAACG AAGAGTGGAG CCCTTCTGAA
GAGAAGTAA
 
Protein sequence
MSTTELAELI DSKFRELREG SIVTGTIQEI RPQVVLVDIG YKSEGAISIS EFEDEEIEVG 
DQIEVLLERL ENDEGIVVLS KEKAAHKQNW DKIVGVYRDG GLVKGKVKSV VKGGLMVNVG
VEAFLPGSQV DIIPPRDLNE YVGKVYEFKI VKVNDDRKNI VLSRREVIEA ERADQRQRFL
ETVKEGDKVE GIVKNITDFG AFVDLRGMDG LLHITDMSWG RVNHPSEMLH IGQSLEVVIL
EVDREKERVS LGLKQMTDNP WADIERKYPI NSHVKGRVTK LLPYGAFVEL EKGVEGLVHV
SELSWVKRIT RPSDVLKLDQ EIEAVVLSIS VKEQKISLGV RQLEDNPWAD IESRFPIGTV
IKGQVRNLTP YGAFVGLEEG IDGMIHVSDM SWTRKINHPS EVLKKGDEVE AIVLEIKKED
QRVSLGIKQL ESDPWESIND RFKVGDMVTG QVAKIASFGA FVNLDGDIDG LIHISQLSED
HVERVKDVIK VGDEITARVI KVDSIERRIG LSIKAVNYDT EQLRRETASF EALRPSSDMV
GLEHAFNLAT RENEEWSPSE EK