Gene Amuc_2151 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_2151 
Symbol 
ID6275471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2619364 
End bp2620386 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content59% 
IMG OID642614212 
Productprotein of unknown function DUF185 
Protein accessionYP_001878740 
Protein GI187736628 
COG category[S] Function unknown 
COG ID[COG1565] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.510759 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.0912925 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATACGCC TTTCAGACCA CATCGCGGCC GCTGGCGGCT GGCTCTCCCT GGAAGCTTTC 
ATGCAGCTGG CCCTGCACCA CCCGCAGGAA GGGTACTACT CCTCTTCCAT TGAAAACATC
GGTCAACGCG GGGATTTTTC CACCACCCCC ACGCTTTCCC CCATCCTCGC AAAAGCCATT
GTCGCGCACT GGAAAGAAGC CTGCTCCCGC TGCGGCAGGC GGCTGCCTCT GCTGGAAATA
GGCGCCGGCT CCGGTGCCCT GGCCGTTAAA ATCCTGGAGC AGCTGGGATT CTGGAACCGC
CTGAATACGG ATTACGTGAT TGTGGAATCT TCCCCACGTC TGCGCGAATT CCAGCACCTT
CTGCTGGGAG GCCGCGCTAA AATTTACTCC ACCATGGAAA AAGCGCTGAA ACACTGCGGA
GGCAAGGCCT TTATTTTCTC CAACGAGCTG GTGGATGCCT TCCCGGCGCG CGTATTTGAA
TACACGGAAC AGGACTGGAA AGAAGTGGGG CTTGTCGTGA AAAACGGAGC CGTCCGGGAA
GAACTGCGCC CCGTCCGGCA GCAGCCGCTT TTCTCCCATA TGCTGGAATA CGGCTCCCAG
CCGGGGCAGC GGGTGGAAAT TCACGACTCC TACGCGCGCT GGTTTACGAG CTGGCTTCCC
CTCTGGAACA TGGGCGTCAT GACGGTCATC GACTACGGGG ATGAAATGGA GCGGCTGTAC
TATCGCCGCC CCCGGGGTTC CCTGCGCGGG TACAAAAGCC ACCAGGTGCT GACGGGGGAG
GAACTGTACC GTAACCCCGG CCTCACGGAT TTGACCTGTG ACGTCAATTT TACGGACCTG
CTGGAACTAT CCCGCAACTG TCTGGGAGAC CGGGTCACTT TCATGACCCA GCGGGACTAC
CTGCTCCCCC ATGCGGAAAA CACGCCGCAG GATGCCTTTC TAACGGATGA ATACGGTGCC
GGAGAACACT TCCACGTACT CATTCAGGAA CGCCAGCGGC TGCAACCGGA AGGCACCCAG
TAA
 
Protein sequence
MIRLSDHIAA AGGWLSLEAF MQLALHHPQE GYYSSSIENI GQRGDFSTTP TLSPILAKAI 
VAHWKEACSR CGRRLPLLEI GAGSGALAVK ILEQLGFWNR LNTDYVIVES SPRLREFQHL
LLGGRAKIYS TMEKALKHCG GKAFIFSNEL VDAFPARVFE YTEQDWKEVG LVVKNGAVRE
ELRPVRQQPL FSHMLEYGSQ PGQRVEIHDS YARWFTSWLP LWNMGVMTVI DYGDEMERLY
YRRPRGSLRG YKSHQVLTGE ELYRNPGLTD LTCDVNFTDL LELSRNCLGD RVTFMTQRDY
LLPHAENTPQ DAFLTDEYGA GEHFHVLIQE RQRLQPEGTQ