Gene Amuc_0823 details


Gene Information

Locus tag: Amuc_0823
Symbol:
ID: 6274354
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 968155
End bp: 969216
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 57%
IMG OID: 642612873
Product: hypothetical protein
Protein accession: YP_001877437
Protein GI: 187735325
COG category: [S] Function unknown
COG ID: [COG3528] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
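
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch of that check. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; neither assumption is stated in the record itself.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 968155
end_bp = 969216
gene_length_bp = 1062
protein_length_aa = 353

# Assumption: start and end are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not part of the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, codons, expected_protein_aa)
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.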


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.26689
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value: 0.0377182
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATGTA CGGCTTTGTG CATGTGCGCC GCCCTGCTGC CGGTGGGCTT CCTGCAGGCA 
GGAACCCAGC AGATAGAAGC GCCGCAGGAG GGCTCCGTTA TCAGTTTCCA TCTGGAAAAC
GATATGTTCG TGGGGGATGA TGATAATTAT ACCAACGGCG TCCGCTTTGC GTGGATGTCC
GGCACCACGT CCCGGAGCCA TACGTTTTCC GGCATGCTGG GAACAGTGCT GGGCGGCACG
AACGCCTCGG ATTCCTGGCG GCGGTTCATG GGCATGAACG GTTCCGCCAA CCTGCGCCAG
CAGTGGGGTT TGGACCTGAC CCAGCTCATG TACACCCCGG AGCAGAAGGC CACCTATCCC
ATCTACAACC AGCACCCCTA TGTGGGCAAC CTGACGCTGG GGCTGACCTC CCTGGTCAAG
AATGAAGACC GGGCCAATTC CCTGGAGCTG CAACTCGGCA CCACGGGCAC GAATTCCCTC
GCCAAGGGCT CCCAGCATTT CATCCATAAG CTGTGGGGTA TGGAGCAATG GCCCGGCTGG
GCCAACCAGC TCCCCGGAGA GATGACCGCC AATTTGTTTT TCAAGCGGTA TTACCGCCTG
CGCGGACTGG AGAAGCGCTA CGGCTCCGGT TTTGAAACGG ATGCCCTGGC TTACTGGCAT
GCGGACGCCG GCACAGTAAA GGTGCAGGCG GGGGGCGGCA TGTCCTTCCG CTTCGGCTAT
AATCTGGGCA ATACTTCTCC GGAGAACAGC ATTCGCGGAG CGACCAGTGC AGCACCTCCC
TTCGTTTATA ACAGGATGTC CGTTTCCAAT TGGGGGTATT ACGGTTATAT TCATGCTGCC
GTGCGAGCCG TGGCTCATGA CCTGTATCTG GATGGTACGG TGTTTCGTTC CTCCCCCAAG
TATGTGAACA AGTATCCCGT AGTGGGAGAA TGGGGTTATG GCTTCGGCTT CCGGTACAAG
CGCTCGGAAT TGCTTTTCGG CCTGCATTAC ATGACCAAGG AATACACCCA GCAGGAATCC
ATGCAGTGTG TGGGCATTCT CCAGCTTCGG CATACTTTTT AA
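
The GC content field reported for this gene can be compared against the sequence itself. The sketch below is one possible way to compute it in Python. It assumes GC content means the fraction of G and C bases over all bases, and that the spaces and line breaks in the listing are formatting only. The function name `gc_fraction` is hypothetical, chosen for illustration.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First 10-base block of the gene sequence above, used only as a short
# illustration: 2 of its 10 bases are G or C.
first_block = "ATGAAATGTA"
print(gc_fraction(first_block))
```

Passing the full gene sequence, with its spaces and line breaks left in, gives a value that can be compared with the reported percentage.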
 
Protein sequence
MKCTALCMCA ALLPVGFLQA GTQQIEAPQE GSVISFHLEN DMFVGDDDNY TNGVRFAWMS 
GTTSRSHTFS GMLGTVLGGT NASDSWRRFM GMNGSANLRQ QWGLDLTQLM YTPEQKATYP
IYNQHPYVGN LTLGLTSLVK NEDRANSLEL QLGTTGTNSL AKGSQHFIHK LWGMEQWPGW
ANQLPGEMTA NLFFKRYYRL RGLEKRYGSG FETDALAYWH ADAGTVKVQA GGGMSFRFGY
NLGNTSPENS IRGATSAAPP FVYNRMSVSN WGYYGYIHAA VRAVAHDLYL DGTVFRSSPK
YVNKYPVVGE WGYGFGFRYK RSELLFGLHY MTKEYTQQES MQCVGILQLR HTF
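
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following Python sketch illustrates this on the first and last stretches of the gene. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs from the standard code in which start codons it allows, and this gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical, and the sketch is not the method the database used.

```python
from itertools import product

# Codons in TCAG order, paired with the standard amino-acid assignments.
# Translation table 11 uses the same assignments; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace, stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
print(translate("ATGAAATGTA CGGCTTTGTG CATGTGCGCC"))
# Last 12 bases of the gene sequence, ending in the TAA stop codon.
print(translate("CATACTTTTT AA"))
```

The first call reproduces the opening block of the protein sequence, MKCTALCMCA, and the second reproduces its final three residues, HTF.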