Gene Amuc_1155 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1155 
Symbol 
ID6273868 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1386032 
End bp1387018 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content54% 
IMG OID642613206 
Productprotein of unknown function DUF323 
Protein accessionYP_001877761 
Protein GI187735649 
COG category[S] Function unknown 
COG ID[COG1262] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.0094568 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCATCA ATTTGCAAAA AGTCATTATA GCCGTATGTG CGGCCGCAAC GGTCAGTGCG 
GGATTCGCAG CCCGCACCGA GGGGGCCAAA CTGCACCGAT TCAGCACTAC TATCGAAAAA
GAGAGGCCGC AGTTGAATGA AGAAACGAAA CGGTTGATTG CTTCCTATCG CCGGGACCCG
TCCGAGGCTA ATCGGTCGGC TTTGAGGAAA CAGGTCGGGA TTAATTACGA CAAGGTCCTT
GACCGAAAAA AGGCGAAGCT TGAAGAATTG AAACGAACGG CCCGGCATGC GTCCAAGATA
CAGGAAATGC AGGAGATCGT GGATGAAATG GTTCAAAACA GGGAAGCGCG CATCGACCAA
AGCATGCGGC GTTTCAGTGA TCCGCGATTC AGACCAGGTG CCCGCTATGC AGATGGCGGG
TATTTGCCGG TGCTCGGAGC TTCCTGGAAT GTATCAATTG CATATACTCC GGTTACCAAT
GAGGATTATG CCCAATTCCT GAAGGCAACG GGAAGGAAGG CTCCTAAGGA TTGGGATAAT
GGCGCCATGC CTGCCGGCAA AGGGAGGCAT CCCGTGGTGA ATGTTTCCTA TGACGACGCT
TCTGCCTATT GCCAATGGTT GTCGCTCCAG GATAGGAAAG CCGTTTATCG CCTGCCGACT
GAAGAGGAGT GGGAGTTTGC CGCCGGGCAT ATGCCCAAGG ATGCGGATTT TAATTGCGGC
GAGAGAAACG GGACATCTCC CGTGGACGCC CATGCCGCGA CATCCGGCGC CTGCGGAGCG
ATTGATATGT GGGGCAATTG CTGGGAATGG ACTTCTTCTT CAGTTGAAGT TTCCAAGGCC
GTGGCACACG GTAAAACGGT CATGGCGGTG AAGGGAGGTT CCTGGCGTTC GCCGCGGACG
AGCTGCCGAA CTGAGCGGAA AGGAGAAGGA AGGGAATCTT CTTTCGCTTT CAGTGATGTC
GGTTTCCGGG TCGTCCGCGA AGGTTGA
 
Protein sequence
MSINLQKVII AVCAAATVSA GFAARTEGAK LHRFSTTIEK ERPQLNEETK RLIASYRRDP 
SEANRSALRK QVGINYDKVL DRKKAKLEEL KRTARHASKI QEMQEIVDEM VQNREARIDQ
SMRRFSDPRF RPGARYADGG YLPVLGASWN VSIAYTPVTN EDYAQFLKAT GRKAPKDWDN
GAMPAGKGRH PVVNVSYDDA SAYCQWLSLQ DRKAVYRLPT EEEWEFAAGH MPKDADFNCG
ERNGTSPVDA HAATSGACGA IDMWGNCWEW TSSSVEVSKA VAHGKTVMAV KGGSWRSPRT
SCRTERKGEG RESSFAFSDV GFRVVREG