Gene Amuc_0225 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0225 
Symbol 
ID6275305 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp281234 
End bp282277 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content60% 
IMG OID642612270 
ProductSel1 domain protein repeat-containing protein 
Protein accessionYP_001876849 
Protein GI187734737 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTAATGG CCATGATGAA AAAAGTTGTC TCTTTATGTG CGGCTTCCTC CGCCTTCTTT 
TTTTTGTGCG GATGTGACCG CGCCCCTTCT TCCGGGGAAA AGCGGGCTGC TTCTTCTGCG
GAACGGGAAA AGCCGTCTGC GGTGCGGAAA GAGGCGGAGA CATGGAGGGA GCGTCTGGAC
GCTGCTGTTG CCGGCTCCGT CCGCGAGAAG GAAAATGGTA GCGAGCATGT GAAAGCCCTC
AATGATGTGG CTGCCCTGTA TGCGGAGGGA TTGCAGAATG GCTGGGTTCA TCCGCTGGAT
GTGCGGGCCT GGTGTGATTC CGTGGCGGAG GCCGGTTCCG GGTATTCCGG GGAAACGGTC
ATCGGCGCCA TGTTCCTGTA TGGGACGGGA ATCAAGCGTG ATGCCGTAGC GGCCAGGGAG
TGGTTTGAGT ACGGGCTTGC CCGCCCGGGG ACCCAGCGGG GAAACGCTCT GTACATGTTG
GGAATGATGT ATTTCAAGGG GGATGGCGCG GATCAGGACC TGAATAAGGC GCTGGGGCTG
TGGCACAAGG CAGCGGATGA GGAACATCCG GCAGCCATGG GGCTGCTGGG CCGGGCTTAC
ATGGAGGGGA AGATGGGGGT TGAGAAGGAT GCCGCTTCCG GCCTGGCGCT GCTGGAGAAG
GCCGCCAACG GGGGGAATAC GCCTTCGTCC GTCTATCTGG GGAACATTTA TGCAAAGGGG
CAGGGGGTGG AGCGGGATAT GGAGCGTGCC ATGAAGTGGT ATGAACAGGC GGCTTCAGCC
GGAGACGCCC ATTCCCAGTA TATTGTGGGA CTGGCCTGTC TGGAAGGTTC CGGCGTGCCT
GTGGATGAGG GCAAGGCGTT CAGCTGGCTC CGGCTGGCCG CCGGGCAGGA CCACGTCAAC
GCCATGCTGA TGCTTTCCGT CTGCTACAGC ACAGGAAAAG GGACCCCTCA GAATGCGGAT
ATGGCGGAAG TCTGGAAAAA GAAGGCGCTT CAACTGAATG CGGAACGCGA GGGAAGTTCT
GCGCCGCAAA CGCAAAAACG TTAA
 
Protein sequence
MVMAMMKKVV SLCAASSAFF FLCGCDRAPS SGEKRAASSA EREKPSAVRK EAETWRERLD 
AAVAGSVREK ENGSEHVKAL NDVAALYAEG LQNGWVHPLD VRAWCDSVAE AGSGYSGETV
IGAMFLYGTG IKRDAVAARE WFEYGLARPG TQRGNALYML GMMYFKGDGA DQDLNKALGL
WHKAADEEHP AAMGLLGRAY MEGKMGVEKD AASGLALLEK AANGGNTPSS VYLGNIYAKG
QGVERDMERA MKWYEQAASA GDAHSQYIVG LACLEGSGVP VDEGKAFSWL RLAAGQDHVN
AMLMLSVCYS TGKGTPQNAD MAEVWKKKAL QLNAEREGSS APQTQKR