Gene Amuc_1482 details


Gene Information

Locus tag: Amuc_1482
Symbol:
ID: 6275775
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 1772177
End bp: 1773292
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 55%
IMG OID: 642613542
Product: protein of unknown function DUF805
Protein accession: YP_001878085
Protein GI: 187735973
COG category: [S] Function unknown
COG ID: [COG3152] Predicted membrane protein
TIGRFAM ID:
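
The length fields above are consistent with the listed coordinates. A minimal sketch of that check follows, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop: bool = True) -> int:
    """Residue count for an unspliced CDS, optionally excluding one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop else codons


# Values taken from the record above.
START_BP = 1772177
END_BP = 1773292

length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1116
print(protein_length(length_bp))  # 371
```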


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.118245
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value: 0.324958
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGGAAT CTCCTGCTTC TCCTGCTTCT CCTGCTGTTT CCTCCGTTCC TCCAACCTCC 
ACGCAGGCAT CTTCCCTCCT TTCCCCCCTC TCCTGCTGGA AAAAAGGTTT TCTTCATTAT
GCGGACTTTC GGGGCTGCGC TTCCCGTGCG GAATTCTGGT GGTTCATGGC TCTTCCCCTC
CTGGCGCTGA TTCCAGCCCT GGCAGGGTAT ATCCTGACGG ACTGGCTGCA CATCCCTGAT
ACAAGACTGA GCATCTACGG AGACGCCTTA ACCATCCTCC TGTGGGCGGT GTTATTCATC
CCAAGCATAT CAGCCGCGTT CAGACGCCTG CATGATACAG GCAGGAGCGG CCTCTGGCTT
TTTTCCCTTT TCATTCCCTT CGGGCTGGGG CATCTGATCT TTTTTTATCT GACGCTAGGA
GAAAGCAAGG CGGACGGCAA CAAATACAGC CGCCGTCCGG AGCCCCAACC GGCTGATCCC
CCTGCCGGAA AACTGAAAGA GCAGCCATTG ACTCCGTTTT ACCTTTACTG GCTCATCAGC
CTGCGGAAAT TGAATACGGT GGCAGGCCGC GCGTCCCGGA CGGAATTCTG GTCCTTTTTC
CTCCTTTCCG TCCTCCTGTT CCTTCCGCTG GGCTACAGCA TGATAGACGT TGACAGCCAG
CCGGCGGGTT TTTATGTCTC TCCTTCCCTC CAAATCCTGT TATATGCCGC CCATCCGCAA
GATGCTCTGA TCCTGCTGGC TCACTCCTGC TTCAATCCCA CCTTTTACTT TTTCTACCAA
TCCGGAGAGC TGAGCATGCT TTCCCTGGAG CTTCTGGCAG CCGTGGCGGG GCTCAATATC
CTCTTCAATC TGCCGGTCGC CGTGCGCCGC CTGCATGACA GCAATCTGAG CGGAAAATTC
ATCCTGATTC CCATTCTTAT TTTCATCGTC ACTTTCCTGC TGATTTTCCT GCTGCGCCTG
GTCCCGGAGG ACATGGCCCC CTATCTGGAC TACCTGGGAA TGGTGTCCAG CCTGATGGAT
CTGCTTTCCA TCCTCTTCCT GTCCATGATG CTTCTTAAAA GCTCGCCAGG CCCCAATGAA
TACGGCGTGC TTCCGCAAAA AATAACCGTA TCCTGA
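
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any computation. The sketch below shows one way a GC percentage such as the one reported in the record could be derived from text in this layout. It assumes GC content is defined as (G + C) divided by the total number of bases; the helper names are hypothetical, and the record does not state how its own figure was calculated.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace and upper-casing."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in seq (assumed definition: (G + C) / length)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Usage: paste the block-formatted gene sequence between the triple quotes.
# Only the first block of the sequence above is shown here for illustration.
example = """
ATGCCGGAAT
"""
seq = clean_sequence(example)
print(len(seq), gc_percent(seq))
```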
 
Protein sequence
MPESPASPAS PAVSSVPPTS TQASSLLSPL SCWKKGFLHY ADFRGCASRA EFWWFMALPL 
LALIPALAGY ILTDWLHIPD TRLSIYGDAL TILLWAVLFI PSISAAFRRL HDTGRSGLWL
FSLFIPFGLG HLIFFYLTLG ESKADGNKYS RRPEPQPADP PAGKLKEQPL TPFYLYWLIS
LRKLNTVAGR ASRTEFWSFF LLSVLLFLPL GYSMIDVDSQ PAGFYVSPSL QILLYAAHPQ
DALILLAHSC FNPTFYFFYQ SGELSMLSLE LLAAVAGLNI LFNLPVAVRR LHDSNLSGKF
ILIPILIFIV TFLLIFLLRL VPEDMAPYLD YLGMVSSLMD LLSILFLSMM LLKSSPGPNE
YGVLPQKITV S