Gene Amuc_0502 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0502 
Symbol 
ID6275455 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp592354 
End bp593475 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content55% 
IMG OID642612552 
Productputative transmembrane protein 
Protein accessionYP_001877121 
Protein GI187735009 
COG category[S] Function unknown 
COG ID[COG4299] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value0.326971 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTCAC TTTCCGATAC CAGGCCGCAG AGAATTGCGG CCATTGACGC CCTGCGGGGA 
TTTGACATGT TTTTCCTGAC CGGGGGCCTG GCTCTGGTTG TGGCCGGCAT CAATCTTTTT
TACGACCGGA GCCCCGAGTG GCTGGTGAAG CACAGCACGC ACGTGGCTTG GGAGGGATTC
GCCGCCTGGG ATCTGGTGAT GCCCCTCTTT TTGTTCATTG TGGGAACGGC CATGCCGTTT
TCCTTTTCCA AACGCATCGG TTCGGAACCT CTGTGGAAGA TTTACCTGAA GGTTGCCAGG
CGGGTAGTGG TGCTTTTTTT GCTGGGCATG GTGGTGCAGG GCAATCTGCT GAGTTTTGAA
CCGTCCAGGA TGTCCCTGTA CTGCAATACG CTCCAGGCCA TCGCCTCCGG CTACCTGATT
GCGGCCATTT GCCTTCTTCA TCTGTCCATC CGGTGGCAGG TAGCGGCAAC GGGGGGGCTG
CTGGCTGTGT ACTGGCTGGT CATGAAGTTT GTTTCCTTTT CTGACCCCGC GGTGGGTTCC
TGTGCGGCAG GAATGCTTGA ACCGGGGAGG AATCTGGCCC TGCTGCTGGA TAAATACCTG
ATGGGAAACT GGCAGGATGG AACGAATTAT GCGTGGATTC TGGCGCAGTT CGGTTTTGGC
GCCATGACCA TGCTCGGTCT GCTGGGCGGC CAGATTCTGA AGCGGGTGCA GGGGCACGGG
AAAAAGCTGG CGTGGCTGTT ATGTGCGGGC GCGGGCTGCC TGGCGCTGGG ATATGCCTGG
AGCCTGGATT TTCCGATGAT CAAGCGTTTG TTCACCAGTT CCATGGTATT GTGGGCGGCG
GGATGGTGCT ATTTTCTGCT GTTCCTGTTC TATCTGCTGA CGGATGTGCT GAAATTGAAC
TGGTTGACAT TCTTTTTCTC CGTAATAGGG AGCAATGCCA TTTTCGTGTA CATGTGGGTA
TCTCTGTGCC CCCCTACGGG CAATTTCTCC CGGGTATTGT TTGCCGGGTT TAGCGAGTGC
TTCGGGGATG CGGACAGGTT TGTCTTTTAC CTGTGCAATT ACGCCCTGAT TTGGGGCGTG
TTGTATTACA TGTACAAAAA CCGGACCTTC ATCAAGGTCT AG
 
Protein sequence
MSSLSDTRPQ RIAAIDALRG FDMFFLTGGL ALVVAGINLF YDRSPEWLVK HSTHVAWEGF 
AAWDLVMPLF LFIVGTAMPF SFSKRIGSEP LWKIYLKVAR RVVVLFLLGM VVQGNLLSFE
PSRMSLYCNT LQAIASGYLI AAICLLHLSI RWQVAATGGL LAVYWLVMKF VSFSDPAVGS
CAAGMLEPGR NLALLLDKYL MGNWQDGTNY AWILAQFGFG AMTMLGLLGG QILKRVQGHG
KKLAWLLCAG AGCLALGYAW SLDFPMIKRL FTSSMVLWAA GWCYFLLFLF YLLTDVLKLN
WLTFFFSVIG SNAIFVYMWV SLCPPTGNFS RVLFAGFSEC FGDADRFVFY LCNYALIWGV
LYYMYKNRTF IKV