Gene Amuc_1585 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1585 
Symbol 
ID6273643 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1905728 
End bp1907386 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content57% 
IMG OID642613645 
Producttype II secretion system protein E 
Protein accessionYP_001878186 
Protein GI187736074 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID[TIGR02533] general secretory pathway protein E 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value0.834386 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACACCA ACCTCACACT GGAACTTTTC ATCGGCCGGG GAATGATTGA CAAATCCCTG 
GCAAAGGACA TCAAGGAGGA AATGATCGCC TCCGGCAAGG AGCTGCCGGA AGTGCTTGCA
GACTTCGGCA TCATCGGCAG CAAGGATGAT ATCTGGCAGA TGATTGCCAG CGACCTGGGT
ACGGAATTCA TTACACTGGA CAACTTCCAG CCGGATCCGA ACGTGCAGAA CATGATGCCG
GCCACGCTCG TGCGCCTGCA CGGGGCGCTC CCTGTGCGGC ATGGTCCGGA AGGCCTGTAC
GTCTGCCTGG TGGATCCCCT GAATCCCCAG ACGGTGGAAG ACCTGCGCTT CGCCCTCGGC
CAGGACATCC ATGTTCTGGT AGCGCCGGAT TACCAGATTT CCGAACGCAT CAATGAGCTT
TATGGAGGCG AATCCGCCGC CATGTCCGAC CTGATGCAGG AGCTGAACAA CATGCAGGTC
AACAATGAGA CGGAGGACTC CGCCGCCGCT CCCGTCATCC GCTTTGTGGA CCTCGTCATT
ACGCAGGCCA TCAAGGAAAA GGCCTCCGAC ATTCACTTCG AACCTTTTGA GAAGGAATTC
AAAATCCGCT ACCGTGTGGA CGGCGCCCTG TATGAAATGC AGCCTCCCCC CGTCCACCTG
TCCGTGCCGG TCATTTCCCG CGTCAAAGTC ATGGCGAACA TGAACATCGC GGAACGCCGC
ATTCCGCAGG ACGGACGCAT CGTCAAGCAG ATAGGAAACC GTTCCGTGGA CATGCGCGTT
TCCTCCCTTC CCACTCAGTA CGGAGAATCC GTGGTGCTCC GCGTTCTGGA CCGCTCTTCC
GTCAACTTGA ACATGGACAA CCTGGGGCTT CCCGCGCATA TCCACGAATA TATTCTGGAT
ACGGTCCACA AGCCCAACGG CATTTTCATC GTTACCGGCC CCACCGGCGC CGGCAAGACA
ACTACGCTGT ATGCCGCCCT GCGTGAAATC AATACCATTG ATTCCAAGGT GCTGACGGCG
GAAGACCCTG TTGAATACGA TATTGACGGC ATCATCCAGA TTCCTATCAA TGAAGCCATC
GGCCTGGACT TCCCAATGGT GCTCCGCGCC TTCCTGCGAC AGGACCCGGA CCGTATTCTG
GTGGGGGAAA TGCGAGACAT GGCAACAGCG CAGATCGCCA TCCAGGCATC CCTGACGGGT
CACCTGGTTC TCTCCACCCT GCACACGAAC GACTCCGCCG GAGCCATTAC GCGACTGGTG
GACATGGGAT GCGAACCTTT CCTGGTGGCG GCTTCCCTGG AAGGGGTGCT TGCACAGCGC
CTGGTGCGCA CCATCTGTCC GGACTGCCGC ACGCCGTATG AACCCTCATC CACCATCCTC
TCCCAGCTTG GCGTCTCTCC CTATGAACTG GGAGACAAGC ACTTTTTCAC GGGCCGAGGC
TGTGATAAAT GCTCCAATTC CGGCTACAGG GGCCGCAAGG GGATTTATGA GCTCCTGGAT
ATTAACGATA CCCTGCGCGA CATGATTACG GATCGCGCTC CTTCCGTGGT GCTGAAGCAG
AAAGCCATTG AAATGGGCAT GTCCACGCTG CGGGAAGACG GGCTGAGAAA TATTTATGAC
GGCAACACCA CCATTGAAGA AGTGCTGAAA TATACTTAA
 
Protein sequence
MDTNLTLELF IGRGMIDKSL AKDIKEEMIA SGKELPEVLA DFGIIGSKDD IWQMIASDLG 
TEFITLDNFQ PDPNVQNMMP ATLVRLHGAL PVRHGPEGLY VCLVDPLNPQ TVEDLRFALG
QDIHVLVAPD YQISERINEL YGGESAAMSD LMQELNNMQV NNETEDSAAA PVIRFVDLVI
TQAIKEKASD IHFEPFEKEF KIRYRVDGAL YEMQPPPVHL SVPVISRVKV MANMNIAERR
IPQDGRIVKQ IGNRSVDMRV SSLPTQYGES VVLRVLDRSS VNLNMDNLGL PAHIHEYILD
TVHKPNGIFI VTGPTGAGKT TTLYAALREI NTIDSKVLTA EDPVEYDIDG IIQIPINEAI
GLDFPMVLRA FLRQDPDRIL VGEMRDMATA QIAIQASLTG HLVLSTLHTN DSAGAITRLV
DMGCEPFLVA ASLEGVLAQR LVRTICPDCR TPYEPSSTIL SQLGVSPYEL GDKHFFTGRG
CDKCSNSGYR GRKGIYELLD INDTLRDMIT DRAPSVVLKQ KAIEMGMSTL REDGLRNIYD
GNTTIEEVLK YT