Gene Amuc_1586 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1586 
Symbol 
ID6273640 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1907415 
End bp1909055 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content58% 
IMG OID642613646 
Producttype II secretion system protein E 
Protein accessionYP_001878187 
Protein GI187736075 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value0.562805 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTACTCCA ACGAATCATA TCTCATCGAA CTTCTTACGC AGGCCGGATA TCTGAATGAA 
GAAATTCTTC AGTACGCGCG CAGTCAAAAA TCTCCCACCC AGGACCTGGT CGATTTTCTG
ATTCAGGCCG ATTATCTTAC GGAAGACGTC ATCGCTCAGG TGGCGGCGTC AAACTCCCTG
CTCCCCGTCG TGGACCTGGG CTCCATGCAC ATCCCCCAGG AGGTGCGCGA ACTCATTACG
CCGGAACTTG CCAGGCGCTT CCGGGCCATC CCCATTTCCG ACGACGGTTT TTCCATCAAC
ATTGCTATTG ACGACCCCCT GAATCTGGAA ACCATGGACA GCCTGCCCCA GCTGATGGGG
CGGGATGTGA TTTTCAGCGT CGCCACCCAC AGCGCGGTAG AAAGCCGGTT GAATGAATTT
TACCGGGATC TGAGCGTTCC GGAAGAAACC GACGGGCTGG AAGGGGAGGA TGCTCCCATT
ATCCGGCTGG TGCAGCAGAT GCTGACGGAC GCTTTCAAAA TGAGGGCTTC CGACATCCAC
ATTGAGCCCA TGGAAAACAG GCTCCGCATC CGCTACCGCG TGGACGGCAA GCTGGTGGAA
GTGGCCACGC ATCCCAAAAA ACTGCTCAGC CCCATCATCG CTCGCCTCAA GGTAATGAGT
ACCACCATGA GCATTGCGGA AAAACGCATG CCCCAGGACG GGCGAATCCA GATGAGCATC
GGCGGCAAGC AAATCGACCT CCGTGTTTCA TCCGTCCCCA GCAACCACGG GGAAAGCATC
GTCATGCGTA TCCTGGACAA ATCCGCCCTG GTGCTGGGCC TTCCCCAGCT CGGATTTTTC
TCGGATGATG AAGCTGTGTT CGACCGCCTC ATCACGCTGC CGGACGGCAT TATCCTGGTG
ACGGGTCCTA CCGGTTCCGG TAAAACCACG ACCCTTTACG CGTGCCTGAA CCACATCAAC
CGCCCGGATA AAAAAATCAT CACGGTGGAA GACCCTGTGG AATACGAACT CTCCGGCATC
AACCAGGTAA TGGTAAAGGC GGATATCGGC ATGACCTTCG CCGCCGCCCT GCGCGCCATG
CTCCGCCAGG CTCCCAACAT CATCATGATC GGGGAAATTC GAGACATGGA AACAGCCAGC
ATCGCCATCA ACGCCTCCCT GACGGGGCAC CTCGTATTCT CCACCCTTCA CACCAATGAC
GCTCCCAGCG CCGTGGCCCG TCTGGCGGAC ATCGGCATCA AACGCTTCCT GATCGCCTCC
TCCGTCCGCG CCATCATGGC CCAGCGTCTT GTCCGCAAGC TGTGCGACCG CTGCAAGGTG
GACGGCACTC TGACGGAAAA GCAGGCGCAT ACGCTGAACA TTGACATGTC CCGCCTTGCC
CAGGGCCAAA TCAAGGCGCC CCACGGCTGC GACTTTTGCC GCGGCGGCGG ATTCAAGGGC
CGGATGGGGC TGTTTGAGAT TTTCGAAATC GACGACGAGG TGCGCCGCAT GATTAACGAA
AATCTGACTT CCCCCCAGCT GCGCCAGCGC GCCCGGGAAC TGGGCATGAG AACCTTGAGG
GAAGACGGCG TACGCAAAGT GCTGGCCGGC CTTACTTCTC CGGAAGAAGT GCTGAACGTC
ACCATGGGAG ACGCCAACTG A
 
Protein sequence
MYSNESYLIE LLTQAGYLNE EILQYARSQK SPTQDLVDFL IQADYLTEDV IAQVAASNSL 
LPVVDLGSMH IPQEVRELIT PELARRFRAI PISDDGFSIN IAIDDPLNLE TMDSLPQLMG
RDVIFSVATH SAVESRLNEF YRDLSVPEET DGLEGEDAPI IRLVQQMLTD AFKMRASDIH
IEPMENRLRI RYRVDGKLVE VATHPKKLLS PIIARLKVMS TTMSIAEKRM PQDGRIQMSI
GGKQIDLRVS SVPSNHGESI VMRILDKSAL VLGLPQLGFF SDDEAVFDRL ITLPDGIILV
TGPTGSGKTT TLYACLNHIN RPDKKIITVE DPVEYELSGI NQVMVKADIG MTFAAALRAM
LRQAPNIIMI GEIRDMETAS IAINASLTGH LVFSTLHTND APSAVARLAD IGIKRFLIAS
SVRAIMAQRL VRKLCDRCKV DGTLTEKQAH TLNIDMSRLA QGQIKAPHGC DFCRGGGFKG
RMGLFEIFEI DDEVRRMINE NLTSPQLRQR ARELGMRTLR EDGVRKVLAG LTSPEEVLNV
TMGDAN