Gene Amuc_1584 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1584 
Symbol 
ID6273650 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1904390 
End bp1905658 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content57% 
IMG OID642613644 
Producttype II secretion system protein 
Protein accessionYP_001878185 
Protein GI187736073 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1459] Type II secretory pathway, component PulF 
TIGRFAM ID[TIGR02120] general secretion pathway protein F 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.738971 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTAAAT ATCAATACAC AGCACTTGAC CATAAAGGCG ACCAGAAAAC AGGTACCCTG 
GAGGCCAATT CCGAAGCGGA GGCCATGGAA TCCATCCGGG CGCATGGCCT GTACCCCACC
CAGATCGTAG AAGCGGGCAA GGGCAAGATT CAGCAGACGC CTGCCGCCAA GAAAAAGGCC
AAGGGAGCCA AGAAGCAAAA AGGCAAGCTG GGAGGCAAAA TCAAGGCCAA GGCTCTGATG
ATTTTCACCC GCCAGCTTGC TACGCTGATT GACGCGGGGC TTCCCCTGCT CCAGAGTTTG
AACGTGCTGG CCAAACAGGA GGCAAACCCC AACCTGCGCG TAACCATTGA GGCTCTTGGA
GATTCCGTTC AGGGCGGCTC CACCTTCTCG GAAGCCCTGG CCCAACACCC CAGAATTTTT
GACCGCCTGT TTGTCAACAT GGTAAAGGCC GGGGAACTGG GCGGTGTGCT GGAAGTCGTG
CTGAACCGTC TGGCGGAATA CCAGGAAAAG GCCCAAAAGC TGAAAAGCAA GGTGATCTCC
GCCATGGTGT ATCCCTCCAT CGTCCTGTTT ATCGCCGTAG GCATCGTGAT CTTCCTGATG
CTGGTCATCG TGCCCAAATT CAAGGCGATG TTCGCAGAAC AGAACCAGGA ACTTCCCGGT
ATTTCCGAGT TTGTGTTCGG CATCAGCGAC TGGTTCATGG CCGCCCCCTT CTTTGTGCCC
AATGCCGTCA TTCTGGCCGC GGTAGTCGCC ATCCTGTACG CTGTTTTCAC GGCCATGAGC
AAGACGCCCA ACGGACGCCG CAAGATTGAC TCCGCTCTGC TGACCATGCC GGTCATCGGC
AATGTGCAGA GCAAAAGCGC CATCGCCCGC TTCGCCCGAA CCTTCGGTAC GCTGGTGACT
TCCGGCGTCC CCATCCTCCA GGCGCTTACC ATCACGAAGG ATACCGCCGG CAACATGATC
GTGGGAGACG CCATCGGCCT CATCCATGAC TCCGTCAAGG AAGGCGAATC CGTAGTTACG
CCCATGTCCT CCTCCAAGCT TTTCCCGCCC ATGGTAATCT CCATGGTGGA CGTGGGGGAA
GAAACCGGCC AGTTGCCGGA CATGCTCCTG AAAATCGCGG ACGTGTATGA TGATGAAGTG
GACAATGCCG TGGGAGCTAT GACCTCCATG CTGGAACCCA TCATGATCGT ATTCCTGGCC
GTGGTCGTGG GCGGCATCGT GTTCGCCATG TTCCTTCCCC TCCTGCAGGT TATTGAAAAG
ATGGGATAA
 
Protein sequence
MPKYQYTALD HKGDQKTGTL EANSEAEAME SIRAHGLYPT QIVEAGKGKI QQTPAAKKKA 
KGAKKQKGKL GGKIKAKALM IFTRQLATLI DAGLPLLQSL NVLAKQEANP NLRVTIEALG
DSVQGGSTFS EALAQHPRIF DRLFVNMVKA GELGGVLEVV LNRLAEYQEK AQKLKSKVIS
AMVYPSIVLF IAVGIVIFLM LVIVPKFKAM FAEQNQELPG ISEFVFGISD WFMAAPFFVP
NAVILAAVVA ILYAVFTAMS KTPNGRRKID SALLTMPVIG NVQSKSAIAR FARTFGTLVT
SGVPILQALT ITKDTAGNMI VGDAIGLIHD SVKEGESVVT PMSSSKLFPP MVISMVDVGE
ETGQLPDMLL KIADVYDDEV DNAVGAMTSM LEPIMIVFLA VVVGGIVFAM FLPLLQVIEK
MG